77 Hadoop courses

Mastering Scala with Apache Spark for the Modern Data Enterprise (TTSK7520)

By Nexus Human

Duration: 5 Days (30 CPD hours)

This course is intended for

This intermediate and beyond level course is geared for experienced technical professionals in various roles, such as developers, data analysts, data engineers, software engineers, and machine learning engineers who want to leverage Scala and Spark to tackle complex data challenges and develop scalable, high-performance applications across diverse domains. Practical programming experience is required to participate in the hands-on labs.

Overview

Working in a hands-on learning environment led by our expert instructor you'll:

* Develop a basic understanding of Scala and Apache Spark fundamentals, enabling you to confidently create scalable and high-performance applications.
* Learn how to process large datasets efficiently, helping you handle complex data challenges and make data-driven decisions.
* Gain hands-on experience with real-time data streaming, allowing you to manage and analyze data as it flows into your applications.
* Acquire practical knowledge of machine learning algorithms using Spark MLlib, empowering you to create intelligent applications and uncover hidden insights.
* Master graph processing with GraphX, enabling you to analyze and visualize complex relationships in your data.
* Discover generative AI technologies using GPT with Spark and Scala, opening up new possibilities for automating content generation and enhancing data analysis.

Embark on a journey to master the world of big data with our immersive course on Scala and Spark! Mastering Scala with Apache Spark for the Modern Data Enterprise is a five-day hands-on course designed to provide you with the essential skills and tools to tackle complex data projects using the Scala programming language and Apache Spark, a high-performance data processing engine. Mastering these technologies will enable you to perform a wide range of tasks, from data wrangling and analytics to machine learning and artificial intelligence, across various industries and applications.

Guided by our expert instructor, you'll explore the fundamentals of Scala programming and Apache Spark while gaining valuable hands-on experience with Spark programming, RDDs, DataFrames, Spark SQL, and data sources. You'll also explore Spark Streaming, performance optimization techniques, and the integration of popular external libraries, tools, and cloud platforms like AWS, Azure, and GCP. Machine learning enthusiasts will delve into Spark MLlib, covering basics of machine learning algorithms, data preparation, feature extraction, and various techniques such as regression, classification, clustering, and recommendation systems.

INTRODUCTION TO SCALA
* Brief history and motivation
* Differences between Scala and Java
* Basic Scala syntax and constructs
* Scala's functional programming features

INTRODUCTION TO APACHE SPARK
* Overview and history
* Spark components and architecture
* Spark ecosystem
* Comparing Spark with other big data frameworks

BASICS OF SPARK PROGRAMMING
* SparkContext and SparkSession
* Resilient Distributed Datasets (RDDs)
* Transformations and Actions
* Working with DataFrames

SPARK SQL AND DATA SOURCES
* Spark SQL library and its advantages
* Structured and semi-structured data sources
* Reading and writing data in various formats (CSV, JSON, Parquet, Avro, etc.)
* Data manipulation using SQL queries

BASIC RDD OPERATIONS
* Creating and manipulating RDDs
* Common transformations and actions on RDDs
* Working with key-value data

BASIC DATAFRAME AND DATASET OPERATIONS
* Creating and manipulating DataFrames and Datasets
* Column operations and functions
* Filtering, sorting, and aggregating data

INTRODUCTION TO SPARK STREAMING
* Overview of Spark Streaming
* Discretized Stream (DStream) operations
* Windowed operations and stateful processing

PERFORMANCE OPTIMIZATION BASICS
* Best practices for efficient Spark code
* Broadcast variables and accumulators
* Monitoring Spark applications

INTEGRATING EXTERNAL LIBRARIES AND TOOLS, SPARK STREAMING
* Using popular external libraries, such as Hadoop and HBase
* Integrating with cloud platforms: AWS, Azure, GCP
* Connecting to data storage systems: HDFS, S3, Cassandra, etc.

INTRODUCTION TO MACHINE LEARNING BASICS
* Overview of machine learning
* Supervised and unsupervised learning
* Common algorithms and use cases

INTRODUCTION TO SPARK MLLIB
* Overview of Spark MLlib
* MLlib's algorithms and utilities
* Data preparation and feature extraction

LINEAR REGRESSION AND CLASSIFICATION
* Linear regression algorithm
* Logistic regression for classification
* Model evaluation and performance metrics

CLUSTERING ALGORITHMS
* Overview of clustering algorithms
* K-means clustering
* Model evaluation and performance metrics

COLLABORATIVE FILTERING AND RECOMMENDATION SYSTEMS
* Overview of recommendation systems
* Collaborative filtering techniques
* Implementing recommendations with Spark MLlib

INTRODUCTION TO GRAPH PROCESSING
* Overview of graph processing
* Use cases and applications of graph processing
* Graph representations and operations

INTRODUCTION TO SPARK GRAPHX
* Overview of GraphX
* Creating and transforming graphs
* Graph algorithms in GraphX

BIG DATA INNOVATION! USING GPT AND GENERATIVE AI TECHNOLOGIES WITH SPARK AND SCALA
* Overview of generative AI technologies
* Integrating GPT with Spark and Scala
* Practical applications and use cases

Bonus Topics / Time Permitting

INTRODUCTION TO SPARK NLP
* Overview of Spark NLP
* Preprocessing text data
* Text classification and sentiment analysis

PUTTING IT ALL TOGETHER
* Work on a capstone project that integrates multiple aspects of the course, including data processing, machine learning, graph processing, and generative AI technologies.

Delivered on request, online
Price on Enquiry

Snowflake - Build and Architect Data Pipelines Using AWS

By Packt

The course helps you learn Snowflake from scratch and explore a few of its important features. You will build automated pipelines with Snowflake and use the AWS cloud with Snowflake as a data warehouse. You will also explore Snowpark and use it to work on the data pipelines.

Delivered Online On Demand
£52.99

Data Science Model Deployments and Cloud Computing on GCP

By Packt

Are you interested in learning and deploying applications at scale using Google Cloud Platform? Do you lack hands-on exposure when it comes to deploying applications and seeing them in action? Then this course is for you. You will also learn microservices and event-driven architectures with real-world use case implementations.

Delivered Online On Demand
£56.99

Apache Kafka Series - Learn Apache Kafka for Beginners v3

By Packt

A beginner-level course that follows a step-by-step approach to learning the fundamentals and core concepts of Apache Kafka 3.0. You will work through interesting activities such as programming a Twitter producer and Elasticsearch consumer to understand the various concepts.

Delivered Online On Demand
£35.99

Spark Programming in Python for Beginners with Apache Spark 3

By Packt

Advance your data skills by mastering Spark programming in Python. This beginner-level course will help you understand the core concepts of Apache Spark 3 and give you the knowledge to apply those concepts to build data engineering solutions.

Delivered Online On Demand
£37.99

Data Science, Analytics, and AI for Business and the Real World™

By Packt

This course covers the basic theory and programming skills required of a data scientist, featuring 35+ practical case studies on common business problems that data scientists face. It seeks to fill the gaps in knowledge that scare off beginners while helping you apply your knowledge of data science and deep learning to real-world business problems.

Delivered Online On Demand
£101.99

Real-Time Stream Processing Using Apache Spark 3 for Python Developers

By Packt

Get to grips with real-time stream processing using PySpark and Spark Structured Streaming, and apply that knowledge to build stream processing solutions. This course is example-driven and follows a working session-like approach.

Delivered Online On Demand
£93.99

Educators matching "Hadoop"

ThinCor


Thincor Academy, with almost 10 years of experience, is one of the best training institutes for networking and programming. There is a lot of scope for students and professionals who have completed a course in networking and programming, so we as a team have been putting in a lot of time and energy to study ongoing trends and provide the best coaching. We strive to provide a different kind of training that helps students choose a better career. It has been our continuous effort to keep track of the latest technology so that we can meet the requirements put forward by students. If you are looking for a course in Java, networking, or hardware, then you are at the right place. For Java, we provide both basic and advanced courses with the latest frameworks, such as Spring, Hibernate, Struts, and Spring Boot. We also conduct free workshops so that students get an overall idea of the courses and how they will be helpful. In order to provide world-class training, Thincor is affiliated with many networking and development companies. All our training syllabi are designed as per the latest requirements of the corporate sector. Apart from Java, we have been giving importance to big data and Hadoop. We believe in providing quality education. Students who plan to study any of these courses might have a lot of doubts, but be assured that our team of experienced faculty will provide in-depth knowledge, leaving no doubts behind. A good teaching environment, the best teaching methods, experienced faculty, and a well set up lab are what we believe have helped us make the number one training

EduCBA

