• Professional Development
  • Medicine & Nursing
  • Arts & Crafts
  • Health & Wellbeing
  • Personal Development

237 Big Data courses

🔥 Limited Time Offer 🔥

Get a 10% discount on your first order when you use this promo code at checkout: MAY24BAN3X

Apache Spark with Scala - Hands-On with Big Data!

By Packt

This is a comprehensive and practical Apache Spark course. In this course, you will learn and master the art of framing data analysis problems as Spark problems through 20+ hands-on examples, and then scale them up to run on cloud computing services. Explore Spark 3, IntelliJ, Structured Streaming, and a stronger focus on the DataSet API.

Apache Spark with Scala - Hands-On with Big Data!
Delivered Online On Demand
£74.99

Professional Certificate Course in Big Data Infrastructure in London 2024

4.9(261)

By Metropolitan School of Business & Management UK

Dive into the heart of Big Data Infrastructure, exploring storage systems, distributed file frameworks, and processing paradigms. This course provides a comprehensive understanding of key components like HDFS, Apache Spark, and Cassandra, offering insights into their architecture, use cases, and real-world applications. This course is a deep dive into the complex landscape of Big Data Infrastructure. From unravelling the architecture of Apache Spark to dissecting the benefits of distributed file systems, participants gain expertise in assessing, comparing, and implementing various Big Data storage and processing systems. Scalability, fault-tolerance, and industry-specific case studies add practical depth to theoretical knowledge. After the successful completion of this course, you will be able to: * Understand the Components of Big Data Infrastructure, Including Storage Systems, Distributed File Systems, and Processing Frameworks. * Identify the Characteristics and Benefits of Distributed File Systems Such as Hadoop Distributed File System (H.D.F.S). * Describe the Architecture and Capabilities of Apache Spark and its Role in Big Data Processing. * Recognise the Use Cases and Benefits of Apache Cassandra as a Distributed N..O.S.Q.L Database. * Compare and Contrast Different Big Data Storage and Processing Systems Such as Hadoop, Spark, and Cassandra. * Understand the Scalability and Fault-tolerance Mechanisms Used in Big Data Infrastructure, Such as Sharding and Replication. * Appreciate the Challenges Associated with Deploying and Managing Big Data Infrastructure, Such as Hardware and Software Configuration and Security Considerations. Explore the intricacies of Big Data Infrastructure, from understanding storage systems to unraveling the nuances of distributed file frameworks and processing engines. Gain a comprehensive view of scalability, fault-tolerance mechanisms, and industry-specific challenges through engaging case studies. Equip yourself to navigate the dynamic landscape of Big Data with confidence and expertise. * VIDEO - COURSE STRUCTURE AND ASSESSMENT GUIDELINES Watch this video to gain further insight. * NAVIGATING THE MSBM STUDY PORTAL Watch this video to gain further insight. * INTERACTING WITH LECTURES/LEARNING COMPONENTS Watch this video to gain further insight. * BIG DATA INFRASTRUCTURE Self-paced pre-recorded learning content on this topic. * BIG DATA INFRASTRUCTURE Put your knowledge to the test with this quiz. Read each question carefully and choose the response that you feel is correct. All MSBM courses are accredited by the relevant partners and awarding bodies. Please refer to MSBM accreditation in about us for more details. There are no strict entry requirements for this course. Work experience will be an added advantage to understanding the content of the course. The certificate is designed to enhance the learner's knowledge in the field. This certificate is for everyone who is eager to know more and get updated on current ideas in their respective field. We recommend this certificate for the following audience. * Big Data Infrastructure Engineer * Hadoop Administrator * Spark Developer * Cassandra Database Administrator * Big Data Solutions Architect * Data Infrastructure Manager * NoSQL Database Analyst * Big Data Consultant AVERAGE COMPLETION TIME 2 Weeks ACCREDITATION 3 CPD Hours LEVEL Advanced START TIME Anytime 100% ONLINE Study online with ease. UNLIMITED ACCESS 24/7 unlimited access with pre-recorded lectures. LOW FEES Our fees are low and easy to pay online.

Professional Certificate Course in Big Data Infrastructure in London 2024
Delivered Online On Demand
£28

PySpark and AWS: Master Big Data with PySpark and AWS

By Packt

The course is crafted to reflect the most in-demand workplace skills. It will help you understand all the essential concepts and methodologies with regards to PySpark. This course provides a detailed compilation of all the basics, which will motivate you to make quick progress and experience much more than what you have learned.

PySpark and AWS: Master Big Data with PySpark and AWS
Delivered Online On Demand
£101.99

Designing and Building Big Data Applications

By Nexus Human

Duration 4 Days 24 CPD hours This course is intended for This course is best suited to developers, engineers, and architects who want to use use Hadoop and related tools to solve real-world problems. Overview Skills learned in this course include:Creating a data set with Kite SDKDeveloping custom Flume components for data ingestionManaging a multi-stage workflow with OozieAnalyzing data with CrunchWriting user-defined functions for Hive and ImpalaWriting user-defined functions for Hive and ImpalaIndexing data with Cloudera Search Cloudera University?s four-day course for designing and building Big Data applications prepares you to analyze and solve real-world problems using Apache Hadoop and associated tools in the enterprise data hub (EDH). INTRODUCTION APPLICATION ARCHITECTURE * Scenario Explanation * Understanding the Development Environment * Identifying and Collecting Input Data * Selecting Tools for Data Processing and Analysis * Presenting Results to the Use DEFINING & USING DATASETS * Metadata Management * What is Apache Avro? * Avro Schemas * Avro Schema Evolution * Selecting a File Format * Performance Considerations USING THE KITE SDK DATA MODULE * What is the Kite SDK? * Fundamental Data Module Concepts * Creating New Data Sets Using the Kite SDK * Loading, Accessing, and Deleting a Data Set IMPORTING RELATIONAL DATA WITH APACHE SQOOP * What is Apache Sqoop? * Basic Imports * Limiting Results * Improving Sqoop?s Performance * Sqoop 2 CAPTURING DATA WITH APACHE FLUME * What is Apache Flume? * Basic Flume Architecture * Flume Sources * Flume Sinks * Flume Configuration * Logging Application Events to Hadoop DEVELOPING CUSTOM FLUME COMPONENTS * Flume Data Flow and Common Extension Points * Custom Flume Sources * Developing a Flume Pollable Source * Developing a Flume Event-Driven Source * Custom Flume Interceptors * Developing a Header-Modifying Flume Interceptor * Developing a Filtering Flume Interceptor * Writing Avro Objects with a Custom Flume Interceptor MANAGING WORKFLOWS WITH APACHE OOZIE * The Need for Workflow Management * What is Apache Oozie? * Defining an Oozie Workflow * Validation, Packaging, and Deployment * Running and Tracking Workflows Using the CLI * Hue UI for Oozie PROCESSING DATA PIPELINES WITH APACHE CRUNCH * What is Apache Crunch? * Understanding the Crunch Pipeline * Comparing Crunch to Java MapReduce * Working with Crunch Projects * Reading and Writing Data in Crunch * Data Collection API Functions * Utility Classes in the Crunch API WORKING WITH TABLES IN APACHE HIVE * What is Apache Hive? * Accessing Hive * Basic Query Syntax * Creating and Populating Hive Tables * How Hive Reads Data * Using the RegexSerDe in Hive DEVELOPING USER-DEFINED FUNCTIONS * What are User-Defined Functions? * Implementing a User-Defined Function * Deploying Custom Libraries in Hive * Registering a User-Defined Function in Hive EXECUTING INTERACTIVE QUERIES WITH IMPALA * What is Impala? * Comparing Hive to Impala * Running Queries in Impala * Support for User-Defined Functions * Data and Metadata Management UNDERSTANDING CLOUDERA SEARCH * What is Cloudera Search? * Search Architecture * Supported Document Formats INDEXING DATA WITH CLOUDERA SEARCH * Collection and Schema Management * Morphlines * Indexing Data in Batch Mode * Indexing Data in Near Real Time PRESENTING RESULTS TO USERS * Solr Query Syntax * Building a Search UI with Hue * Accessing Impala through JDBC * Powering a Custom Web Application with Impala and Search

Designing and Building Big Data Applications
Delivered on-request, onlineDelivered Online
Price on Enquiry

Scala & Spark-Master Big Data with Scala and Spark

By Packt

Scala is doubtless one of the most in-demand skills for data scientists and data engineers. This competitive course will teach you the essential concepts and methodologies of Scala with a lot of practical implementations.

Scala & Spark-Master Big Data with Scala and Spark
Delivered Online On Demand
£93.99

Master Big Data Ingestion and Analytics with Flume, Sqoop, Hive and Spark

By Packt

A complete course on Sqoop, Flume, and Hive: Ideal for achieving CCA175 and Hortonworks Spark Certification

Master Big Data Ingestion and Analytics with Flume, Sqoop, Hive and Spark
Delivered Online On Demand
£70.99

CCA 159: Expert in Big Data Analytics - Advance Hive and Sqoop

By Packt

Big data certification for non-programmers, business analysts, testers, and SQL developers

CCA 159: Expert in Big Data Analytics - Advance Hive and Sqoop
Delivered Online On Demand
£93.99

Real-Time Stream Processing Using Apache Spark 3 for Scala Developers

By Packt

Learn the process to design and develop big data engineering projects using Apache Spark. This example-driven advanced-level course will help you understand real-time stream processing using Apache Spark and you can apply that knowledge to build real-time stream processing solutions.

Real-Time Stream Processing Using Apache Spark 3 for Scala Developers
Delivered Online On Demand
£22.99

Big Data Analysis & Data Science Diploma

By NextGen Learning

Get ready for an exceptional online learning experience with the Big Data Analysis & Data Science Diploma bundle! This carefully curated collection of 20 premium courses is designed to cater to a variety of interests and disciplines. Dive into a sea of knowledge and skills, tailoring your learning journey to suit your unique aspirations. The Big Data Analysis & Data Science Diploma is a dynamic package, blending the expertise of industry professionals with the flexibility of digital learning. It offers the perfect balance of foundational understanding and advanced insights. Whether you're looking to break into a new field or deepen your existing knowledge, the Data Analysis & Data Science Diploma package has something for everyone. As part of the Big Data Analysis & Data Science Diploma package, you will receive complimentary PDF certificates for all courses in this bundle at no extra cost. Equip yourself with the Data Analysis & Data Science Diploma bundle to confidently navigate your career path or personal development journey. Enrol today and start your career growth! This bundle comprises the following courses: CPD Quality Standards Courses: 1. Big Data Analytics with PySpark Power BI and MongoDB 2. Big Data Analytics with PySpark Tableau Desktop and MongoDB 3. Building Big Data Pipelines with PySpark MongoDB and Bokeh 4. Develop Big Data Pipelines with R & Sparklyr & Tableau 5. Develop Big Data Pipelines with R, Sparklyr & Power BI 6. Complete Python Machine Learning & Data Science Fundamentals Learning Outcome: * Gain comprehensive insights into multiple fields. * Foster critical thinking and problem-solving skills across various disciplines. * Understand industry trends and best practices through the Data Analysis & Data Science Diploma Bundle. * Develop practical skills applicable to real-world situations. * Enhance personal and professional growth with the Data Analysis & Data Science Diploma. * Build a strong knowledge base in your chosen course via the Data Analysis & Data Science Diploma. * Benefit from the flexibility and convenience of online learning. * With the Data Analysis & Data Science Diploma package, validate your learning with a CPD certificate. Each course in this bundle holds a prestigious CPD accreditation, symbolising exceptional quality. The materials, brimming with knowledge, are regularly updated, ensuring their relevance. This bundle promises not just education but an evolving learning experience. Engage with this extraordinary collection, and prepare to enrich your personal and professional development. Embrace the future of learning with the Big Data Analysis & Data Science Diploma, a rich anthology of 15 diverse courses. Each course in the Data Analysis & Data Science Diploma bundle is handpicked by our experts to ensure a wide spectrum of learning opportunities. ThisBig Data Analysis & Data Science Diploma bundle will take you on a unique and enriching educational journey. The bundle encapsulates our mission to provide quality, accessible education for all. Whether you are just starting your career, looking to switch industries, or hoping to enhance your professional skill set, the Big Data Analysis & Data Science Diploma bundle offers you the flexibility and convenience to learn at your own pace. Make the Data Analysis & Data Science Diploma package your trusted companion in your lifelong learning journey. CPD 25 CPD hours / points Accredited by CPD Quality Standards WHO IS THIS COURSE FOR? The Big Data Analysis & Data Science Diploma bundle is perfect for: * Lifelong learners looking to expand their knowledge and skills. * Professionals seeking to enhance their career with CPD certification. * Individuals wanting to explore new fields and disciplines. * Anyone who values flexible, self-paced learning from the comfort of home. CAREER PATH Unleash your potential with the Big Data Analysis & Data Science Diploma bundle. Acquire versatile skills across multiple fields, foster problem-solving abilities, and stay ahead of industry trends. Ideal for those seeking career advancement, a new professional path, or personal growth. Embrace the journey with the Big Data Analysis & Data Science Diploma bundle package. CERTIFICATES CERTIFICATE OF COMPLETION Digital certificate - Included CERTIFICATE OF COMPLETION Hard copy certificate - Included You will get a complimentary Hard Copy Certificate.

Big Data Analysis & Data Science Diploma
Delivered Online On Demand
£41

The Ultimate Hands-On Hadoop

By Packt

This course will show you why Hadoop is one of the best tools to work with big data. With the help of some real-world data sets, you will learn how to use Hadoop and its distributed technologies, such as Spark, Flink, Pig, and Flume, to store, analyze, and scale big data.

The Ultimate Hands-On Hadoop
Delivered Online On Demand
£134.99