Course Outline

Introduction

Overview of "Open Studio for Big Data" Features and Architecture

Setting up Open Studio for Big Data

Navigating the UI

Understanding Big Data Components and Connectors

Connecting to a Hadoop Cluster

Reading and Writing Data

Processing Data with Hive and MapReduce

Analyzing the Results

Improving the Quality of Big Data

Building a Big Data Pipeline

Managing Users, Groups, Roles, and Projects

Deploying Open Studio to Production

Monitoring Open Studio

Troubleshooting

Summary and Conclusion

Requirements

  • An understanding of relational databases
  • An understanding of data warehousing
  • An understanding of ETL (Extract, Transform, Load) concepts

Audience

  • Business intelligence professionals
  • Database professionals
  • SQL Developers
  • ETL Developers
  • Solution architects
  • Data architects
  • Data warehousing professionals
  • System administrators and integrators
 28 Hours

Number of participants



Price per participant

Testimonials (5)

Related Courses

Data Vault: Building a Scalable Data Warehouse

28 Hours

Spark Streaming with Python and Kafka

7 Hours

Confluent KSQL

7 Hours

Apache Ignite for Developers

14 Hours

Unified Batch and Stream Processing with Apache Beam

14 Hours

Apache Apex: Processing Big Data-in-Motion

21 Hours

Apache Storm

28 Hours

Apache NiFi for Administrators

21 Hours

Apache NiFi for Developers

7 Hours

Apache Flink Fundamentals

28 Hours

Python and Spark for Big Data (PySpark)

21 Hours

Introduction to Graph Computing

28 Hours

Artificial Intelligence - the most applied stuff - Data Analysis + Distributed AI + NLP

21 Hours

Apache Spark MLlib

35 Hours

Knowledge Discovery in Databases (KDD)

21 Hours

Related Categories

1