Pengelolaan data terintegrasi menjadi kebutuhan utama organisasi yang bergantung pada laporan analitik dan pengambilan keputusan berbasis fakta. Proses pengolahan data modern menuntut alur kerja terstruktur, kemampuan menangani berbagai sumber data, serta mekanisme transformasi yang menjaga konsistensi dan kualitas informasi sejak tahap awal hingga siap digunakan oleh pengguna bisnis (Kimball & Ross, 2021).
Pembelajaran diarahkan pada penguasaan alur pengolahan data secara menyeluruh, mulai dari penarikan data, pengaturan proses bertahap, hingga pengelolaan ketergantungan antar aktivitas. Peserta mempelajari cara merancang proses yang modular, mengatur urutan eksekusi, menangani kesalahan, serta mengelola berkas dan sumber data agar proses berjalan stabil dan mudah dipelihara. Pendekatan ini membantu membangun solusi pengolahan data yang rapi, terukur, dan dapat digunakan ulang.
Selain pengolahan dasar, materi juga mencakup teknik peningkatan kualitas data melalui pembersihan, validasi, pengelompokan, dan normalisasi. Peserta dikenalkan pada prinsip pemuatan data analitik, optimasi performa, serta praktik terbaik perancangan proyek agar solusi siap diterapkan lintas lingkungan operasional. Fondasi ini mendukung pengembangan sistem analitik yang efisien, konsisten, dan relevan bagi kebutuhan bisnis modern (Gartner, 2023).
OBJECTIVES
1. Memahami Konsep Data Warehouse dan Pentaho Data Integration (PDI)
2. Mendesain dan Menjalankan Transformasi Data
3. Mampu Melakukan Manipulasi Data dan Metadata
4. Mengoptimalkan Performa Transformasi Data
5. Mengimplementasikan Model Data Dimensional dalam Data Warehouse
AUDIENCE
1. Data Engineer
2. Data Analyst
3. Software Developer
4. ETL Developer
5. IT Manager
6. System Architect
PREREQUISITES
Tidak ada training khusus yang dipersyaratkan
CONTENT
1. Getting Started with Pentaho Data Integration
1.1. Pentaho Data Integration and Pentaho BI Suite
1.2. Installing PDI
1.3. Launching the PDI Graphical Designer – Spoon
1.4. Introducing Transformations
2. Getting Started with Transformations
2.1. Designing and Previewing Transformations
2.2. Understanding PDI Data and Metadata
2.3. Handling Errors
3. Creating Basic Task Flows
3.1. Introducing Jobs
3.2. Designing and Running Jobs
3.3. Running Transformations from a Job
3.4. Managing Files
3.5. Understanding and Changing the Flow of Execution
3.6. Knowing the Basics About Kettle Variables
4. Reading and Writing Files
4.1. Reading Data from Files
4.2. Outputting Data to Files
4.3. Working with Big Data and Cloud Sources
5. Manipulating PDI Data and Metadata
5.1. Manipulating Simple Fields
5.2. Working with Complex Structures
6. Cleansing, Validating, and Fixing Data
6.1. Cleansing Data
6.2. Validating Data
6.3. Treating Invalid Data by Splitting and Merging Streams
7. Transforming the Dataset
7.1. Sorting Data
7.2. Working on Groups of Rows
7.3. Converting Rows to Columns
7.4. Normalizing Data
8. Performing Basic Operations with Databases
8.1. Connecting to a Database
8.2. Previewing and Getting Data from a Database
8.3. Inserting, Updating, and Deleting Data
8.4. Verifying a Connection, Running DDL Scripts, and Tasks
9. Loading Data Marts with PDI
9.1. Preparing the Environment
9.2. Introducing Dimensional Modeling
9.3. Loading Dimensions with Data
9.4. Loading Fact Tables
10. Creating Portable and Reusable Transformations
10.1. Defining and Using Kettle Variables
10.2. Creating Reusable Transformations
10.3. Making the Data Flow Between Transformations
10.4. Executing Transformations in an Iterative Way
11. Implementing Metadata Injection
11.1. Introducing Metadata Injection
11.2. Discovering Metadata and Injecting IT
11.3. Identifying Use Cases to Implement Metadata Injection
12. Best Practices for Designing and Deploying a PDI Project
12.1. Setting Up a New Project
12.2. Best Practices to Design Jobs and Transformations
12.3. Maximizing the Performance
12.4. Deploying the Project in Different Environments
Course Features
- Lectures 46
- Quizzes 2
- Duration 40 hours
- Skill level All levels
- Language Indonesia
- Students 11
- Certificate Yes
- Assessments Yes
- 14 Sections
- 46 Lessons
- 40 Hours
- 1. PERSIAPAN2
- 1. GETTING STARTED WITH PENTAHO DATA INTEGRATION4
- 2. GETTING STARTED WITH TRANSFORMATIONS3
- 3. CREATING BASIC TASK FLOWS6
- 4. READING AND WRITING FILES3
- 5. MANIPULATING PDI DATA AND METADATA2
- 6. CLEANSING, VALIDATING, AND FIXING DATA3
- 7. TRANSFORMING THE DATASET4
- 8. PERFORMING BASIC OPERATIONS WITH DATABASES4
- 9. LOADING DATA MARTS WITH PDI4
- 10. CREATING PORTABLE AND REUSABLE TRANSFORMATIONS4
- 11. IMPLEMENTING METADATA INJECTION3
- 12. BEST PRACTICES FOR DESIGNING AND DEPLOYING A PDI PROJECT4
- 3. PENUTUPAN2




