Talend Data Integration Basics

Talend Studio for Data Integration dramatically improves the efficiency of data integration Job design through an easy-to-use graphical development environment. With integrated connectors to source and target systems, it enables rapid deployment and reduces maintenance costs. It supports all types of data integration, migration, and synchronization.

This course helps you use Talend Studio for Data Integration as quickly as possible. It focuses on the basic capabilities of Studio and how you can use it to build reliable, maintainable data integration tasks that solve practical problems, including extracting data from common database and file formats, transforming it, and integrating it into targets.

This course serves as a prerequisite for many other Talend courses, and the skills learned apply to most products.

Durata2 days
(14 hours)
Target audienceAnyone who wants to use the Talend Studio to perform data integration tasks: software developers and development managers
PrerequisitesBasic knowledge of computing, including familiarity with Java or another programming language, SQL, and general database concepts
Course objectives

After completing this course, you will be able to:

  • Create a project
  • Create and run a Job that reads, converts, and writes data
  • Merge data from several sources within a Job
  • Save a schema for repeated use
  • Create and use metadata and context variables within Jobs
  • Connect to, read from, and write to a database from a Job
  • Access a web service from a Job
  • Work with master Jobs and subJobs
  • Build, export, and test-run Jobs outside Studio
  • Implement basic error-handling techniques

Use best practices for Job and component naming, hints, and documentation

Course agenda

Getting started with Talend Studio

  • Starting Talend Studio
  • Creating a first Job
  • Running a Job

Working with files

  • Reading an input file
  • Transforming data
  • Running a Job
  • Combining columns
  • Duplicating a Job

Joining data sources

  • Creating metadata
  • Creating a join
  • Capturing rejects
  • Correcting a lookup

Filtering data

  • Filtering output data
  • Using tMap for multiple filters

Using context variables

  • Understanding and using context variables
  • Using repository context variables

Error handling

  • Detecting and handling basic errors
  • Raising a warning

Generic schemas

  • Setting up sales data files
  • Creating customer metadata
  • Creating product metadata

Working with databases

  • Creating database metadata
  • Creating a customer table
  • Creating a product table
  • Setting up a sales table
  • Joining data
  • Finalizing the Job

Creating master Jobs

  • Controlling Job execution using a master Job

Working with web services

  • Accessing a web service

Running a stand-alone Job

  • Building a Job
  • Modifying a Job

Documenting a Job

  • Using best practices while documenting a Job