Talend Data Quality Pack#1

Dopo aver completato questo corso, sarete in grado di sviluppare i vostri jobs utilizzando la potenza dei componenti avanzati del software e Java; sarete anche in grado di analizzare i votri dati e  massimizzare la qualità.

Anche se la conoscenza di SQL è l’unico prerequisito per qualificarsi a questo corso, la conscenza del business sarà necessaria per interpretare le analisi relative alla qualità dei dati. La conoscenza del linguaggio Java è gradita.    

Sempre basato su reali casi d’uso, questi cinque giorni di corso vi permetteranno di padroneggiare il software Talend Data Quality per analizzare i dati e per pulirli.

 

Course objectives:

  1. Model needs
  2. Control the component library
  3. Implement Jobs
  4. Develop analyses to monitor and improve data quality

Target audience:

  1. Project manager
  2. BI Expert
  3. System Engineer/DBA
  4. Development Engineer
  5. Architect

Pre-requisites:

  1. Knowledge of the SQL language
  2. Knowledge of the Java language is a plus

Teaching Method:

This training course is based on real use cases
Theory: 20%
Practice: 80%

Duration::

5 days or 35 hours

 1. Learn how to use Talend Studio

  1. Model your needs and document a project
  2. Use the Job Designer to generate the code
  3. Manage access to files and access to databases
  4. Use the different transformation components
  5. Centralize metadata in the Repository
  6. Master advanced features
  7. Debug scripts and deploy jobs

 2. Presentation and installation of Talend Data Quality

  1. Presentation and installation of Talend Data Quality

 3. Connect to data sources and run your first analyses

  1. Connect to data sources and run your first analyses

 4. Analyze data from a column using patterns

  1. Generate regular expressions through the Pattern indicators
  2. Define your own built-in patterns
  3. Import patterns from Talend Exchange

 5. Apply your business rules in a Single Table / Multi-Column analysis

  1. Discover the Single Table analysis
  2. Create and apply your business rules
  3. Check the integrity of your data

 6. Generate your own reports

  1. Generate a simple report
  2. Generate a progress report - data quality monitoring

 7. Optimize your quality/clean your data

  1. Remove invalid data
  2. Parse, standardize and consolidate your data

 8. Define correlation analyses

  1. Create a Numerical Correlation analysis
  2. Create a Time Correlation analysis
  3. Create a Nominal Correlation analysis

 9. Take advantage of the Data Quality portal

  1. Use the dashboards
  2. Use the built-in Talend Data Quality reports
  3. Customize your reports in iReport

10. Set up team work - task management

  1. Create, revise and complete your reports

11. Benefit from Talend community support and services

  1. Discover the Best Practices
  2. Promote your patterns in the Talend community
  3. Use the Talend resources and services