Talend Data Quality


New Generation of Data Management

Talend Data Quality offers a new generation of profiling, cleansing, matching and monitoring capabilities that have been created to support data governance. One solution, running in a single work environment, solves your data quality issues and empowers cross-functional teams to get the most out of their data.



Data Quality Overview

When it comes to clean, accurate data, Talend Data Quality does it all. It identifies anomalies, cleanses inaccurate and inconsistent data, resolves duplicate records and provides the capability to augment and enhance your data. It extends profiling and real-time dashboards for insight into data quality and provides the means not only to identify issues but also to create automatic processes to resolve and clean data. Talend Data Quality is enterprise-ready, offering a highly scalable, reusable platform for data management.


Key Capabilities

Data Profiling

The first step in improving the quality of enterprise data is to understand it completely, making sure it conforms to company and industry standards. Data profiling provides pre-defined tests and business rules to ensure data is fit for use within your enterprise application.


Standardization

Once the problem areas have been identified, you can correct them. Talend Data Quality includes components that provide valuable standardization capabilities, such as name and address cleansing, correction with external reference data, parsing, third-party address validation and standardization. These powerful tools let you manage data quality across all data domains—including customer, product, financial, and transactional—using a single development environment.


Matching and Survivorship

Talend Data Quality represents a new generation of data matching solutions that replaces the overly complex process used by legacy vendors. Green-screen match rules editing and special matching languages are left in the past. With Talend, matching is configured within the Talend work environment, eliminating the daunting task of editing rules files and arcane matching languages in the multiple GUIs associated with some data quality tools. Sophisticated matching algorithms help you find duplicates and near duplicates in your data.


Data Monitoring

Data Quality Portal provides customizable Web-based data quality monitoring and reporting to help organizations keep watch over crucial data quality metrics that may impact important business processes. By reviewing the metrics on a regular basis and following their trends, a company can follow the evolution (improvement or degradation) of the quality of its data through data profiling. This helps build alignment and highlight areas of improvement.


Enterprise, Big Data and Cloud-Ready

We recognize that delivering clean data between systems is a mission-critical imperative. Talend offers key features that can deliver cleaner data access with outstanding connectivity, performance and scalability and therefore overcome the challenges of implementing data quality as an enterprise-wide function.

Data Quality for Data Governance

Data Profiling

The first step in improving the quality of enterprise data is to understand it and make sure it conforms to company and industry standards. With Talend Data Quality, users can:

  • Profile, analyze, and create informative reports on the status of data quality to help build alignment and highlight areas for improvement
  • Get started quickly with pre-defined tests and business rules to ensure data quality is fit-for-use within your enterprise application
  • Define custom business rules and thresholds to alert you when data quality falls outside of specifications
  • Drill down to specific records with a simple right-click.
  • Quickly and easily put a workflow in place to remedy troublesome spots – profiling results can immediately translate into remediation.
  • Monitor data quality over time to track the impact of new processes on your data quality
  • Share data profiling results with your cross-functional team to enable data governance
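
The profiling ideas in the list above (column indicators, a business rule, and drill-down to the offending records) can be illustrated with a minimal Python sketch. This is not Talend's API or implementation; the function names, the ZIP code rule and the sample data are all hypothetical and chosen only for illustration.

```python
import re
from collections import Counter


def profile_column(values):
    """Return simple profiling indicators for one column of string values."""
    non_null = [v for v in values if v not in (None, "")]
    # Reduce each value to a pattern: letters become 'A', digits become '9'.
    patterns = Counter(
        re.sub(r"[0-9]", "9", re.sub(r"[A-Za-z]", "A", v)) for v in non_null
    )
    return {
        "row_count": len(values),
        "null_count": len(values) - len(non_null),
        "distinct_count": len(set(non_null)),
        "patterns": dict(patterns),
    }


def rule_violations(values, rule):
    """Return the (row index, value) pairs that fail a business rule."""
    return [(i, v) for i, v in enumerate(values) if not rule(v)]


# Hypothetical business rule: a US ZIP code is exactly five digits.
def is_us_zip(value):
    return bool(value) and re.fullmatch(r"[0-9]{5}", value) is not None


zip_codes = ["02139", "9410", "", "10001", "1000A", "02139"]
report = profile_column(zip_codes)
bad_rows = rule_violations(zip_codes, is_us_zip)
```

Here `report` summarizes the column (one empty value, four distinct values, and a pattern count that makes the odd `9999` and `9999A` shapes stand out), while `bad_rows` lists the three records that a reviewer would drill down to.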

Standardization

Once the problem areas have been identified, you can correct them. With Talend Data Quality, users can:

  • Leverage built-in data integration to access any source or use it to enrich data. Take advantage of freely available reference data to enrich and enhance your data, or use the traditional vendors.
  • Manage data quality across all data domains—including customer, product, financial, and transactional—using a single development environment
  • Reuse all data quality rules, reference data, and processes across your entire data governance organization.
  • Use the latest parsing technology for both structured and unstructured data. The parser lets you give structure to free-form data fields for improved reporting and analytics.
  • Reduce deployment time with Talend’s data quality cheat sheets. The cheat sheets use simple questions to step you through some of the most common data quality processes, resulting in a finished data quality job.
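
To make the idea of parsing and standardizing a free-form field concrete, here is a small Python sketch. It is an illustration only and does not reflect how Talend's parser works; the regular expression, the `STREET_TYPES` reference table and the `parse_address` function are assumptions made for this example, and they handle only one simple US-style address layout.

```python
import re

# Hypothetical reference data mapping free-form variants to a standard value.
STREET_TYPES = {
    "st": "Street", "st.": "Street", "street": "Street",
    "ave": "Avenue", "ave.": "Avenue", "avenue": "Avenue",
    "rd": "Road", "rd.": "Road", "road": "Road",
}

# Expected layout: "<number> <street name> <street type>, <city>, <state> <zip>"
ADDRESS_RE = re.compile(
    r"^\s*(?P<number>\d+)\s+(?P<name>.+?)\s+(?P<type>[^\s,]+)\s*,\s*"
    r"(?P<city>[^,]+?)\s*,\s*(?P<state>[A-Za-z]{2})\s+(?P<zip>\d{5})\s*$"
)


def parse_address(text):
    """Split a free-form address line into named fields, or return None."""
    match = ADDRESS_RE.match(text)
    if match is None:
        return None  # leave unparseable input for manual review
    fields = match.groupdict()
    # Standardize the street type against the reference data when possible.
    street_type = STREET_TYPES.get(fields["type"].lower(), fields["type"])
    return {
        "number": fields["number"],
        "street": f"{fields['name'].title()} {street_type}",
        "city": fields["city"].title(),
        "state": fields["state"].upper(),
        "zip": fields["zip"],
    }


parsed = parse_address("12 main st., springfield, il 62704")
unparsed = parse_address("somewhere in town")
```

The first call yields separate, consistently cased fields (`Main Street`, `Springfield`, `IL`), while the second returns `None` so the record can be routed to review instead of being guessed at.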

Matching and Survivorship

Talend Data Quality represents a new generation of data matching solutions that moves matching away from overly complex, green-screen match rules editing and puts it in the hands of real-world business users. With the matching functions of Talend Data Quality, users can:

  • Configure matching within the Talend user environment. There is no heavy editing of rules files and no juggling of the multiple GUIs associated with most data quality tools.
  • Set up confidence weights and probability on each matching attribute to achieve the results you need
  • Use a unique match studio that shows ‘what-if’ results when you modify matching techniques, including charts and graphs for key matching metrics
  • Identify duplicate customer data, transactional data or supply chain data with easy-to-configure matching algorithms
  • Select rules and matching techniques for each individual attribute, allowing you to assemble the best possible match process for your unique data
  • Edit rules in the same environment as all your data management processes without the need to learn a matching language or proprietary match codes.
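
As a rough illustration of per-attribute matching with confidence weights, the following Python sketch scores record pairs with a weighted average of attribute comparisons and flags pairs above a threshold. It is not Talend's matching algorithm; the rule table, weights, threshold and sample records are hypothetical, and the string comparison uses the standard library's `difflib.SequenceMatcher` as a stand-in for whatever technique a real tool would apply.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Case-insensitive similarity ratio between 0.0 and 1.0."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def exact(a, b):
    """1.0 when the values are equal ignoring case and padding, else 0.0."""
    return 1.0 if a.strip().lower() == b.strip().lower() else 0.0


# Hypothetical per-attribute configuration: (comparison function, weight).
MATCH_RULES = {
    "name": (similarity, 0.5),
    "email": (exact, 0.3),
    "city": (similarity, 0.2),
}


def match_score(rec_a, rec_b, rules=MATCH_RULES):
    """Weighted average of the per-attribute comparison results."""
    total_weight = sum(weight for _, weight in rules.values())
    score = sum(
        compare(rec_a[attr], rec_b[attr]) * weight
        for attr, (compare, weight) in rules.items()
    )
    return score / total_weight


def find_duplicates(records, threshold=0.85):
    """Compare every pair of records and keep those at or above the threshold."""
    pairs = []
    for i in range(len(records)):
        for j in range(i + 1, len(records)):
            score = match_score(records[i], records[j])
            if score >= threshold:
                pairs.append((i, j, round(score, 3)))
    return pairs


customers = [
    {"name": "Jon Smith", "email": "jon.smith@example.com", "city": "Boston"},
    {"name": "John Smith", "email": "jon.smith@example.com", "city": "Boston"},
    {"name": "Maria Garcia", "email": "mgarcia@example.com", "city": "Austin"},
]
duplicates = find_duplicates(customers)
```

Changing a weight or the threshold and rerunning `find_duplicates` is a simple way to explore 'what-if' behaviour. Comparing every pair is quadratic in the number of records, so a sketch like this only suits small samples.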

Data Monitoring

Data Quality Portal provides customizable Web-based data quality monitoring and reporting to help organizations keep watch over crucial data quality metrics that may impact important business processes. Users can:

  • Review data quality metrics to enable a data governance team to follow the evolution (improvement or degradation) of the quality of its data through data profiling.
  • Enforce data quality standards as new data is created. Data Quality Portal delivers customized key quality indicators (KQI) to a Web-based portal where teams can collaborate on the process of improving data quality across the enterprise.
  • Generate reports in standard formats to share. Graphical reports are generated in web page, Adobe Acrobat and Excel formats. Use reporting data stored in XML and in the repository for business intelligence applications.
  • Trigger events, like the sending of an e-mail, if metrics fall outside specifications, so you know as soon as data quality slips.
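
The monitoring loop described above (measure an indicator, follow its trend, trigger an event when it falls outside specifications) can be sketched in a few lines of Python. This is only an illustration, not the Data Quality Portal's behaviour; the `completeness` indicator, the 95% minimum and the `notify` callback are assumptions, and the callback here simply collects messages where a real setup might send an e-mail.

```python
def completeness(values):
    """Share of non-empty values, as a percentage."""
    if not values:
        return 100.0
    filled = sum(1 for v in values if v not in (None, ""))
    return 100.0 * filled / len(values)


def check_indicator(name, history, minimum, notify):
    """Compare the latest measurement with its specification and its trend.

    history is a list of measurements, oldest first. notify is any callable
    that accepts a message string (for example a function that sends e-mail).
    """
    latest = history[-1]
    trend = latest - history[-2] if len(history) > 1 else 0.0
    status = "ok" if latest >= minimum else "out_of_spec"
    if status == "out_of_spec":
        notify(
            f"{name}: {latest:.1f}% is below the {minimum:.1f}% minimum "
            f"(change since last run: {trend:+.1f})"
        )
    return {"indicator": name, "latest": latest, "trend": trend, "status": status}


alerts = []                # stands in for an e-mail or messaging channel
history = [98.0, 96.5]     # hypothetical results of earlier runs
history.append(completeness(["a@example.com", "", None, "b@example.com"]))
result = check_indicator(
    "email completeness", history, minimum=95.0, notify=alerts.append
)
```

Keeping the history alongside the latest value is what lets a team see whether quality is improving or degrading, rather than only whether the current run passed.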

Enterprise, Big Data and Cloud-Ready

We recognize that delivering clean data between systems is a mission-critical imperative. Talend offers key features that can deliver cleaner data access with outstanding connectivity, performance and scalability and therefore overcome the challenges of implementing data quality as an enterprise-wide function. Talend Data Quality users can:

  • Use universal connectivity to virtually all data sources to access and cleanse any data in any source, within the enterprise or in the cloud
  • Manage data quality jobs centrally to promote reuse and support data governance
  • Deploy data quality services in batch or real time, including data quality firewalls to prevent low-quality data from entering applications
  • Reuse all profiling and data cleansing jobs across all applications and projects with a shared repository
  • Build, test, and deploy data quality services to support all applications on Talend’s services-based architecture (ESB)
  • Set up multiple servers and nodes, including configurations that support Hadoop, clustering and high availability

Enabling Enterprise Initiatives

Talend Data Quality may be an initiative in itself, but more commonly it makes another IT initiative better.

Data Migration

  • Data Quality Function: Profiling provides a complete understanding of data challenges early in the project, while standardization and matching can be part of data transformation.
  • Data Quality Value: Profiling provides a complete understanding of the data before the project team attempts to migrate it. This can help the project team create a more accurate plan for integration. Failure to completely understand data can lead to major cost overruns and project delays as the organization enters into the ETL-repeat cycle. Data quality functions help the organization standardize the data as it is moved.


Customer Relationship Management (CRM)

  • Data Quality Function: Data quality provides standardized name and address data to the CRM application.
  • Data Quality Value: Data quality technology can work as a real-time process, limiting the number of typos and duplicates in the system and thus improving call center efficiency. The organization will benefit from a cleaner customer list with fewer duplicate records. Data profiling can also help an organization understand and monitor the quality of a purchased list before integration, which helps avoid issues with third-party data.


Enterprise Resource Planning (ERP) and Supply Chain Management (SCM)

  • Data Quality Function: Data quality standardizes “parts” data to provide a more accurate view of inventory levels.
  • Data Quality Value: Data quality technology can be used to report inventory levels more accurately, lowering inventory costs. You may also be able to improve bargaining power with suppliers by gaining better intelligence about your corporate buying power. If data is accurate, the organization will have a more complete picture of the supply chain.


Data Warehouse and Business Intelligence

  • Data Quality Function: Data quality ensures disparate data sources will act as one when migrated to a data warehouse.
  • Data Quality Value: Data quality makes a data warehouse possible by standardizing disparate data. The organization will be able to generate more accurate reports when trying to understand sales patterns, revenue, customer demographics and other important metrics.


Regulatory Compliance

  • Data Quality Function: Data quality offers an added assurance that reports are accurate.
  • Data Quality Value: Data quality technology helps the organization avoid errors in reports on customers and sales figures. Companies concerned with complying with industry-specific, state and federal regulations need to make sure the data in their systems is accurate.


Master Data Management (MDM) and

  • Data Quality Function: Data quality is a key component of master data management.
  • Data Quality Value: An integral part of making applications communicate and share data is to have standardized data. MDM enhances the basic premise of data quality with additional features like persistent keys, a graphical user interface to mitigate matching, the ability to publish and subscribe to enterprise applications, and more.


Testing Talend Data Quality - an independent software review by IAIT

Read how independent reviewer Dr. Götz Güttich from the IAIT test lab grades Talend Data Quality as a “must-have” for companies looking to improve the quality of their data. This unbiased and inclusive review was completed in the spring of 2011.

The Butterfly Effect of Poor Data Quality

Poor data quality has more impact than most people think. Chaos theory, specifically the "butterfly effect", explains how a single piece of poor data can have long-term consequences on your corporate vision. This white paper discusses the ubiquitous nature of data and how data quality monitoring, profiling, cleansing and enrichment can be used to minimize the chaos.

Create Business Friendly Data Quality Dashboards

This one-hour webinar presents:

  • How to set up your first dashboard
  • How to centralize and manage high-performance Excel-based reporting and analysis with Palo
  • How to create business-friendly reports
  • How to customize reports for your business