What are the Key Concepts of Talend ETL? PDF Print E-mail
Article Index
What are the Key Concepts of Talend ETL?
Data migration
Data synchronisation & replication
ETL for Business Intelligence & Data Warehousing
Data Profiling
All Pages


Data Profiling

Data profiling is the process of examining the data available in any existing data source and collecting statistics and information about that data. With this detailed analysis you’ll be able to:

 

Track data quality

  •  
    • give metrics on data quality including whether the data conforms to company standards
    • find out whether existing data can easily be used for other purposes
    • assess the risk involved in integrating data for new applications, including the challenges of joins

 

Understand your data 

  •  
    • assess whether metadata accurately describes the actual values in the source database
    • have an enterprise view of all data, for uses such as Master Data Management or Data Governance initiatrives.
    • understanding data challenges early in any data intensive project, so that late project surprises are avoided. Finding data problems late in the project can incur time delays and project cost overruns.

 

Talend Open Profiler is a sophisticated yet simple-to-use data profiler facilitating:

    • Connection to databases and files to introspect their structures with the resultant information stores descriptions of their metadata in its Metadata Repository
    • Business users or data management staff to define a set of indicators for each data element that needs to be analysed or monitored, ranging from simple or advanced statistics to text strings analysis, incorporating summary data and statistical distributions
    • The production of sophisticated reports and graphs that let users gauge at a glance the level of quality of the data, and the status of the indicators that were defined

 

 



 
Powered by Joomla!