Browse wiki

From CASRAI dictionary
Data profiling
Draft true  +
Extended definition n/a  +
Meta title csr:Data profiling  +
Original source  +
Research Data Domain true  +
Short definition The statistical analysis and assessment of
The statistical analysis and assessment of the quality of data values within a dataset for consistency, uniqueness and logic. The data profiling process cannot identify inaccurate data; it can only identify business rules violations and anomalies. The insight gained by data profiling can be used to determine how difficult it will be to use existing data for other purposes. It can also be used to provide metrics to assess data quality and help determine whether or not metadata accurately describes the source data. Profiling tools evaluate the actual content, structure and quality of the data by exploring relationships that exist between value collections both within and across datasets. For example, by examining the frequency distribution of different values for each column in a table, an analyst can gain insight into the type and use of each column. Cross-column analysis can be used to expose embedded value dependencies and inter-table analysis allows the analyst to discover overlapping value sets that represent foreign key relationships between entities. RELATED TERM. Data archeology
en entities. RELATED TERM. Data archeology  +
Type Terms  +
UUID cfa0aa99-19b6-414c-b96e-a8dfd5fbf4c6  +
Has query
"Has query" is a predefined property that represents meta information (in form of a subobject) about individual queries.
Data profiling + , Data profiling + , Data profiling + , Data profiling + , Data profiling +
Categories Terms , Drafts , Research Data Domain
Modification date
This property is a special property in this wiki.
12 August 2015 06:13:34  +
hide properties that link here 
  No properties link to this page.


Enter the name of the page to start browsing from.