Enterprise data catalog (EDC) assists enterprises to have a all-inclusive view of metadata , with a single tool it assists in analyzing and understanding large volumes of metadata in the enterprise. When working with EDC we will often be hearing two phrases catalog and Asset. EDC helps in extracting physical ,technical ,business and operational metadata from their enterprise systems.
To arrange all this information Enterprise Data Catalog maintains a centralized repository that stores all the metadata extracted from different external sources, in EDC terminology we can call this centralized repository as catalog. To allow easier finding of data catalog maintains an indexed inventory of all the assets in an enterprise. Assets represent the data objects who information reside in catalog, such as tables, columns, reports, views, and schemas. Apart from basic information on data asset, Catalog also serves with showing basic metadata and statistical information about data assets like profile results, data domains, and data relationships. EDC enables the following functionalities for an enterprise:
Self service analytics
Informatica Enterprise data Catalog intelligently discovers many types of data and their relationships across the enterprise. Pre-built scanners collect metadata from databases, data warehouses, applications, cloud data stores, BI tools, Hadoop and NoSQL, and more. All the metadata is indexed and cataloged in a highly scalable graph database architected for fast updates, smart search, and fast queries. As more and more data is created and propagated throughout the enterprise, similar and duplicate data sets inevitably arise. Informatica Enterprise data Catalog leverages advanced statistical and machine learning algorithms to discover similar data and subsets of data, helping users find the most elevant and trusted data they need.
Asset management- Lineage and impact analysis
Interactively trace data origin through business-friendly summarized lineage views that highlight the end points and not all the complex details in between. A drill-down lineage view expands any lineage path to show columns and lineage diagram metrics. Users can perform detailed impact analysis on upstream and downstream data assets.
The classic saying, “You can’t manage what you can’t measure” is true when it comes to managing data assets. To get the most value from data, you need to understand what you have, where it came from, how it has changed, and what level of trust you have in the data. Informatica Enterprise Catalog answers all these questions and more with complete end-to-end summary and detail lineage, profiling statistics, and 360-degree relationship views, providing a clear picture of your data.
Discovering data based on semantics
Trying to find the data you need across hundreds of enterprise systems may sometimes seem futile. Only through powerful semantic search built on comprehensive metadata services and a scalable infrastructure can one even hope to find relevant data. Informatica Enterprise data Catalog delivers semantic search with intelligent facets to further refine search results. Because Informatica uniquely associates business, technical, and operational metadata, business users can search on business terms to find their data and then browse 360-degree relationship views to find related data assets.
Automatically classify and identify domains and entities such as customer, product, order etc. across all structured and unstructured data assets at the field, column and table level. This is a crucial step in the ability for companies to catalog, govern, and extract value from their data assets. This classified data enables better search, filtering of search results and Axon recommendations. Informatica provides over 60 packaged data domains such as email, credit card number, social security number, country, city, URL, and company name. Users can add their own custom domains too. Data assets can be classified using data rules (i.e., columns with data that matches specific logic defined in the rule) or column name rules (i.e., Finds columns that match column name logic defined in the rule).
Data Governance- Linking business terms and technical assets
Informatica Enterprise Catalog includes integration with Axon Data Governance that provides a central place to define and manage the lifecycle of business terms, definitions, associated reference data, related terms, links, ad hoc documentation, and notes. Axon allows business and IT stewards to collaboratively manage business metadata that includes efficient human workflow automation. Associate business terms with the right technical metadata and Informatica Enterprise data Catalog will even recommend term associations. Axon assets such as terms, policies, and classifications can be easily imported from Informatica Axon and third party tools.