Metadata

Curate your big data to unleash its power
Data curation is the active management of data throughout its lifecycle of interest and usefulness. The lifespan of data is determined by how long analysts and researchers are interested in it, which means as long as it can be reused to create more value. What is data curation? The process

How to make data lakes reliable
Data professionals across industries recognize they must effectively harness data for their businesses to innovate and gain competitive advantage. High quality, reliable data forms the backbone for all successful data endeavors, from reporting and analytics to machine learning. Delta Lake is an open-source storage layer that solves many concerns around data

What is Metadata and why is it as important as the data itself?
Metadata. You may have heard the term before, and may have asked yourself either “what is metadata” or “why is it as important as data?” This article will be an attempt to clear up those two subjects. As this can often be quite dense, let’s jump right in! Metadata can

The Data Lake: A Reservoir or a Swamp? It Depends on Your Approach
Data lakes are based on a simple idea: You can store and analyze massive amounts of raw data at scale. But why data lakes? Here are five reasons why IT leaders are excited about this idea: Unfortunately, lots of companies end up with a data swamp instead of a data

Four Steps for Building a Successful Enterprise Metadata Catalog
With the fast-growing interest in data lakes — a storage solution that allows structured and semi-structured data to live in the same place — attention is turning toward metadata as a way to organize large amounts of diverse enterprise data. Metadata is an ambiguous and generic term, but it most