Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Glossary
    • Whitepapers
  • Newsletter
  • + More
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
  • AI
  • Tech
  • Cybersecurity
  • Finance
  • DeFi & Blockchain
  • Startups
  • Gaming
Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Glossary
    • Whitepapers
  • Newsletter
  • + More
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
Dataconomy
No Result
View All Result

“Finding a relevant, trustworthy dataset can be like finding a needle in a haystack” – Interview with Satyen Sangani

byJuan Salazar
September 19, 2016
in Articles, Artificial Intelligence, Conversations
Home Resources Articles

"Finding a relevant, trustworthy dataset can be like finding a needle in a haystack" - Interview with Satyen SanganiSatyen is the CEO of Alation. Before Alation, Satyen spent nearly a decade at Oracle, ultimately running the Financial Services Warehousing and Performance Management business where he helped customers get insights out of their systems. Prior to Oracle, Satyen was an Associate with the Texas Pacific Group and an Analyst with Morgan Stanley & Co. Satyen holds a Masters from the University of Oxford and a Bachelors from Columbia College, both in Economics.


Dataconomy: What is a data catalog?

Satyen: Much like Amazon helps users buy the right product, a data catalog helps people get the right data. A good data catalog provides rich information on all data within an organization, so members can find a relevant data set, understand what it means and where it came from, trust that it’s accurate and up-to-date, and then put it to use. A modern data catalog will leverage powerful technologies—like crawling and indexing, query log parsing, artificial intelligence, machine learning, and natural language processing—appropriately combined with crowd-sourcing and expert-input, to achieve both broad coverage and high quality of data knowledge. In addition to describing the data, it will also show how it’s been used in the past and ought to be used in the future.

Dataconomy: Who uses a data catalog?

Satyen: Data catalogs are used by data consumers (i.e. people who use data to make reports, models, analyses, products, or decisions) including data analysts, data scientists, statisticians, marketers, product managers, salespeople, customer support personnel, finance and operations workers, and even executives. By making data more searchable and consumable, a data catalog can broaden the data audience and make an organization more data-driven across the board.

Stay Ahead of the Curve!

Don't miss out on the latest insights, trends, and analysis in the world of data, technology, and startups. Subscribe to our newsletter and get exclusive content delivered straight to your inbox.

Data curators and creators also play a role in populating and enriching the data catalog. A modern data catalog will automatically fill in lots of information, freeing humans to add differentiated value.

Dataconomy: Why do today’s data consumers need a data catalog? What’s its value?

Satyen: Today, organizations have more data than ever, so finding a relevant, trustworthy dataset can be like finding a needle in a haystack. And often, many different datasets look similar, so it’s very challenging to determine which is accurate and up-to-date. Data catalogs save data consumers time and help them deliver accurate analyses. This increases organizational trust in data and yields smarter decisions.

Dataconomy: What made you get into this market and why now?

Satyen: In the 90s, the internet was growing faster than Yahoo! could taxonomize it; then Google came along and indexed the web intelligently (leveraging implicit human signals with PageRank) so everyone could actually find useful information.

We saw a similar trend within organizations, that the scale and complexity of data environments was increasing faster than the human workforces tasked with leveraging them. One of our customers has literally tens of millions of data fields and saw that number more than double in just two years, during a time where they had only hired a handful of new analysts. Storing data has been getting easier and easier, but finding it and putting it to use was actually getting harder.

It was clear that someone needed to solve the human problem with data, and to do so in an automated, scalable way that learns from people without requiring human labor. So we did.

Dataconomy: Where do you see the data catalog market going in five years from now?

Satyen: We see data getting further democratized. In five years, anyone who can look at a spreadsheet or a line chart will be using self-service tools to get data, without depending on “techies.” They’ll use natural language in conversational, English-In/Answers-Out interfaces to find insights and make better decisions, much like they use Google today. A data catalog is like Google’s index of the web, a platform on which incredibly empowering apps can be built for end-users.

Like this article? Subscribe to our weekly newsletter to never miss out!

Follow @DataconomyMedia

Image: Gene Han

Tags: AlationsurveillanceUSA

Related Posts

AI chatbots spread false info in 1 of 3 responses

AI chatbots spread false info in 1 of 3 responses

September 5, 2025
OpenAI to mass produce custom AI chip with Broadcom in 2025

OpenAI to mass produce custom AI chip with Broadcom in 2025

September 5, 2025

Digital inheritance technology by Glenn Devitt addresses the $19T asset transfer problem

September 5, 2025
Deepmind finds RAG limit with fixed-size embeddings

Deepmind finds RAG limit with fixed-size embeddings

September 5, 2025
TCL QM9K integrates Gemini with presence detection

TCL QM9K integrates Gemini with presence detection

September 5, 2025
LunaLock ransomware hits artists/clients with AI training threat

LunaLock ransomware hits artists/clients with AI training threat

September 5, 2025
Please login to join discussion

LATEST NEWS

Texas Attorney General files lawsuit over the PowerSchool data breach

iPhone 17 Pro is expected to arrive with 48mp telephoto, variable aperture expected

AI chatbots spread false info in 1 of 3 responses

OpenAI to mass produce custom AI chip with Broadcom in 2025

When two Mark Zuckerbergs collide

Deepmind finds RAG limit with fixed-size embeddings

Dataconomy

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

  • About
  • Imprint
  • Contact
  • Legal & Privacy

Follow Us

  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Glossary
    • Whitepapers
  • Newsletter
  • + More
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
No Result
View All Result
Subscribe

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy Policy.