Dataconomy

“Finding a relevant, trustworthy dataset can be like finding a needle in a haystack” – Interview with Satyen Sangani

by Juan Salazar
September 19, 2016
in Articles, Artificial Intelligence, Conversations

Satyen is the CEO of Alation. Before Alation, Satyen spent nearly a decade at Oracle, ultimately running the Financial Services Warehousing and Performance Management business, where he helped customers get insights out of their systems. Prior to Oracle, Satyen was an Associate with the Texas Pacific Group and an Analyst with Morgan Stanley & Co. Satyen holds a Master's from the University of Oxford and a Bachelor's from Columbia College, both in Economics.


Dataconomy: What is a data catalog?

Satyen: Much like Amazon helps users buy the right product, a data catalog helps people get the right data. A good data catalog provides rich information on all data within an organization, so members can find a relevant data set, understand what it means and where it came from, trust that it’s accurate and up-to-date, and then put it to use. A modern data catalog will leverage powerful technologies (like crawling and indexing, query log parsing, artificial intelligence, machine learning, and natural language processing), appropriately combined with crowdsourcing and expert input, to achieve both broad coverage and high quality of data knowledge. In addition to describing the data, it will also show how it’s been used in the past and ought to be used in the future.

Dataconomy: Who uses a data catalog?

Satyen: Data catalogs are used by data consumers (i.e. people who use data to make reports, models, analyses, products, or decisions) including data analysts, data scientists, statisticians, marketers, product managers, salespeople, customer support personnel, finance and operations workers, and even executives. By making data more searchable and consumable, a data catalog can broaden the data audience and make an organization more data-driven across the board.

Data curators and creators also play a role in populating and enriching the data catalog. A modern data catalog will automatically fill in lots of information, freeing humans to add differentiated value.

Dataconomy: Why do today’s data consumers need a data catalog? What’s its value?

Satyen: Today, organizations have more data than ever, so finding a relevant, trustworthy dataset can be like finding a needle in a haystack. And often, many different datasets look similar, so it’s very challenging to determine which is accurate and up-to-date. Data catalogs save data consumers time and help them deliver accurate analyses. This increases organizational trust in data and yields smarter decisions.

Dataconomy: What made you get into this market and why now?

Satyen: In the 90s, the internet was growing faster than Yahoo! could taxonomize it; then Google came along and indexed the web intelligently (leveraging implicit human signals with PageRank) so everyone could actually find useful information.

We saw a similar trend within organizations: the scale and complexity of data environments were increasing faster than the human workforces tasked with leveraging them. One of our customers has literally tens of millions of data fields and saw that number more than double in just two years, during a time when they had only hired a handful of new analysts. Storing data has been getting easier and easier, but finding it and putting it to use was actually getting harder.

It was clear that someone needed to solve the human problem with data, and to do so in an automated, scalable way that learns from people without requiring human labor. So we did.

Dataconomy: Where do you see the data catalog market going five years from now?

Satyen: We see data getting further democratized. In five years, anyone who can look at a spreadsheet or a line chart will be using self-service tools to get data, without depending on “techies.” They’ll use natural language in conversational, English-In/Answers-Out interfaces to find insights and make better decisions, much like they use Google today. A data catalog is like Google’s index of the web, a platform on which incredibly empowering apps can be built for end-users.

Image: Gene Han

Tags: Alation, surveillance, USA
