## Data Science Resource Articles

### Three Mistakes that Set Data Scientists up for Failure

The rise of the data scientists continues and social media is filled with success stories – but what about those who fail? There are no cover articles praising the failures of the many data scientists that don’t live up to the hype and don’t meet the needs of their stakeholders.

### Big Data 101: Intro To Probabilistic Data Structures

Oftentimes while analyzing big data we have a need to make checks on pieces of data like number of items in the dataset, number of unique items, and their occurrence frequency. Hash tables or Hash sets are usually employed for this purpose. But when the dataset becomes so enormous that

### Programming with R – How to Get a Frequency Table of a Categorical Variable as a Data Frame

Categorical data is a kind of data which has a predefined set of values. Taking “Child”, “Adult” or “Senior” instead of keeping the age of a person to be a number is one such example of using age as categorical. However, before using categorical data, one must know about various

### Data Science vs. Data Analytics – Why Does It Matter?

Data Science, Data Analytics, Data Everywhere Jargon can be downright intimidating and seemingly impenetrable to the uninformed. While complicated vernacular is an unfortunate side effect of the similarly complicated world of machines, those involved in computers, data and whole host of other tech-intensive sectors don’t do themselves any favors with

### If you care about Big Data, you care about Stream Processing

As the scale of data grows across organizations with terabytes and petabytes coming into systems every day, running ad hoc queries across the entire dataset to generate important metrics and intelligence is no longer feasible. Once the quantum of data crosses a threshold, even simple questions such as what is

### The Problem With (Statistical) False Friends

I recently stumbled across a research paper, Using Deep Learning and Google Street View to Estimate the Demographic Makeup of the US, which piqued my interest in derivative uses of data, an ongoing research interest of mine. A variety of deep learning techniques were used to draw conclusions about relationships

An easy but effective overview of big data. For doing a full fledged course on BIG DATA please visit : http://insergotechnologies.com/training.html

Great article.. After reading this article i learnt new and useful information about apache samza from this article which helpful to develop my hadoop skills more.. thank you for sharing…

Like to work with temenos on one of our projects (https://www.standfore.com/) and have only good emotions!

Its really Helpful To Every One i Prefer This Blog to Every one in Data Science Field

ahaha...! awesome. I just use the old fashion cartesian calculation https://www.studypug.com/algebra-help/linear-functions/linear-function-distance-formula.