python

Linear Regression Implementation in Python
In this post, we’re going to get our hands dirty with code- but before we do, let me introduce the example problems we’re going to solve today. 1) Predicting House Prices We want to predict the values of particular houses, based on the square footage. 2)Predicting Which TV Show Will

Data Mining Tops LinkedIn’s List of the Hottest Skills in 2014
The list for 25 Hottest Skills of 2014 that got people hired according to the business networking service provider, LinkedIn, came out mid-December last year, and there has been a significant shuffling since 2013. What came out on top as the number one on this list was Statistical Analysis and

Python Packages For Data Mining
Just because you have a “hammer”, doesn’t mean that every problem you come across will be a “nail”. The intelligent key thing is when you use the same hammer to solve what ever problem you came across. Like the same way when we indented to solve a datamining problem we

Top 10 Data Science Skills, and How to Learn Them
One of most popular posts this year came from Ferris Jumah, a data scientist at LinkedIn, who mapped the most popular skills of data scientists by scraping LinkedIn profile data. One of the common comments amongst data scientists who came across this post- as with most of our posts focused

Three Key New Features from Aerospike’s Extensive Upgrade
Back in August, Aerospike announced they were open-sourcing their signature platform. At the beginning of December, they were back again with news of a record-breaking Google Compute Engine Benchmark. Now, to round off what has been an exceptional year for the flash-optimised, in-memory NoSQL database, they’ve released a whole raft

Top 14 Big Data Books of 2014
2014 has been a huge year in big data- and big data publishing. Viktor Mayer-Schoenberger and Kenneth Cukier re-published and added an extra chapter to their bestselling “Big Data”; Nate Silver graced the publishing world with his presence once more with the Best American Infographics of 2014. We’ve compiled a

Which Environment to Choose for Data Science?
Summary: In the past, R seemed like the obvious choice for Data Science projects. This article highlights some of the issues, such as performance and licensing, and then illustrates why Python with its eco-system of dedicated modules like Scikit-learn, Pandas and others has quickly become the rising star amongst Data

The One Language A Data Scientist Must Master
When business leaders read about (and tackle) Big Data, there is a lot to take in. The field is developing so dynamically that many of the industry buzzwords will not have existed until a few short years ago. Just a short list of some programming languages is enough to make

IEEE Ranks Programming Languages, Java Comes Out on Top
IEEE, the “world’s largest professional association for the language of technology”, have released an interactive ranking of programming languages. In the overall list, as well as many of the sub-rankings, Java emerges as the victor. The IEEE blog details how they went about creating their definitive rankings. “Starting from a