This week, we have a special line-up of interesting big data news and articles. Alexandre Passant, the co-founder of Music & Data Geeks, looks at Rolling Stone’s 500 Greatest Songs of All Time and runs some “API-based data-science” to see what insights we can derive from this top 500. We also learnt about Spotify’s plans to use deep learning to improve its recommender system, a new data science course in London introduced by Rackspace, and news of Berlin’s first big data industry event!
TOP DATACONOMY ARTICLES
Sex and Drugs and Rock’n’Roll: Analysing the Lyrics of The Rolling Stone 500 Greatest Songs of All Time
The word “love” appears a staggering 48 times more often in Rolling Stone’s 500 Greatest Songs than the words “sex”, “drugs” and “rock’n’roll” combined. Compelling statistical evidence that widely acclaimed artists are a bunch of soppy fools.
Dynamic data integration incorporating data quality and master data management (MDM) assures consistency and reliability for upstream analytics and information sharing. A pragmatic approach that treats all of a company’s data as big data will facilitate integration efforts. In Part I, we stressed the importance of data quality. In this post, we focus on MDM and the connection between the two as part of data governance strategy.
In Q2, Microsoft’s cloud infrastructure revenue grew by 164%; Google lagged at only 47%. But Google have a secret weapon in their cloud portfolio, whose release may skyrocket their market share: Google Cloud Dataflow.
TOP DATACONOMY NEWS
Advanced analytics has been disrupting sports for a number of years, and now it’s extending its influence into hockey. Earlier this week, Tyler Dellow, a blogger famous for his statistical analysis of hockey players and teams, was hired by the Edmonton Oilers to consult with hockey operations.
Splunk have just announced a 33% price cut on their cloud-based operational intelligence offering, along with a guarantee of 100% uptime. They’re also offering a free online sandbox for customers who want to try out Splunk’s cloud services, and plans which can handle up to 5TB of processing power a day.
According to a recent article in The Economic Times, Microsoft is assessing the possibility of building its first data centre in India. If the software giant’s plans come to fruition, it will be the first data centre built by a multinational in the country.
MongoDB, the billion-dollar NoSQL database startup, yesterday announced the appointment of Dev Ittycheria as the new President and CEO, starting September 3. Ittycheria will replace Max Schireson, who has been CEO for 18 months and will now act as vice chairman.
It is widely believed that predicting natural phenomena (the spread of wildfire or disease, for example) is entirely possible. Yet when it comes to modelling human behaviour, many like to think there’s a certain unique, unpredictable character to humanity that cannot be pinned down by algorithms. Now three scholars, South Texas College of Law’s Josh Blackman, Michigan State’s Daniel Martin Katz, and Bommarito Consulting’s Michael Bommarito, have built a model which can predict how US Supreme Court justices will vote, with nearly 70% accuracy.
TOP DATACONOMY JOBS
The Teleport Data Scientist finds, scrapes, fuses, structures, visualizes and builds our growing pool of data that we use to power our search engine for digital nomads. We’ll never have perfect data for the entire planet, but thanks to you we’ll be able to give useful responses to location searches anyway.
Airbnb is seeking an experienced Data Engineer/ETL specialist to join the data science team. This person would contribute to the vision for data infrastructure and business intelligence tools, work with engineers and data scientists to optimize logging, and establish best practices for table schemas and data storage.
(Image Credit: Rolling Stone)