YARN

Big Data’s Potential For Disruptive Innovation
An innovation that creates a new value network and market, and disrupts an existing market and value network by displacing the leading, highly established alliances, products and firms is known as Disruptive Innovation. Clayton M. Christensen and his coworkers defined and analyzed this phenomenon in the year 1995. But, every

Using Kafka and YARN for Stream Analytics on Hadoop
Understanding Big Data: Stream Analytics and YARN Real-time stream processing is growing in importance, as businesses need to be able to react faster to events as they that occur. Data that is valuable now may be worthless a few hours later. Use cases include sentiment analysis, monitoring and anomaly detection.

How Flink Became an Apache Top-Level Project
A multi-coloured squirrel may not seem like the most obvious choice of logo for a data processing technology; then again, the team behind Apache Flink have hardly done things by the book. What start out as a University research project evolved into a fully-fledged company, complete with artfully-decapitalised name (data

Apache Samza Levels Up to a Top-Level Project; Witnesses Large Scale Adoption
Apache Samza, the distributed stream processing framework, has now graduated to become a Top-Level Project (TLP), the Apache Software Foundation has revealed. “The incubation process at Apache has been great. It has helped us cultivate a strong community, and provided us with the support and infrastructure to make Samza grow,”

Hortonworks’ Comprehensive Certification Program for Enterprise Hadoop Expands Domain with Latest Additions
Enterprise Apache Hadoop providers Hortonworks revealed earlier last month the expansion of the Hortonworks Certified Technology Program to include certification for ‘key capabilities of operations, security and governance focused tools and applications supporting the growth of enterprise Hadoop and ecosystem integration. This comes as a followup to the introduction of

Dating Website eHarmony gets an IT Overhaul
eHarrmony is set to strengthen its technological base using Hadoop, Spark, Docker and possibly OpenStack. Company CTO Thod Nguyen says eHarmony is trying to evolve into a company that’s able to innovate on the IT front, as well as the dimensions-of-compatibility front. In conversation with Gigaom, Nguyen expressed that “A big

MapR’s Latest Distribution Gives a Flavour of Apache Drill, With New Developer Pre-Release
Ushering in the next-generation ANSI SQL to Hadoop, MapR has added a developer pre-release of Apache Drill, coined Drill 0.5, to its latest release. MapR, one of the leaders in enterprise Hadoop technology for big data deployments, has made available the 4.0.1 version of the MapR distribution including Apache Drill

Hadoop: The Components You Need to Know
Follow @DataconomyMedia It’s been suggested that “Hadoop” has become a buzzword, much like the broader signifier “big data”, and I’m inclined to agree. It could certainly be seen to fit Dan Ariely’s analogy of “Big data” being like teenage sex: “everyone talks about it, nobody really knows how to do