Ray Li
Author Archive

Ray Li is a software engineer and data enthusiast who has been blogging at rayli.net for over a decade. You’ll usually find him wrangling data, programming and lifehacking. His latest project can be found at hackerbits.com.

Data Science

Where do I get started with data-driven engineering? How can the 3 I’s of data-driven engineering help me get off to a running start? How can I avoid the common pitfalls of data-driven engineering? What are the 3 I’s? The 3 I’s of data-driven engineering are insights, indicators and investments.

Data Science 101

Data mining is everywhere, but its story starts many years before Moneyball and Edward Snowden. The following are major milestones and “firsts” in the history of data mining plus how it’s evolved and blended with data science and big data. Data mining is the computational process of exploring and uncovering patterns

Data Science 101

1. C4.5 What does it do? C4.5 constructs a classifier in the form of a decision tree. In order to do this, C4.5 is given a set of data representing things that are already classified. Wait, what’s a classifier? A classifier is a tool in data mining that takes a

Data Science 101, Understanding Big Data

Highly effective data analysis isn’t learned overnight, but it can be learned faster. Here are 7 habits of data analysis I wish someone had told me about, for effectively incorporating, communicating and investing in data analysis on an engineering team. 1. Value simplicity of analysis over fancy algorithms If you can’t
