In this edition of the Humans of Data newsletter, we cover two incredible use cases of data science spanning two continents.

This is followed by our top reads on data science hacks, how Uber cracked the code to Apache Spark, and the differences between the various fields in data science, explained in the simplest possible way.

Oh, and upcoming events that you should consider attending.

Without further ado, let’s dig in, shall we?


72 years of Indian independence

How India used big data to jumpstart its economy in the 1950s

Christine Garcia

Did you know that the Indian government started using big data over half a century ago, long before big data was even a thing? And that big data had a role to play in accelerating India’s economy after independence?

To commemorate 72 years of Indian independence, we’ve put together an intriguing story of big data and its role in building a nation, featuring P.C. Mahalanobis and his fellow humans of data in the government and the Indian Statistical Institute (ISI).

Read on to find out more about this incredible use case of big data.

Our top reads for you

The most powerful idea in data science

Cassie Kozyrkov

From rabbits in clouds to Elvis’s face in a potato chip, humans find patterns in everything. The key to data science, however, isn’t finding patterns. It’s finding relevant patterns that can help you draw useful insights.

How do you separate the red herrings from useful patterns? It’s one of the toughest problems that data scientists have to grapple with every day. 

But don’t worry, there’s a solution. Read on to find out about the most powerful idea in data science.

Making Apache Spark effortless for all of Uber

Abhishek Modi and Adam Hudson

Apache Spark is critical to Uber’s big data infrastructure. Uber runs 100,000 Spark applications every day, which help its engineering teams build and operate services at scale.

However, with great scale comes great complexity! From providing reliable access to data sources to constantly communicating ever-growing infrastructural changes to the entire team, the problems are manifold. 

Read on to find out how Uber makes it easier for its teams to work with Spark.

Data science vs machine learning vs data analytics vs business analytics

Iliya Valchanov

While data science is one of the most popular buzzwords of 2019, it’s also too broad and too vague a term. What’s more, the definitions of the various fields in data science, such as AI, data analytics or business analytics, are too complex.

Are you facing a similar issue? Fret not. Here’s an article that defines the various fields in data science in the simplest possible terms. Read on.

I wasn’t getting hired as a data scientist. So I sought data on who is.

Hanif Samad

Like most aspiring data science professionals, we’re sure you often wonder what it takes to get a job as a data scientist. Now, stop wondering.

Hanif built a scraper to extract data from the LinkedIn profiles of people who work as data scientists. Using this data, he was able to pinpoint the traits that currently employed data scientists most commonly share.

Read on to find out who actually gets hired as a data scientist.

How the ancient Egyptians built the original skyscrapers with data

Ayswarrya G

Once upon a time in ancient Egypt, the humans of data built the tallest structure known to the ancient world using only ramps, ropes, levers and sheer muscle power. We’re talking about the Great Pyramid of Giza: 481 feet (146.6 meters), or 280 Egyptian royal cubits, of breathtaking stonework.

Pyramids of Giza

How did the ancient Egyptians build one of the most resilient structures humans have ever known, in an era when they hadn’t even discovered the wheel, the pulley or iron tools?

How did they calculate the exact angles of the edges of the pyramid, align it to true north, and transport massive blocks of limestone and granite across a vast sandy desert?

Read on to find out the answers to all these questions, and more. 

P.S. Spoiler alert: Data had an important role to play. But that’s all we’re saying!

Save the dates

PyData Delhi | September 7, 2019 | Atlan office, New Delhi

If Python is a part of your everyday life at work, then you cannot miss out on this event! 

PyData meetups bring together the entire community of developers, analysts and scientists to discuss data science tools using Python, R and Julia, as well as best practices and challenges in data management, data science and visualization.

Interested? To find out more details, write to us at [email protected].

Rootconf | September 21, 2019 | YMCA, Pune

Do you work with DevOps? Here’s an opportunity you shouldn’t miss out on! 

Rootconf is an annual conference on DevOps and cloud infrastructure by HasGeek to discuss issues around security, network management and distributed systems. Attend turbocharged talks from Red Hat, Hotstar, Flipkart, Appsecco and more!

Interested? Register here.


That’s all, folks! Liked this newsletter? Awesome! Please share your feedback with us in the comments section below.

Stay tuned for more such content by subscribing to our fortnightly data newsletter.
