Search for:
Category

Data Science

Category

What is an outlier? In short, it’s a data point that is significantly different from other data points in a data set. The long story? There isn’t a strong mathematical definition for what is or isn’t an outlier. In the end, detecting and handling outliers is often a somewhat subjective exercise. So how can you dive into a new data…

Last week, we analyzed data on 475 coups around the world from the past 66 years and learned that another coup in Turkey wouldn’t be shocking. This week, we turn our attention to the economic impact of coups. Geopolitics and economics tend to have a messy, confusing relationship. And where there is confusion, there’s often only one way of getting…

On July 15, 2016, a group within the Turkish Armed Forces tried to seize control of the Turkish government. The attempted coup ended a day later when forces loyal to President Recep Tayyip Erdoğan beat back the dissidents. Predictably, there was an explosion of news articles and commentary about the coup, its implications for Turkey, and how this coup compared to previous coups. However, we noticed something odd in these articles.…

Happy Republic Day! Republic Day honors the date that the Constitution of India was ratified, replacing the Government of India Act (1935) as India’s governing document. To celebrate Republic Day in true Atlan style, we combed through data sources and the Constitution itself to investigate what the data shows about the Indian constitution. Here’s what we found. India’s Constitution Was…

Cross tabulation is a method to quantitatively analyze the relationship between multiple variables. Also known as contingency tables or cross tabs, cross tabulation groups variables to understand the correlation between different variables. It also shows how correlations change from one variable grouping to another. It is usually used in statistical analysis to find patterns, trends, and probabilities within raw data. When…

India is currently divided into 36 administrative states and union territories, which are further divided into districts and more granular units. Districts in India are the third geographic layer for data dissemination after national and state-level setups. Many agencies — such as the Census of India, Reserve Bank of India, Ministry of Health and Family Welfare, Directorate of Economics and…