Data Analytics Blog

Are Decision-Makers Disempowered by the Modern Data Stack?

September 23rd, 2020 by Danielle Boccelli

Future Data 2020 Many winter moons ago, I (virtually) attended Future Data 2020, a conference about the next generation of data systems. During the conference, I watched an interesting talk given by Tristan Handy, founder and CEO of Fishtown Analytics, called The Modern Data Stack: Past, Present, and Future. During the talk, Tristan discussed a so-called […]

Read More


Pie Charts

September 4th, 2020 by Danielle Boccelli

Show/Hide Code   Introduction It seems as if people are split on pie charts: either you passionately hate them, or you are indifferent. In this article, we are going to explain why pie charts are problematic and, if you fall into the latter category, what you can do when creating pie charts to avoid upsetting those […]

Read More



Useful Python Snippets

October 26th, 2019 by boxplotanalytics

The goal of this blog post is a compilation of little tidbits and code snippets that address common issues when programming for data analysis in Python. General Snippets Difference between JSON and XML This page gives a great example of the difference between data in JSON format and XML format. It shows the exact same […]

Read More


A/B Testing Example (Two Proportion Hypothesis Test)

September 25th, 2019 by boxplotanalytics

A/B testing (sometimes called split testing) is comparing two versions of a web page, email newsletter, or some other digital content to see which one performs better. A company will compare two web pages by showing the two variants (let’s call them A and B) to similar visitors at the same time. Typically, the company […]

Read More


Population Mean Hypothesis Test Example

September 25th, 2019 by boxplotanalytics

The Situation: Kakes+, a Pennsylvania company that makes terribly unhealthy small pies/cakes, believes that their machines are overfilling their blueberry pies. Kakes+ wants to test this statistically, and has recruited you to come up with a data-backed answer. The pies should weigh 8 ounces each. Step 1: Collect Data You need to weigh the pies […]

Read More


Tables & Linking Data Structures in Excel

September 1st, 2019 by boxplotanalytics

Tables are one of the most important features of Excel, but are often overlooked. Tables and keeping analyses in Excel connected, will drastically increase your efficiency in Excel. Let’s start by understanding how they work with PivotTables. We’re going to use an R Dataset called DoctorContacts. Download the .csv file using this link (and save […]

Read More


Grouping in PivotTables

September 1st, 2019 by boxplotanalytics

Grouping in PivotTables is a way of combining data to perform analyses without having to use functions. You can group numeric columns to turn them into categories, you can group date columns by date ranges to get even intervals, and you can group text columns to put together similar values. We’ll go through all three […]

Read More


Make a Bubble Plot in Excel

September 1st, 2019 by boxplotanalytics

Most people don’t know that bubble plots even exist in Excel. In this blog post, we’ll walk through how to take advantage of these very effective charts! They are great for comparing three quantitative variables at once. For a nice intro to bubble plots, check out Hans Rosling’s very famous Ted Talk. If you don’t […]

Read More


Understanding JOINs in SQL

September 1st, 2019 by boxplotanalytics

Joins in SQL Joins are one of the most important (if not THE most important) concepts in SQL. If you take the time to solidly understand how joins work, you’ll be in an excellent place for writing queries. So, let’s dive in! Join Definitions Joining tables in SQL is a way of combining them. It […]

Read More