Applied Statistics

Pareto Charts in R

A Pareto Chart is a sorted bar chart that displays the frequency (or count) of occurrences that fall in different categories, from greatest frequency on the left to least frequency on the right, with an overlaid line chart that plots the cumulative percentage of occurrences. The vertical axis on the left of the chart shows frequency (or count), and the vertical axis on the right of the chart shows the cumulative percentage. A Pareto Chart is typically used to visualize:

  • Primary types or sources of defects
  • Most frequent reasons for customer complaints
  • Amount of some variable (e.g. money, energy usage, time) that can be attributed to or classified according to a certain category

The Pareto Chart is typically used to separate the “vital few” from the “trivial many” using the Pareto principle, also called the 80/20 Rule, which asserts that approximately 80% of effects come from 20% of causes for many systems. Pareto analysis can thus be used to find, for example, the most critical types or sources of defects, the most common complaints that customers have, or the most essential categories within which to focus problem-solving efforts.

To find out how to implement a Pareto Chart in R, download my PDF at http://nicoleradziwill.com/r/pareto-charts-75-925-copy.pdf —>

2 replies »

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s