Every name is assigned a unique rank based first on popularity and then on alphabetical order. The most popular name is given the 1st rank, where popularity is determined by the number of occurrences of the name in the MyNameStats.com database. Names with the same number of occurrences are ranked alphabetically, from A (ranked first) to Z. No two names will have the same rank.
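The ranking rule above can be sketched in a few lines of Python. This is only an illustration of the rule as described, not the site's actual code; the function name `rank_names` and the sample counts are hypothetical.

```python
def rank_names(counts):
    """Return {name: rank}, with rank 1 for the most frequent name.

    Names with equal counts are ordered alphabetically (A first),
    so every name receives a distinct rank.
    """
    # Sort by descending count, then by name in ascending alphabetical order.
    ordered = sorted(counts, key=lambda name: (-counts[name], name))
    return {name: position for position, name in enumerate(ordered, start=1)}


# Hypothetical occurrence counts, for illustration only.
sample_counts = {"John": 500, "Mary": 300, "Barbara": 300, "Johandra": 2}
ranks = rank_names(sample_counts)
print(ranks)
```

In this made-up sample Barbara and Mary have the same count, so Barbara is ranked ahead of Mary alphabetically.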
This shows the percentage of names that are less popular than the specified name. For example, the first name John is the most popular, so all the other names are less popular: 100% of the names are less popular, and the percentile rank for John is 1.00. Now take a name such as Johandra, which is far less popular than John. It has an estimated population of 23 and a percentile rank of 0.62, which means that Johandra is more popular than 62% of all names.
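One way to compute such a percentile rank from a unique rank is sketched below. The exact formula used by the site is not stated, so this version, which divides the number of lower-ranked names by the number of all other names, is an assumption; the function name and the database size of 1,000 names are hypothetical.

```python
def percentile_rank(rank, total_names):
    """Fraction of the other names that rank below the given name (0.0 to 1.0)."""
    if total_names < 2:
        raise ValueError("need at least two names to compare")
    if not 1 <= rank <= total_names:
        raise ValueError("rank must be between 1 and total_names")
    # Names ranked below this one, divided by all the other names.
    return (total_names - rank) / (total_names - 1)


# Hypothetical database of 1,000 names.
print(f"{percentile_rank(1, 1000):.2f}")    # top-ranked name: 1.00
print(f"{percentile_rank(381, 1000):.2f}")  # ranked 381st of 1,000: 0.62
```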
Why use percentile rank instead of a simple rank (ie:
The national rank is interesting when it comes to the most popular names, such as John, James, Barbara and Mary. One can see that Mary is ranked 6 and Barbara is ranked 10. These names all have a percentile rank of 1, or very close to 1, so the percentile does not tell us much. As names move down in rank, the national rank becomes less meaningful. Take a name like Justine: it is ranked nationally at 1,289 and, surprisingly, it also has a percentile rank of 1.00 (actually it is slightly less than 1, but due to rounding it displays as 1.00). So we can conclude that although there is a difference in popularity between Mary and Justine, the difference is relatively small when compared to the very large number of less popular names (99%).
Another great thing about percentile rank is that it allows us to compare different distributions. For example, compare the popularity of the name Johannes as a first name to the same name as a last name. The national rank for Johannes as a first name is 3,469 and as a last name it is 8,278, so it looks like there is a sizable difference. This may lead us to conclude that Johannes is used more as a first name. But national rank is not a good measure here, as there are far more last names than first names. Let's compare the percentile ranks instead: as a first name it is 0.99 and as a last name it is also 0.99, so we conclude that the name is about equally popular as a first or last name.
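The Johannes comparison can be illustrated with a small calculation. The two ranks come from the text; the total numbers of first and last names are not given, so the totals below are hypothetical values chosen only to show how very different ranks can map to the same rounded percentile. The formula itself is also an assumption.

```python
def percentile_rank(rank, total_names):
    """Fraction of the other names that rank below the given name."""
    return (total_names - rank) / (total_names - 1)


# Ranks are from the text; both totals are hypothetical placeholders.
TOTAL_FIRST_NAMES = 400_000
TOTAL_LAST_NAMES = 900_000

as_first = percentile_rank(3_469, TOTAL_FIRST_NAMES)
as_last = percentile_rank(8_278, TOTAL_LAST_NAMES)

print(f"first name: {as_first:.2f}, last name: {as_last:.2f}")
```

With these assumed totals both values round to 0.99, even though the raw ranks differ by a factor of more than two.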
To estimate the population, a sample proportion is calculated using data from the MyNameStats.com database and multiplied by the US population estimate provided by the US Census Bureau for July 2017. An estimation range is provided to show the degree of certainty of the population estimate. Popular names such as John have a relatively high degree of certainty, resulting in a narrow range on the order of less than 1%, whereas less common names have a lower degree of certainty and a wider range. The estimation ranges are calculated using a confidence interval of 95%.
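The estimate and its range might be computed along the following lines. The text does not say which interval method is used, so the normal-approximation (Wald) interval here is an assumption, as are the function names, the sample size, the name counts, and the rounded population figure of 325.7 million.

```python
import math


def estimate_population(name_count, sample_size, total_population, z=1.96):
    """Return (estimate, low, high) for the number of people with a name.

    Uses a normal-approximation interval for a sample proportion;
    z=1.96 corresponds to a 95% confidence level.
    """
    if sample_size <= 0 or not 0 <= name_count <= sample_size:
        raise ValueError("name_count must be between 0 and sample_size")
    p = name_count / sample_size
    # Standard error of a sample proportion.
    se = math.sqrt(p * (1 - p) / sample_size)
    low = max(0.0, p - z * se)
    high = min(1.0, p + z * se)
    return p * total_population, low * total_population, high * total_population


def relative_half_width(estimate, low, high):
    """Half the width of the range, as a fraction of the estimate."""
    return (high - low) / 2 / estimate


US_POPULATION = 325_700_000  # approximate figure, used here as a placeholder
SAMPLE_SIZE = 10_000_000     # hypothetical number of records in the sample

common = estimate_population(166_000, SAMPLE_SIZE, US_POPULATION)  # hypothetical count
rare = estimate_population(7, SAMPLE_SIZE, US_POPULATION)          # hypothetical count

print(f"common name range: +/- {relative_half_width(*common):.2%}")
print(f"rare name range:   +/- {relative_half_width(*rare):.2%}")
```

The normal approximation is known to behave poorly for very rare names; a Wilson score interval is a common alternative in that case.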
This is the percentage of people with the specific name, multiplied by one thousand. Since relatively few people have any one name, the percentage is almost always less than 1%. For example, the most popular first name, John, is held by only 1.66% of the population, and Elizabeth, the 20th most popular name, is held by about 0.5% of the population. Multiplying these numbers by 1,000 makes the figures a little easier to read and compare: John's proportion per 100k is 1,662 and Elizabeth's is 486.
Additionally, proportion per 100k is a standardized measure of popularity that allows users to compare the frequency of occurrence without the bias of population size. This is most relevant when comparing names among states. States with large populations will often have more people with any given name than a smaller state, even though the concentration of people with that name may be higher in the smaller state. Take the name John as an example: there are 538k people named John in California and only 316k in Pennsylvania, yet there are 2,468 Johns for every 100k Pennsylvanians and only 1,386 Johns for every 100k Californians.
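The arithmetic behind proportion per 100k is sketched below. The name counts are the rounded figures from the text; the two state populations are not given in the text and are approximations backed out from those figures, so treat them as assumptions. The function names are hypothetical.

```python
def per_100k(name_count, population):
    """Number of people with a name per 100,000 residents."""
    if population <= 0:
        raise ValueError("population must be positive")
    return name_count / population * 100_000


def percent_to_per_100k(percent):
    """Convert a percentage of the population to a per-100k figure."""
    return percent * 1_000


# Approximate state populations (assumed, inferred from the figures in the text).
CALIFORNIA_POPULATION = 38_800_000
PENNSYLVANIA_POPULATION = 12_800_000

california = per_100k(538_000, CALIFORNIA_POPULATION)
pennsylvania = per_100k(316_000, PENNSYLVANIA_POPULATION)

print(f"1.66% of the population is {percent_to_per_100k(1.66):.0f} per 100k")
print(f"California:   {california:.0f} Johns per 100k")
print(f"Pennsylvania: {pennsylvania:.0f} Johns per 100k")
```

Because the inputs are rounded, the results land within a few units of the published 1,386 and 2,468 rather than matching them exactly.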
SSA Baby Names Population and Proportion /100k
We have taken the last 100 years of baby name data that have been published annually by the Social Security Administration (SSA), and we have calculated the population and proportion per 100k statistics for each baby name.
Each year the SSA releases a list of baby names. The names are taken from Social Security card applications for births that occurred in that year. The published data include a count of every name that appeared at least five times in that year, so the list excludes the less popular names. Each list only provides a snapshot for its year, and there is no way of knowing whether any of the people born in the US have since left or died. Over a 100-year period, almost everybody who appeared in the first year has since died. See the SSA website for all the details.
We have taken this data, aggregated it and used life cohort tables (actuarial tables used to estimate the remaining life span of a person of a given age) provided by the Social Security Administration to statistically estimate the size of the population of living people with these baby names.
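The aggregation described above can be sketched as follows: for each birth year, the number of babies given a name is multiplied by the probability that someone born in that year is still alive, and the results are summed. The function name, the birth counts and the survival probabilities are all hypothetical; real SSA tables also differ by sex, which this simplified sketch ignores.

```python
def estimate_living(births_by_year, survival_to_present):
    """Estimate how many people with a name are still alive.

    births_by_year: {birth_year: number of babies given the name}
    survival_to_present: {birth_year: probability of being alive today}
    """
    living = 0.0
    for birth_year, births in births_by_year.items():
        # Birth years missing from the table are assumed to have no survivors.
        living += births * survival_to_present.get(birth_year, 0.0)
    return living


# Hypothetical inputs, for illustration only.
births = {1920: 1000, 1970: 2000, 2010: 500}
survival = {1920: 0.02, 1970: 0.95, 2010: 0.99}

living_estimate = estimate_living(births, survival)
print(round(living_estimate))
```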
SSA data has some shortcomings, such as those explained above and the fact that it does not account for the effect of immigration. Nevertheless, it provides a consistent, reliable and readily available source of data. We show it here to provide a benchmark against which one can compare statistics from MyNameStats.com's own data.
First vs. Last
A ratio that compares the occurrences of the name used as a first name versus a last name. This metric is calculated from the MyNameStats.com data. The vertical bar indicates the relative value. The further the bar is to the left, the more the name is used as a first name.
Girls vs. Boys
A ratio that compares the occurrences of a first name used as a girl's name versus a boy's name. This metric is calculated from the Social Security Administration (SSA) data. The vertical bar indicates the relative value. The further the bar is to the left, the more the name is used as a girl's name. This metric is not shown for less common names where SSA data for the name is not available; in that case no vertical bar is shown.
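Both ratio bars described above could be positioned with a calculation like the one below. The exact formula is not given, so expressing the bar position as the right-hand category's share of the total is an assumption, and the function name and counts are hypothetical.

```python
def bar_position(left_count, right_count):
    """Return the bar position from 0.0 (far left) to 1.0 (far right).

    left_count is the first-name (or girl) count and right_count is the
    last-name (or boy) count. Returns None when no data is available,
    in which case no bar would be drawn.
    """
    if left_count is None or right_count is None:
        return None
    total = left_count + right_count
    if total == 0:
        return None
    # The more the left-hand usage dominates, the closer the bar is to 0.0.
    return right_count / total


# Hypothetical counts: used 900 times as a first name, 100 times as a last name.
print(bar_position(900, 100))
# No SSA data available for a rare name.
print(bar_position(None, None))
```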