View Single Post
  #7  
Old 06-22-2015, 01:30 AM
kato13's Avatar
kato13 kato13 is online now
Administrator
 
Join Date: May 2007
Location: Chicago, Il USA
Posts: 3,656
Send a message via ICQ to kato13
Default

In case someone wants the data here is 1990 and 2000 (either for this project or a future one)

http://www.census.gov/topics/populat...namefiles.html

http://www.census.gov/topics/populat..._surnames.html

The 1990 data has count breakdown and first and last names

the 2000 data only has last names but has a racial breakdown of the last names.

I might merge the 1990 and 2000 last name data and then see if I can find a racial breakdown of first names. The reason I quit work on this before was that names like "Suk Gonzales", "Mohammed Wong" and "Jamal Bjorkman" would occur much more frequently than in real life. I know they are possible, (Bernando O'Higgins is one of my favorite names in history,) but if I am spending my time on a tool I push myself to be perfect.

My existing code would probably work for a group of 10 (you can create a fun backstory for for an odd name combination), but the original purpose for my work was to create a database element for every member of my project (from 10k to 50k depending on which version) so I wanted something more accurate.

Last edited by kato13; 06-22-2015 at 03:58 AM.
Reply With Quote