I've created an example script for parsing data from Wikipedia tables. This script parses out city / state data for the top cities in the U.S. by population.
The goal is to eventually parse out Coronavirus tweets by locations mentioned in the tweet.
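A rough sketch of that matching step, assuming the parsed table ends up as a city-to-state mapping. The `CITY_TO_STATE` sample and the `find_locations` helper are hypothetical names for illustration, not part of the script mentioned above:

```python
import re

# Hypothetical sample; the real mapping would come from the parsed Wikipedia table.
CITY_TO_STATE = {
    "New York": "New York",
    "Los Angeles": "California",
    "Chicago": "Illinois",
}

# Lowercased lookup so matches can be mapped back to the canonical spelling.
_LOOKUP = {name.lower(): name for name in CITY_TO_STATE}

# Longest names first so a longer city name wins over a shorter one it contains.
_PATTERN = re.compile(
    r"\b(?:"
    + "|".join(re.escape(n) for n in sorted(_LOOKUP, key=len, reverse=True))
    + r")\b",
    re.IGNORECASE,
)


def find_locations(tweet):
    """Return unique (city, state) pairs for known cities mentioned in the tweet."""
    found = []
    for match in _PATTERN.finditer(tweet):
        city = _LOOKUP[match.group(0).lower()]
        pair = (city, CITY_TO_STATE[city])
        if pair not in found:
            found.append(pair)
    return found
```

Plain name matching like this will miss abbreviations and nicknames and cannot tell apart cities that share a name, so it is only a starting point.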
The GeoNames dumps are great for this kind of thing. Having done this before, I'd suggest cities15000.zip with population filtering. alternateNamesV2.zip is also useful.
download.geonames.org/export/dump/re
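A minimal sketch of the population filtering suggested here, assuming the dump uses the standard GeoNames tab-separated layout with no header row (name in column 1, country code in column 8, admin1 code in column 10, population in column 14, counting from zero). The `load_cities` and `_row` helpers and the sample rows are made up for illustration:

```python
import csv

# Zero-based column positions in the GeoNames tab-separated layout (assumed).
NAME, COUNTRY, ADMIN1, POPULATION = 1, 8, 10, 14


def load_cities(lines, country="US", min_population=100_000):
    """Return (name, admin1, population) tuples, largest population first."""
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    cities = []
    for row in reader:
        if len(row) <= POPULATION or row[COUNTRY] != country:
            continue  # skip short/malformed rows and other countries
        population = int(row[POPULATION] or 0)
        if population >= min_population:
            cities.append((row[NAME], row[ADMIN1], population))
    cities.sort(key=lambda city: city[2], reverse=True)
    return cities


def _row(name, country, admin1, population):
    """Build a fake 19-field GeoNames-style line (illustrative values only)."""
    fields = [""] * 19
    fields[NAME] = name
    fields[COUNTRY] = country
    fields[ADMIN1] = admin1
    fields[POPULATION] = str(population)
    return "\t".join(fields)


SAMPLE_ROWS = [
    _row("Springfield", "US", "IL", 120000),
    _row("New York City", "US", "NY", 8000000),
    _row("Smallville", "US", "KS", 20000),
    _row("Toronto", "CA", "08", 2500000),
]

big_us_cities = load_cities(SAMPLE_ROWS)
```

With the real dump, the same function should accept an open file handle, for example `open("cities15000.txt", encoding="utf-8", newline="")` after unzipping, assuming that is the file name inside the archive.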
Concur; I've used GeoNames for similar stuff and it works well.



