Conversation

The first step is to parse all the available metadata for each forum (which is already started in the script under the method "get_forum_data" Once we get all the metadata for all forums, then we iterate through each forum to get all threads. Then we iterate threads to get
1
the actual forum posts. So to recap: 1) Finish the get_forum_data method to add all available metadata (Chrome console is your friend here). 2) Create get_thread_data method. 3) Create get_post_data method.
1
Replying to
Have you found someone to scrape this for you yet? I'm ~1/6 of the way through the 5 million posts, assuming I didn't royally screw things up. This is what I'm collecting:
Database schema for forums and posts
Database schema for threads and users
1
3