Extracting date from a Wiki page
I need some guidance.
I have a personal page, built with Astro Framework.
Then there's a wiki page that has tables and data.
What is the best approach for me to collect some of this data of this wiki page into a JSON, and from this JSON generate a JS that displays this information?
The information would basically request the data from the wiki page.
8 Replies
I mean if you have control of the wiki you could just make and expose the .json
If not you will have to "scrape" it
Any good scraping site?
Or recommended?
Playwright is pretty good I hear
double bonus you can learn it to test with too xD
Puppeteer is the other big one
I will check the documentation.
Yea sorry I don't have any resources I know are good (I have not dove into scraping myself yet so limited know)
If you find something good drop it in #resources maybe. Its kinda a touchy topic too, legality/morality issues around scraping content.
This would be for public information, as it's a wiki.
But if it's too complicated, I will try another approach.
Well, I tried the scrapping thing, but not today LOL
I just created the JSON files manually.