The Doctor ✅ is a user on mastodon.social. You can follow them or interact with them if you have an account anywhere in the fediverse. If you don't, you can sign up here.

oh, maybe you all would know this.

any tips for scraping a #wordpress-based site for the urls of all posts by a particular author? I tried a few combinations of lynx -dump, wget, & grep but don't know enough about any of them.

i.e. https://site.tld/author/authorsname, https://site.tld/author/authorsname/page/2, page/3, etc., where the posts are like https://site.tld/1970/01/01/title-of-post

@nev Does the site in question have the API enabled? You might be able to vacuum up the entire site without needing credentials.

@drwho hmm, I don't know. How would I find out/tap into it?

@nev It's in core now, so you should be able to poke around with it.

developer.wordpress.org/rest-a

@drwho i'm not really a coder and don't know how to even REST or whatever but perhaps this is a good time to start learning!

The Doctor ✅ @drwho

@nev Give this a read - it's Huginn-specific, but it talks about how to interact with a REST API.

drwho.virtadpt.net/archive/201

· Web · 0 · 0