When we fetch the metadata for an article, we make a very large Action API call. To make our API requests leaner and faster, we want to investigate what we can extract from the wikitext and HTML instead of calling the Action API.
Acceptance criteria
Tickets with actionable items describing what we can extract, and how, are created.
Things to consider
- we can extract Templates
- we can extract Categories
- what other things can we extract?
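
As a starting point for the investigation, here is a minimal sketch of pulling templates and categories out of page HTML using only the Python standard library. It assumes the HTML follows Parsoid-style markup, where categories appear as `<link rel="mw:PageProp/Category">` elements and transclusions carry `typeof="mw:Transclusion"` with a JSON `data-mw` attribute; that assumption should be verified against the HTML we actually receive. The names `PageMetadataParser`, `extract_metadata` and `SAMPLE_HTML` are hypothetical, and the sample markup is made up for illustration.

```python
import json
from html.parser import HTMLParser
from urllib.parse import unquote


class PageMetadataParser(HTMLParser):
    """Collects category and template names from Parsoid-style HTML (assumed markup)."""

    def __init__(self):
        super().__init__()
        self.categories = []
        self.templates = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)

        # Assumed convention: <link rel="mw:PageProp/Category" href="./Category:Name">
        if tag == "link" and attr.get("rel") == "mw:PageProp/Category":
            href = unquote(attr.get("href") or "")
            name = href.split("#", 1)[0]  # drop the sort key, if any
            if name.startswith("./"):
                name = name[2:]
            self.categories.append(name.replace("_", " "))

        # Assumed convention: transclusions are typed mw:Transclusion and
        # describe their templates in a JSON blob stored in data-mw.
        if "mw:Transclusion" in (attr.get("typeof") or "").split():
            data = json.loads(attr.get("data-mw") or "{}")
            for part in data.get("parts", []):
                # Parts can also be plain strings (literal wikitext); skip those.
                if isinstance(part, dict) and "template" in part:
                    target = part["template"].get("target", {})
                    self.templates.append(target.get("wt", "").strip())


def extract_metadata(html):
    """Return the categories and templates found in one page's HTML."""
    parser = PageMetadataParser()
    parser.feed(html)
    parser.close()
    return {"categories": parser.categories, "templates": parser.templates}


# Made-up sample input, not taken from a real article.
SAMPLE_HTML = """
<p><span about="#mwt1" typeof="mw:Transclusion" data-mw='{"parts":[{"template":{"target":{"wt":"Infobox person","href":"./Template:Infobox_person"},"params":{},"i":0}}]}'>x</span></p>
<link rel="mw:PageProp/Category" href="./Category:Living_people"/>
"""

if __name__ == "__main__":
    print(extract_metadata(SAMPLE_HTML))
```

A sketch like this only covers what is visible in the HTML itself. Whether it can fully replace the corresponding fields of the Action API response (for example hidden categories or templates used indirectly) is one of the questions the follow-up tickets should answer.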
Parsers to evaluate
https://gitlab.wikimedia.org/repos/research/html-dumps
https://github.com/earwig/mwparserfromhell
https://www.mediawiki.org/wiki/Alternative_parsers