Why you can't parse HTML with Regex:
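HTML's nested tag structure calls for a Chomsky Type 2 grammar (context-free grammar), while regular expressions describe only Chomsky Type 3 grammars (regular grammars). Type 2 languages are strictly more expressive than Type 3: matching arbitrarily deep nesting, such as a <div> inside a <div> inside a <div>, takes something like a stack to keep track of open tags, and a regular expression has none. A regular expression on its own is therefore insufficient to parse HTML in the general case.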
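
To make this concrete, here is a minimal Python sketch. The sample string and the names `regex_inner`, `DepthTracker`, and `max_nesting` are made up for illustration; only `re` and `html.parser` from the standard library are used. It shows a simple pattern cutting a nested element short, while a parser that counts open tags handles the same input.

```python
import re
from html.parser import HTMLParser

# Hypothetical sample input: one <div> nested inside another.
NESTED = "<div>a<div>b</div>c</div>"

# Lazy pattern: stops at the first </div> it finds, with no idea
# which opening tag that closing tag belongs to.
DIV_RE = re.compile(r"<div>(.*?)</div>")


def regex_inner(html):
    """Return what the regex believes is the content of the first <div>."""
    match = DIV_RE.search(html)
    return match.group(1) if match else None


class DepthTracker(HTMLParser):
    """Tracks nesting depth with a counter, which plays the role of a stack.

    Assumes every start tag has a matching end tag (no void
    elements such as <br>), which is enough for this sketch.
    """

    def __init__(self):
        super().__init__()
        self.depth = 0
        self.max_depth = 0

    def handle_starttag(self, tag, attrs):
        self.depth += 1
        self.max_depth = max(self.max_depth, self.depth)

    def handle_endtag(self, tag):
        self.depth -= 1


def max_nesting(html):
    """Return the deepest nesting level found in the input."""
    tracker = DepthTracker()
    tracker.feed(html)
    tracker.close()
    return tracker.max_depth


print(regex_inner(NESTED))  # a<div>b  (the outer div is cut short)
print(max_nesting(NESTED))  # 2
```

Making the pattern greedy does not help: it then runs to the last </div> in the input, which breaks on two sibling elements instead. Either way the pattern cannot pair each closing tag with its own opening tag.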

