Vexen Crabtree 2015


Vexen Crabtree's Live Journal

Sociology, Theology, Anti-Religion and Exploration: Forcing Humanity Forwards


(no subject)

Vexen = moody

I want to find some epic HTML task to engage in... but what? I know... I'll remove all the redundant KEYWORD meta tags from all my sites! Then do a few items from my webpages_todo.txt file...

Number 1: Add lots of links to content-pages from photo-pages.

And add some formatted context quotes to pages, the same as what BBC News does[1].

Redo my http://www.vexen.co.uk/books/ directory! That's what I need to do!

[1] For amusement value I phrased that sentence like a stupid person. Innit.

(Deleted comment)
You could always "fix" it by turning it into black on white monospace plaintext, no graphics, and lynx compatible. Arguably, that would be a lot better than some of the shite that is out there these days... :)

There's a PHP library called XML_HTMLSax that lets you treat tag-soup HTML as if it were well-formed XML. That means you can do all kinds of things, such as modifying the attributes of specific tags or deleting them entirely, like the <meta> keyword tags. Perhaps something similar exists for a language you're familiar with, which would let you do that double-quick.
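For what it's worth, here is a rough sketch of the same idea in Python rather than PHP, using the standard library's `html.parser` instead of XML_HTMLSax (so this is a swapped-in technique, not that library's API). The names `MetaKeywordStripper` and `strip_meta_keywords` are made up for illustration. It re-emits everything it reads except `<meta name="keywords">` tags, and it assumes you don't mind closing tags coming back out in lower case.

```python
from html.parser import HTMLParser


class MetaKeywordStripper(HTMLParser):
    """Re-emit HTML as read, except for <meta name="keywords"> tags."""

    def __init__(self):
        # convert_charrefs=False keeps entities such as &amp; as written.
        super().__init__(convert_charrefs=False)
        self.out = []

    def _is_keywords_meta(self, tag, attrs):
        # The parser lower-cases tag and attribute names, but not values.
        if tag != "meta":
            return False
        name = dict(attrs).get("name") or ""
        return name.lower() == "keywords"

    def handle_starttag(self, tag, attrs):
        if not self._is_keywords_meta(tag, attrs):
            self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Covers self-closing forms such as <meta ... />.
        if not self._is_keywords_meta(tag, attrs):
            self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def handle_comment(self, data):
        self.out.append(f"<!--{data}-->")

    def handle_decl(self, decl):
        self.out.append(f"<!{decl}>")

    def handle_pi(self, data):
        self.out.append(f"<?{data}>")


def strip_meta_keywords(html):
    """Return html with every <meta name="keywords"> tag removed."""
    parser = MetaKeywordStripper()
    parser.feed(html)
    parser.close()
    return "".join(parser.out)
```

Calling `strip_meta_keywords(page_text)` on a page's contents hands back the same markup minus the keyword tags. Unclosed tags and upper-case tag names go through without complaint, which is the point of using a forgiving parser here.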

You could also strip extraneous HTML and add a <link> tag to load a stylesheet. But that's bordering on the scary and obsessive, given just how many pages you have :)
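Adding the `<link>` tag is the easy half of that. A minimal sketch in Python, assuming each page has a closing `</head>` somewhere and that the stylesheet is called `style.css` (both the file name and the function name `add_stylesheet_link` are hypothetical):

```python
import re

# Matches the closing head tag in any letter case, e.g. </head> or </HEAD>.
HEAD_CLOSE = re.compile(r"</head\s*>", re.IGNORECASE)


def add_stylesheet_link(html, href="style.css"):
    """Insert a stylesheet <link> just before the first </head>, once only."""
    link = f'<link rel="stylesheet" type="text/css" href="{href}">'
    if link in html:
        return html  # already present, so leave the page alone
    # A function replacement avoids backslash-escape surprises in href.
    return HEAD_CLOSE.sub(lambda m: link + "\n" + m.group(0), html, count=1)
```

Pages with no `</head>` at all come back unchanged, so those would still need sorting out by hand.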

What's "tag-soup" HTML like?

Fortunately I wrote a rather nifty notepad program that can do complex multi-file search and replace (or append or prepend to files, etc.), so removing all the keyword meta tags is a deceptively easy task to do when I get round to doing it.
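For anyone without such a program, the same kind of multi-file search and replace can be sketched in a few lines of Python. This is not how the notepad program works, just an illustration: the regular expression, the `*.htm*` file pattern and the function name `strip_keywords_in_tree` are all assumptions, and the pattern will trip over a `>` inside an attribute value. It works on raw bytes so that line endings and character encodings are left exactly as they were.

```python
import re
from pathlib import Path

# Assumed pattern: a <meta> tag whose name attribute is "keywords", in any
# letter case, quoted or unquoted, plus trailing spaces and one line break.
META_KEYWORDS = re.compile(
    rb"""<meta\b[^>]*?\bname\s*=\s*["']?keywords["']?(?=[\s/>])[^>]*>[ \t]*\r?\n?""",
    re.IGNORECASE,
)


def strip_keywords_in_tree(root, pattern="*.htm*"):
    """Rewrite matching files under root in place; return the changed paths."""
    changed = []
    for path in sorted(Path(root).rglob(pattern)):
        if not path.is_file():
            continue
        original = path.read_bytes()
        cleaned = META_KEYWORDS.sub(b"", original)
        # Only touch files that actually contained a keywords tag.
        if cleaned != original:
            path.write_bytes(cleaned)
            changed.append(path)
    return changed
```

Running it on a copy of the site first would be wise, since it overwrites files in place. Something like `strip_keywords_in_tree("site_copy")` returns the list of files it changed, where `site_copy` is a placeholder for wherever the pages live.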

None of my sites are scripted either!
