Posted by: Chris Brew | January 11, 2010

XML and Corpora

I just got asked to advise on tutorial materials about XML for a computer scientist starting a corpus encoding project. To get started with the very basics I like Greg Wilson’s Software Carpentry lecture. For more advice on how to go about building up a corpus, see Developing Linguistic Corpora: a Guide to Good Practice, by Martin Wynne and a bunch of Humanities Computing luminaries.

The project is going to work with childrens’ books. That led me to find the Comic Book Markup Language provides a tool for adding analytical markup to (wait for it) Comic Books. It uses TEI, which is great, but heavy-duty.


Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )


Connecting to %s


%d bloggers like this: