Use Xalan to process an XHTML source file into a Docbook file:
| java org.apache.xalan.xslt.Process -XSL html2dbk.xsl -IN doc.html > doc.xml | 
See index.src.html for an example of an input file.
If your source files are in HTML, not XHTML, you may find the Tidy tool useful. This is a tool that converts from HTML to XHTML, and can be added to the front of your processing pipeline.
(If you need to process HTML and you don't know or can't figure out from context what a processing pipeline is, html2db.xsl is probably not the right tool for you, and you should look for a local XML or Java guru or for a commercially supported product.)