Writer2xhtml

4
Average: 4 (2 votes)

Writer2xhtml provides some export filters:

  • XHTML and XHTML+MathML export filters for Writer
  • XHTML export filter for Calc

Writer2xhtml is an alternative to the XSLT-based XHTML filters provided with OpenOffice.org.

Writer2xhtml gives you extensive control over the export of formatting (using a configuration file). You can export the formatting in the document to CSS, or you can map the styles in the document to your own stylesheet.

If you have trouble installing Writer2xhtml, please have a look at the installation FAQ.

New: The current development release available from http://writer2latex.sourceforge.net now also exports electronic books in EPUB format.

For information about other features, documentation, source code and development releases see http://writer2latex.sourceforge.net

Currently, the extension is available in Brazilian Portuguese, English, Czech, Danish, Dutch, German, French, Polish, Spanish, Italian, Turkish, Russian and Ukrainian. Thank you to the translators! If you want to translate it into your own language, please see the information on the project web site.

Download extension
Operating System: System Independent
Compatible with: OpenOffice.org 2.2 | StarOffice 8 Update 6 or higher.
Version: 1.0.2
Date: 2010-Jun-09
Size: 297.22 KB

License: Opensource | GNU Lesser General Public License
Downloads: Today: 1 | Week: 43 | Month: 143 | Year: 27,470

Comments

thanks a lot...

... for this extension, it makes my life easier :D great work!
is it still under development, i've just seen it's not working on newer versions of oo?
greetings,
markus

Your'e welcome

Yes, it's still in development. Development releases can be found at http://writer2latex.sourceforge.net.

I'm not aware of problems with newer versions of OOo, could you please add some more info?

Best regards
Henrik

Does not work under linux

I have tested this extension under windows xp (OO 3.1) where it does indeed work as expected.

Unfortunately not all styles are defined in the style section in the header but rather as style definitions directly in the code. Results would be much cleaner if everything was defined in the code.

Under the latest Opensuse 11.2 using OO 3.1 the extension does not work. When choosing to export as xhtml strict 1.1 I get the ollowing error message and the export screen letting me specify the export options does not show:


Error saving the Document .
Wrong parameter.
The operation was started under an invalid parameter.

Would be great if this would work under linux as well, saving me from a reboot.

Cheers J.

Works pretty well with Linux (april 2010)

I used writer2xhtml with Ubuntu 09/10 and OOo 3.2 in order to publish ebooks in EPUB format.
On a whole, it's pretty fast, stable, can process very big and complex files including lots of footnotes, table of contents, etc. I hotly recommend it.

Other comments.

If I understand well what happens, when an OOo file is not totally "clean" (has some overload like italics, graphics and others...), writer2xhtml creates some "sgc" styles to identify them, which in an ideal world, would need to be further processed by the user.

Some overload is pretty necessary. Many times though, it's a user's mistake. Let's begin with these ones. If you informed the user, prior to conversion, that writer2xhtml detected the following sgc styles with a short CSS description of them and a tick box to confirm they are really used, it could save a lot of cleaning work later. If the user dont' understand what it's all about... just click "next"

As for the really used overload styles.
- graphics are "magically" processed. I would advise, when possible though, to avoid this feature which is a little code heavy because it creates a new style for every image... and to process directly the insert of images in html code view.
- italics are a sensitive spot which probably may be refined further. I understand there is no perfect solution, but after conversion I had to add spaces before and after. After this, I had to take out space before point and comas (French typographic rules). The most awkward was dealing with a short Italic sentence including a so-called apostrophe. It stops the sgc style and resumes it immediately after. So you get two of the same sgc to describe one sentence. (ex:Vierge de l'Ascension)
- do you really need a dedicated style to process the ° character ? (like in 36° 12' or like in n° 12)

Thanks you very much for providing a wonderful and efficient tool.

To Roger640

Hi roger640

Thanks for your positive feedback.

Could you please provide a sample document and use this to explain the output you would like to generate?
You can find my email address on the project web site, http://writer2latex.sourceforge.net.

BTW: The next version of Writer2xhtml supports direct export to EPUB (see my web site for more information).

Best regards
Henrik