This README provides information about the psiconv HTML4 output generator.

Output files generated using the option `-t HTML4' use cascading style sheets
(CSS1) with embedded style rules to specify all text formatting information,
rather than the more commonly used HTML version 2.0 and 3.2 elements such as
<FONT>, <B>, <CENTER> etc. Output files can thus only be properly displayed
on the more recent web browsers such as Netscape 4.x and IE 4.0, but the output
is a much more accurate conversion from the original Psion document than is
possible with HTML 2.0 or 3.2. Browsers that do not support cascading style
sheets will show the text but no character or paragraph formatting.

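As a rough illustration of the style-sheet approach described above, the
Python sketch below builds one paragraph whose formatting is carried entirely
by CSS1 properties instead of <FONT>, <B> or <CENTER>. The function name, its
parameters and the use of a STYLE attribute are assumptions made for this
example; the markup psiconv actually writes may be organised differently.

```python
def css_paragraph(text, font_family, size_pt, bold=False, align="left"):
    """Return one <P> element formatted only through CSS1 properties.

    Hypothetical helper for illustration; not part of psiconv.
    """
    rules = [
        "font-family: " + font_family,
        "font-size: " + str(size_pt) + "pt",
        "font-weight: " + ("bold" if bold else "normal"),
        "text-align: " + align,
    ]
    style = "; ".join(rules)
    return '<P STYLE="' + style + '">' + text + "</P>"


print(css_paragraph("Chapter 1", "Arial", 14, bold=True, align="center"))
```

A browser without style sheet support ignores the STYLE attribute and shows
only the text, which matches the fallback behaviour described above.
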
Output files that do not contain user-written HTML constructs should comply
with W3C's HTML 4.0 Strict DTD [not checked yet; anj 20-Jun-1999].

The text on the first line of the document header is used as the page title.

Hard page-breaks are converted into a horizontal rule <HR>. Unfortunately, in
Netscape 4.51 any text following in the same paragraph loses its styling, so
a hard page-break should really only be used at the end of a paragraph.

Paragraph borders using the types dot-dash and dot-dot-dash are converted to
dashed and dotted respectively, as these two Psion types do not have a direct
equivalent in HTML 4.0. However, Netscape 4.51 only seems to display borders
as solid, and in only one color (black).

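The border substitution described above amounts to a small lookup. The Python
sketch below shows one way to express it. The string names for the Psion
border types and the function name are assumptions made for this example;
only the two substitutions themselves come from the text.

```python
# The nine border-style keywords defined by CSS1.
CSS1_BORDER_STYLES = {
    "none", "dotted", "dashed", "solid", "double",
    "groove", "ridge", "inset", "outset",
}

# Psion types with no direct equivalent, and their nearest substitutes.
SUBSTITUTES = {
    "dot-dash": "dashed",
    "dot-dot-dash": "dotted",
}


def css_border_style(psion_type):
    """Map a (hypothetically named) Psion border type to a CSS1 keyword."""
    if psion_type in SUBSTITUTES:
        return SUBSTITUTES[psion_type]
    if psion_type in CSS1_BORDER_STYLES:
        return psion_type
    raise ValueError("unknown border type: " + psion_type)


print(css_border_style("dot-dash"))      # prints "dashed"
print(css_border_style("dot-dot-dash"))  # prints "dotted"
```
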
Netscape also doesn't seem to handle superscript and subscript properly using
the style sheet approach, so if these don't work, that is due to their bug,
not mine.

Bullets are supported, but the output is not quite the same as on the Psion.
HTML lists don't allow you to pick your own character for the bullet like the
Psion does (you could use an image of the relevant character, but you'd have to
create it on the fly to get the right foreground color for full support), and
they're not particularly straightforward to use, so I don't.

Short HTML constructs such as hyperlinks and images can be entered in the Word
document by using the Psion characters CTRL+139 and CTRL+155 (the single
angle-quote characters lsaquo and rsaquo), which are converted into < and >
respectively in the output file. Longer constructs such as tables would
probably interact with the paragraph formatting to the detriment of both,
although if kept "on one line" (i.e. within a single Psion paragraph) they may
be feasible.

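The angle-quote convention described above can be sketched in Python as
follows. The function name is hypothetical, and two details are assumptions
rather than documented psiconv behaviour: that literal <, > and & in ordinary
text are escaped as entities, and that all other character codes can be
treated as Latin-1.

```python
LSAQUO = 139  # Psion character code for the left single angle-quote
RSAQUO = 155  # Psion character code for the right single angle-quote

# Assumption: literal markup characters in ordinary text are escaped.
ESCAPES = {ord("<"): "&lt;", ord(">"): "&gt;", ord("&"): "&amp;"}


def psion_to_html(codes):
    """Convert an iterable of Psion character codes into an HTML string."""
    out = []
    for code in codes:
        if code == LSAQUO:
            out.append("<")
        elif code == RSAQUO:
            out.append(">")
        elif code in ESCAPES:
            out.append(ESCAPES[code])
        else:
            # Simplification: treat every other code as Latin-1.
            out.append(chr(code))
    return "".join(out)


link = b'\x8bA HREF="notes.html"\x9bNotes\x8b/A\x9b'
print(psion_to_html(link))  # prints <A HREF="notes.html">Notes</A>
```
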
The only major facilities provided by Psion Word that are missing from the
HTML4 output are support for tabs and embedded objects. Tabs look to be
impossible to implement properly because HTML does not provide an equivalent
construct (tables are not suitable for anything other than superficial
support: the classical typewriter model of tabs needs to know the current
print position in order to work out which tab stop to align to next, and the
print position of a particular character will depend upon the font in use).
Embedded objects need the relevant stream formats to be documented and
suitable converters implemented (embedded Sheet files could be converted into
tables, and Sketches into inline images).

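To make the tab problem concrete, here is a minimal Python sketch of the
typewriter model mentioned above. All names, units and character widths are
invented for the example, and the fallback to evenly spaced default stops
after the last explicit stop is an assumption, not a statement about how
Psion Word behaves.

```python
def print_position(text, char_widths):
    """Sum per-character widths to get the print position after the text."""
    return sum(char_widths[ch] for ch in text)


def next_tab_stop(position, tab_stops, default_spacing):
    """Return the first tab stop strictly to the right of the position."""
    for stop in sorted(tab_stops):
        if stop > position:
            return stop
    # Assumption: past the last explicit stop, use evenly spaced defaults.
    return (position // default_spacing + 1) * default_spacing


# Made-up widths for a proportional font, in arbitrary units.
widths = {"i": 3, "W": 9}
stops = [20, 40]

narrow = print_position("iiii", widths)  # 12
wide = print_position("WWWW", widths)    # 36

print(next_tab_stop(narrow, stops, 20))  # prints 20
print(next_tab_stop(wide, stops, 20))    # prints 40
```

Two strings of the same length end up at different tab stops, and the
converter cannot know the widths of whatever font the browser will choose.
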
- Andrew Johnson <anjohnson@iee.org>