Table of Contents
Translation Related File Formats
These are the different storage formats for translations and files associated with translations that are supported by the toolkit. See also conformance for standards conformance.
Common concerns in handling these files using our base classes for a uniform API are quoting and escaping.
Primary translation formats
Other translation formats
- .ini (including Inno Setup .isl dialect)
- Java properties (also Mozilla derived properties files)
- Mozilla DTD
- OpenOffice.org GSI (Also called SDF)
- PHP translation arrays
- Qt Linguist .ts (both 1.0 and 1.1 supported, 1.0 has a converter)
- Symbian localization files
- Windows RC files
- Mac OSX strings files (also used on the iPhone) (from version 1.8)
- Adobe flex files (from version 1.8)
- Haiku catkeys (from version 1.8)
Translation Memory formats
Glossary formats
- OmegaT glossary (from v1.5.1)
- UTX (from v1.9.0)
Formats of translatable documents
- JSON (from v1.9.0)
- OpenDocument - all ODF file types
- Text - plain text with blocks separated by whitespace
- Subtitles - various formats (v1.4)
Machine readable formats
In development
Unsupported formats
Things that we would like to support but don't currently:
- Wordfast: glossary tab-delimited “source,target,comment” i.e. like OmegaT but unsure if any extension is required.
- Apple:
- Adobe:
- FrameMaker's Maker Markup Language (MML)
- Microsoft
- Word, Excel, etc (probably through usage of OpenOffice.org)
- OOXML (at least at the text level we don't have to deal with much of the mess inside OOXML). See also: Open XML SDK v1
- Rich Text Format (RTF) see pyrtf-ng
- .NET Resource files (.resx) - Bug 396
- XML related
- Generic XML
- unoconv - Python bindings to OpenOffice.org UNO which could allow manipulation of all formats understood by OpenOffice.org.
- Trados:
- Multiterm XML TSV to MiltiTerm conversion script or XLST
- Tcl: .msg files. Good documentation
- Installers:
- NSIS installer: Existing C++ implementation
- WiX - MSI (Microsoft Installer) creator. Localization instructions, more notes on localisation. This is a custom XML format, another one!
- catgets/gencat: precedes gettext, looking in man packages is the best information I could find. Also LSB requires it. There is some info about the source (msgfile) format on GNU website
- Trados:
- .tmw
- Deja Vu External View: Instructions sent to a translator, Description of external view options and process
Unlikely to be supported
These formats are either: too difficult to implement, undocumented, can be processed using some intermediate format or used by too few people to justify the effort. Or some combination or these issues.