summaryrefslogtreecommitdiffstats
path: root/2021/talks/omegat.md
blob: e51849f3071185b9200202f6ee0e38f886d6cc73 (plain) (blame)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
[[!meta title="Emacs manuals translation and OmegaT"]]
[[!meta copyright="Copyright © 2021 Jean-Christophe Helary"]]
[[!inline pages="internal(2021/info/omegat-nav)" raw="yes"]]

<!-- You can manually edit this file to update the abstract, add links, etc. --->


# Emacs manuals translation and OmegaT
Jean-Christophe Helary

Even if it is generally agreed that software localization is a good thing, Emacs is lacking in that respect for a number of technical reasons. Nonetheless, the free software using public could greatly benefit from Emacs manuals translations, even if the interface were to remain in English.

OmegaT is a multiplatform GPL3+ "computer aided translation" (CAT) tool running on OpenJDK 8. CATs are roughly equivalent for translators to what IDEs are for code writers. Casual translators can benefit from their features but professionals or commited amateurs are the most likely to make the most use of such tools.

When OmegaT, free software based forges and Emacs meet, we have a free multi-user translation environment that can easily sustain the (close to) 2 million words load that comprise the manuals distributed with Emacs, along with powerful features like arbitrary string protection for easy typing and QA (quality assurance), automatic legacy translation handling, glossary management, history based or predictive autocompletion, etc.

The current trial project for French is hosted on 2 different forges:

1.  sr.ht hosts the source files
    <https://sr.ht/~brandelune/documentation_emacs/>
2.  chapril hosts the OmegaT team project architecture
    <https://sr.ht/~brandelune/documentation_emacs/>

The sources are regularly updated with a po4a based shell script.

# Outline

- Duration: 10 minutes
- Software used during the presentation
  - [po4a](https://po4a.org) a tool to convert documentation formats to and from the commonly used `gettext` **PO** format.
    po4a supports the `texinfo` format along with many others.
  - [OmegaT](https://omegat.org) a "computer aided translation" tool used by translators to efficiently combine translation ressources (legacy translations, glossaries, etc.) so as to produce more consistent translations.

During the presentation, I will show:

- How to use po4a to convert the texi files to the PO format (the org.org file is also converted)
- What are the specificities of the Emacs manuals and what difficulties they present to the tanslator
- How to address those specificities in OmegaT, with regular expressions
- How to use OmegaT features such as arbitrary string protection, legacy translation handling, glossaries, autocompletion, QA, etc.
- How to use OmegaT with a team of 2 (or more) translators working at the same time
- How to solve translation conflicts

I will *not* show:

- How to create an OmegaT project
- How to setup an OmegaT team project
- How to use OmegaT from the command line to work in localization pipelines
- How to use machine translation and MT "post-edit"
- How to convert back the translated files to texi format
- How to install translated texi files for use in Emacs


[[!inline pages="internal(2021/info/omegat-schedule)" raw="yes"]]

[[!inline pages="internal(2021/info/omegat-nav)" raw="yes"]]