#+TITLE: Text Encoding Initiative
#+AUTHOR: "Gabriel Arazas"
#+EMAIL: "foo.dogsquared@gmail.com"
#+DATE: "2020-04-12 13:07:20 +08:00"
#+DATE_MODIFIED: "2020-09-09 05:16:32 +08:00"
#+LANGUAGE: en
#+OPTIONS: toc:t
#+PROPERTY: header-args :exports both
Having your research stored as text files written in a lightweight markup language is great.
However, some information can still get lost along the way.
For example, a word like "London" can mean the capital of England, a city in Ontario, Canada, or a person with that surname.
Wikipedia disambiguation pages collect many similar cases (like the [[https://en.wikipedia.org/wiki/London_(disambiguation)][one for London]]).
The [[https://tei-c.org/][Text Encoding Initiative]] (TEI) attempts to solve exactly that.
It's a standard that focuses on encoding the meaning of the words in a text.
Being a standard, it also frees writers from depending on any particular software: it is the tool developers who have to conform to it.
The specification uses XML for markup, and there are [[https://wiki.tei-c.org/index.php/Category:Tools][various tools]] for creating TEI content on top of the existing ecosystem of XML-related tools.
TEI documents can also be converted into various formats, including HTML, LaTeX, and JSON, through [[https://github.com/TEIC/Stylesheets][XSLT 2.0 stylesheets]].
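
To make the "London" example concrete, here is a minimal sketch of how TEI markup can tell the three meanings apart, read back with Python's standard library.
The XML is only a fragment for illustration, not a complete valid TEI document (it omits the required =teiHeader=), and the =ref= values and the =tagged_names= helper are hypothetical names made up for this note.

#+BEGIN_SRC python
import xml.etree.ElementTree as ET

# Namespace used by TEI P5 documents.
TEI_NS = "http://www.tei-c.org/ns/1.0"

# Illustrative fragment: the same word is tagged as a person or a place.
# The ref targets (#jack_london, ...) are made-up identifiers.
SAMPLE = """\
<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <text>
    <body>
      <p><persName ref="#jack_london">London</persName> wrote about
      <placeName ref="#london_england">London</placeName>, not
      <placeName ref="#london_ontario">London</placeName>.</p>
    </body>
  </text>
</TEI>
"""


def tagged_names(xml_text):
    """Return (kind, text, ref) for each persName/placeName, in document order."""
    root = ET.fromstring(xml_text)
    wanted = (f"{{{TEI_NS}}}persName", f"{{{TEI_NS}}}placeName")
    found = []
    for element in root.iter():
        # ElementTree spells namespaced tags as "{namespace}localname".
        if element.tag in wanted:
            kind = element.tag.split("}", 1)[1]
            found.append((kind, element.text, element.get("ref")))
    return found


for kind, text, ref in tagged_names(SAMPLE):
    print(kind, text, ref)
#+END_SRC

The plain text reads "London" three times, but the markup keeps the person and the two places apart, which is exactly the information a lightweight markup file would lose.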
* Relevant notes
- [[file:2020-04-15-14-35-55.org][Note-taking]]
- [[file:2020-04-12-11-20-53.org][Reproducible research]]