TBX-Min Information

TBX-Min is a dialect of TBX designed for simple and straightforward storage of bilingual or monolingual glossaries. The primary use case for TBX-Min is to transmit a glossary to a translator, but TBX-Min can also be used by translators to submit glossaries they have created. It was designed to be easily converted to and from its cousin TBX Basic, as well as other well known glossary storage formats such as UTX. Although its current structure more accurately reflects UTX elements, TBX-Basic elements may be proposed in the future. For more specific information on TBX-Min (including structure, format, etc.), download the Introduction to TBX-Min.

Also, a detailed description can be read in this paper written for TKE 2014 .

Please note that TBX-Min has evolved since the paper was written. Those changes are reflected in this document as "Alternative TBX-Min".

TBX-Min Data Categories

The header of a TBX-Min file contains the following:

The body contains the usual nested entry, langSet and termEntry(terminological entry).

The termEntry element contains:

The langSet element contains one or more tig elements and the attribute "xml:lang"

The tig (term information group) element contains the following:

The noteGrp element:


Note that most of the elements and categories reflect those found in UTX more than TBX-Basic. However, in the future, TBX-Basic categories and elements may be proposed.


Sample TBX-Min Files

This sample pack includes two valid TBX-Min files and an invalid file for testing the RNG and XSD schemas (See the Validation portion of the Tools section).


Tools:

TBX-Min Viewer

TBX-Min Viewer/Editor is a new application intended to allow for quick and user friendly viewing and editing of the contents of a TBX-Min file within a web browser. The viewer/editor can be found at viewer.tbxinfo.net/.

Validation

Tools to validate your TBX-Min file using either the RNG or XSD schemas are available online. You may also download the schemas for use in your own applications.

Support TBX-Min Import/Export Functions

This import/export guide details a simple pseudo-code construct for creating your own import/export function extension.

UTX Conversion

TBX-Min is a minimal TBX dialect designed to contain roughly the same information as a UTX file. For more information on UTX format, see the UTX homepage.

A sample UTX is available for download. It may be used with the TBX-Min samples above to test the converter. In addition, here is a guide providing a walkthrough for the conversion of a UTX file into TBX-Min, viewing it using viewer.tbxinfo.net, and converting the TBX-Min file further into TBX-Basic (the converter below).

You can convert your files between UTX and TBX-Min by uploading your files below.

Convert UTX to TBX-Min:

Convert TBX-Min to UTX:

TBX Basic Conversion

You can convert your files between TBX-Basic and TBX-Min by uploading your files below. The conversion from TBX-Min to TBX-Basic is mostly lossless (see the TBX-Basic to TBX-Min converter report for details). However, the conversion from TBX-Basic to TBX-Min will of necessity lose much of the structured information. For an explanation of the current version of this converter, and a sample input/output file, download the TBX-Basic to TBX-Min converter archive.

Convert TBX-Min to TBX-Basic:

Convert TBX-Basic to TBX-Min: Source Language: Target Language:

Spreadsheet Glossary Converter

You can convert glossaries stored in Spreadsheet format. An instruction guide is available for download along with a collection of sample spreadsheets. You will be asked which columns of the Spreadsheet represent which TBX-Min data categories (such as a 'Source Language' column or a 'Comment' column). Please note that in order for the converter to work, column headers of some sort must exist. You will also be asked about some information about the document in general (which gets used in the Header of the TBX-Min file).

*TBX-Min supports single language as well as two language glossaries, rather than being strictly bilingual. This converter only requires a source language. This feature can be used to pull out individual language glossaries one (or two if you want language pairs) at a time. If you want to do this, simply state the desired language to be the "source" here and then label the appropriate column header on the next page as 'Source Language'*.

Only Excel Spreadsheets or CSV files (.xlsx, .xls, .csv, or .txt [tab delimited format]) are supported currently, however .xlsx or .xls are recommended.

Convert Spreadsheet to TBX-Min:

Source Language* (must be ISO 639 two-letter code i.e. 'en','fr', etc.):
Target Language (must be ISO 639 two-letter code i.e. 'en','fr', etc.):
License:
Creator:
Description:
Directionality*:
Subject:
Glossary ID* (a number preceded by a letter such as 'G' will suffice. i.e. "G001"):