How many segments do I need in my translation memories?

Training corpus: translation memories converted into a corpus, or a publicly available corpus.

Personalized engines

  • 70,000 to 150,000 sentence segments
  • Equivalent to one full-time translator’s work over 3 to 4 years

Customized engines

  • 200,000 to 500,000 sentence segments
  • Supports a team of translators
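
As a rough illustration, the ranges above can be expressed as a small lookup. This is only a sketch: the function name, the labels it returns, and the handling of counts that fall outside or between the two ranges are assumptions for illustration, not part of any product.

```python
# Thresholds come from the ranges listed above. The labels and the treatment
# of counts outside those ranges are assumptions made for illustration.
PERSONALIZED_RANGE = (70_000, 150_000)
CUSTOMIZED_RANGE = (200_000, 500_000)


def suggest_engine_type(segment_count: int) -> str:
    """Map a sentence-segment count to the range it falls into."""
    if segment_count < 0:
        raise ValueError("segment_count must be non-negative")
    if segment_count < PERSONALIZED_RANGE[0]:
        return "below personalized range"
    if segment_count <= PERSONALIZED_RANGE[1]:
        return "personalized"
    if segment_count < CUSTOMIZED_RANGE[0]:
        return "between personalized and customized ranges"
    if segment_count <= CUSTOMIZED_RANGE[1]:
        return "customized"
    return "above customized range"
```

For example, suggest_engine_type(120_000) returns "personalized" and suggest_engine_type(300_000) returns "customized".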


Translation memories that contain only specialized segments yield better custom machine translation. Focusing on one specialization, such as financial and regulatory reports, clinical trials and pharmaceuticals, technical manuals, or legal contracts, yields more consistent and accurate translations.
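
One way to keep a training corpus specialized is to filter segments by subject area before training. The sketch below assumes each segment already carries a domain label; the Segment type, its field names, and the label values are hypothetical and not tied to any particular translation memory format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    source: str
    target: str
    domain: str  # hypothetical label, e.g. "legal" or "clinical"


def keep_specialized(segments: list[Segment], domain: str) -> list[Segment]:
    """Return only the segments labeled with the requested domain."""
    return [s for s in segments if s.domain == domain]
```

Calling keep_specialized(all_segments, "legal") would return only the segments labeled "legal", in their original order.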

There’s no upper limit on the number of segments, but a very large, less focused translation memory may degrade the engine’s output for a specific specialization.

