Startingblockonline

Welcome to the new world

Latest

How We Translated a 40-Page Reseller Agreement Into German, French, and Polish, Step by Step: The Errors One AI Model Missed

When a startup signs its first reseller in a new country, the contract usually moves faster than the legal review. You have a German distributor ready to commit, a French partner asking for terms in their own language, and a Polish reseller who wants a local lawyer to read the agreement before anyone signs. A founder who has never thought twice about translation is suddenly responsible for one legal document that has to mean exactly the same thing in four languages at once.

That is the situation we set out to document. We took one artifact that early-stage companies deal with constantly, a 40-page reseller agreement, and translated it into German, French, and Polish using the kind of tools a small team would actually reach for. Then we checked the output line by line. This is the step-by-step account of what that took, and where a single AI model quietly got things wrong.

Why one model is a risk on a document that carries legal weight

The reflex in 2026 is to paste the document into whichever AI model you already pay for and move on. For a blog post or an internal memo, that is fine. For a contract it is a gamble, because language models do not fail loudly. They fail by producing a fluent sentence that is subtly wrong.

The scale of that risk is now measured. In its 2025 State of Translation Automation report, Intento evaluated 46 translation systems and found that baseline setups averaged 10 to 15 errors per text, while requirements-driven approaches cut that to between zero and two. On a marketing page, a few errors are a typo problem. On a reseller agreement, one mistranslated liability clause is a renegotiation, or worse.

There is also a commercial reason to get the languages right rather than default to English. CSA Research found that 76 percent of online buyers prefer to purchase in their native language, and 40 percent will not buy from a site in another language at all. A partner deciding whether to trust your contract is no different. The document is your first impression in their market.

The document, and the first two steps

Step one was preparation, not translation. We stripped the agreement down to its structure: defined terms, payment schedule, territory clauses, termination conditions, and governing law. Then we built a short glossary of the terms that had to stay identical across all three languages, things like Reseller, Territory, Net Revenue, and Confidential Information. If those drift even slightly between page 4 and page 31, the contract becomes ambiguous. It is the kind of unglamorous preparation that quietly saves money as a company scales.

Step two was the first-pass translation. We ran the full document through a single leading model, the way most teams would. The result read well. It was grammatical, confident, and in places wrong, which is exactly the problem. The German version rendered Net Revenue two different ways in the same document. The French softened a termination clause so that “may terminate” read closer to “might consider terminating.” Polish, the most morphologically complex of the three, drifted most on the defined terms, inflecting them inconsistently enough that a lawyer would have to stop and confirm whether two phrases referred to the same thing.

Step three: comparing several models instead of trusting one

This is where our process left the usual workflow behind. Instead of accepting one model’s output, we ran the same segments through several models and compared them side by side. Almost nobody does this by hand, because lining up four versions of the same paragraph and spotting where they disagree is slow, tedious work.

We used MachineTranslation.com to automate it. The AI translator that runs translation through 22 AI models at once, evaluates the source context, and surfaces the rendering that most of the models agree on, while flagging the segments where they split. The disagreements were the useful part. Every place the models diverged was a place worth a human look: the termination clause, the inconsistent revenue term, the Polish defined terms. Agreement across models is not proof of correctness, but disagreement is a reliable signal of risk, and on a legal document that signal is exactly what you want surfaced before signing.

That tension is the point, says Rachelle Garcia, AI Lead at Tomedes. “A single model gives you one opinion stated with total confidence. Running 22 models and watching where they disagree turns that confidence into something you can actually check.”

Step four: put human review where the disagreement is

Step four narrowed the human effort to where it mattered. Rather than re-reading 40 pages in three languages, we sent only the flagged segments to professional linguists for verification. That is the practical value of the approach: the machine does not replace the lawyer or the linguist, it tells them where to look. The reviewer confirmed the correct rendering of the termination clause, locked the revenue term to a single translation, and corrected the Polish inflection so the defined terms read consistently from the first page to the last.

What the comparison caught

The errors the single model produced were not exotic. They were terminology drift, a softened legal obligation, and inconsistent handling of a defined term: the everyday failure modes of generative translation on long documents. Internal benchmarks for the multi-model approach put the residual error rate under 2 percent and the reduction in error risk against single-model output at roughly 90 percent. The number that matters to a founder is simpler than either of those. The contract said the same thing in every language, and the review took hours instead of days.

A checklist for any founder translating a high-stakes document

You do not need our exact setup to apply the lesson. Before you translate anything that someone will sign, sue over, or rely on:

  • Build a glossary first. List the terms that must stay identical, and check them last.
  • Never trust a single model on a legal or financial document. Fluent does not mean faithful.
  • Compare outputs across models. Where they disagree is where your risk lives.
  • Put human review on the disagreements, not across the whole document.
  • Translate into the partner’s language, not your own. It is the first signal of how seriously you take their market.
  • If you are running lean, the same principle that applies here applies to the rest of your stack: lean on tooling before headcount. The team’s guide to building a business presence out of digital tools and the wider software and apps coverage are useful companions on that front.

    The work is no longer the translation

    Translation used to be the last item on a founder’s expansion checklist, handled in a rush once the deal was almost done. The economics have flipped. The translation itself is cheap and quick now. The cost lives entirely in the errors you do not catch. A reseller agreement that means three slightly different things in three markets is not a localization problem. It is a legal one wearing a localization disguise. The job is no longer translating the document. It is proving the translation is right before anyone signs it.

    Zorakryn Brynal
    Zorakryn Brynal brings a fresh analytical perspective to emerging technologies and their societal impact. Known for combining data-driven insights with clear, accessible writing, they specialize in demystifying complex technical concepts for general audiences. Their coverage focuses on AI developments, cybersecurity trends, and digital transformation. With a keen interest in how technology shapes human behavior and society, Zorakryn approaches topics through both technical and philosophical lenses. They maintain a balanced view between technological optimism and practical realism. Their engaging writing style connects technical expertise with real-world applications, helping readers understand both the "how" and "why" of technological change. Outside of writing, Zorakryn enjoys urban photography and reading science fiction, which informs their forward-looking perspective on tech trends.