Modelwire
Subscribe

CodePivot: Bootstrapping Multilingual Transpilation in LLMs via Reinforcement Learning without Parallel Corpora

Illustration accompanying: CodePivot: Bootstrapping Multilingual Transpilation in LLMs via Reinforcement Learning without Parallel Corpora

Researchers propose CodePivot, a reinforcement learning method that trains LLMs to translate code across multiple programming languages without requiring parallel training data. The approach addresses a key bottleneck in transpilation: scaling beyond language pairs to support low-resource PLs efficiently.

Modelwire context

Explainer

The key constraint CodePivot sidesteps is data scarcity: parallel corpora (matched code samples in two languages doing the same thing) barely exist for most programming language pairs, which is why prior transpilation work has clustered around Python-to-Java and similar high-resource combinations. Reinforcement learning lets the model self-evaluate correctness through execution signals rather than requiring a human-curated ground-truth translation.

The translation-without-parallel-data problem has a direct analogue in natural language work covered here recently. The 'Fabricator or dynamic translator?' paper from April 16 examined how LLMs generate spurious content during machine translation, and many of those failure modes (hallucination, unhelpful paraphrase) are equally plausible in code transpilation where a wrong function signature compiles but silently misbehaves. Meanwhile, OpenAI's Codex expansion covered the same week shows commercial pressure building around coding AI, which gives the low-resource angle practical urgency: if agentic coding tools are going to operate across polyglot codebases, they need transpilation that goes beyond the popular-language pairs Codex was trained on.

The real test is whether CodePivot's RL reward signal (likely execution-based pass/fail) holds up on languages with limited runtime tooling or ambiguous semantics. Watch for follow-up benchmarks on genuinely low-resource targets like Fortran or COBOL, where execution environments are harder to instrument, within the next two conference cycles.

Coverage we drew on

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsCodePivot · LLMs

MW

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Modelwire summarizes, we don’t republish. The full content lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.

CodePivot: Bootstrapping Multilingual Transpilation in LLMs via Reinforcement Learning without Parallel Corpora · Modelwire