Meta's FAIR research team has released Brain2Qwerty v2, a model that reconstructs full sentences from non-invasive brain recordings. The average word error rate drops to 39 percent, and the best participant hits 22 percent. People who lose the ability to speak or move after a brain injury need a way to communicate. Brain implants already do this reliably, but they require risky surgery. Meta's AI division FAIR has been working on a surgery-free alternative for some time and now shows a major improvement with Brain2Qwerty v2. For the study, researchers recorded brain activity from nine healthy volunteers using magnetoencephalography (MEG), a technique that measures magnetic fields outside the skull. Each person was recorded for ten hours. Together, they typed a total of 22,000 sentences. The setup worked like this. Participants heard a sentence, paused briefly, then typed it on a keyboard without seeing the text on screen. The model reconstructs the sentence from brain signals captured during that typing phase.
The direct predecessor, Brain2Qwerty v1, still needed the exact timestamp of every single keystroke to align the signals. Version 2 works with a continuous signal window instead and assigns characters on its own, with no timing information. This asynchronous approach removes a key barrier on the path to real-time use, even though the system hasn't crossed that threshold yet. The harder task only works, the researchers say, because the new dataset contains ten times more recordings per person and far more varied sentences than the original. The model relies on three AI building blocks, according to the team. Deep learning replaced the hand-built recognition steps used before. The system processes signals at three levels: characters, words, and full sentences. And the team used AI agents to write the optimization code themselves. For the sentence level, a language model (Qwen3) is fine-tuned to shape noisy brain signals into coherent sentences. Brain2Qwerty v2 reaches an average word error rate of 39 percent, compared to 55 percent for the raw encoder without a language model. For the best participant, 28 percent of sentences are decoded perfectly, and 47 percent contain at most one wrong word.
The team compares Brain2Qwerty v2 against two simpler methods. The first is the raw encoder, which reads characters directly from the brain signal with no language model smoothing the output. The second is the approach from Brain2Qwerty v1, where an N-gram model corrects the encoder output. That kind of model knows the statistical likelihood of letter sequences from large text collections and patches individual character strings locally, but it doesn't form whole sentences. Performance is measured at three levels. Character error rate (CER) counts wrong letters. Word error rate (WER) counts wrong words. And semantic error rate captures how far the meaning drifts from the target sentence. On words and meaning, Brain2Qwerty v2 wins. The word error rate drops to 39 percent, compared to 55 percent for the raw encoder and 43 percent for the N-gram model from v1. At the character level, the picture flips. Here v2 hits 31 percent errors, worse than the raw encoder (28 percent) and the N-gram model (26 percent). The reason is the language model: It's trained to produce fluent sentences, even when the brain signal doesn't really support them. When in doubt, it invents a grammatically clean but completely wrong sentence. For the worst-performing participant, the model decoded "had she not fallen down the stairs" instead of the target sentence "cars are not allowed on this road." A total miss that drives the character error rate up.
Source: thedecoder