In July 2022, IFORELS publicly announced that its R&D team was actively developing Sefirot — a novel hybrid architecture combining recurrent and transformer models with the ambitious goal of achieving an effectively infinite context window. At the time, the announcement was framed as research rather than a commercial product, yet it represented a forward-looking strategic bet that would shape the company’s technical roadmap for years to come. The model carried a clear name and a precise hypothesis: the future of useful AI systems would be defined not only by model size but by the length of memory they could maintain across extended interactions.
The technical foundation was elegant and timely. In mid-2022, the dominant transformer architecture — which powers nearly all modern large language models — excelled at processing a fixed window of recent tokens with remarkable parallelism and accuracy. However, its attention mechanism scaled poorly with sequence length, causing practical limits of roughly 4,000 tokens in production systems. Recurrent networks, by contrast, could theoretically carry state forward indefinitely, remembering information across very long sequences, but they lacked the transformer’s efficiency on shorter, immediate context. Sefirot’s design proposed a hybrid solution: retain the transformer’s strengths for recent context while offloading long-term memory to a recurrent backbone whose hidden state would not need to be re-processed at every step. This architecture promised to solve the memory bottleneck that constrained real-world applications.
The bet held particular relevance for healthcare AI, IFORELS’s chosen domain. A complete patient record often spans discharge summaries, operative notes, insurance policies, prior admissions, lab results, and denial letters — information that rarely fits inside a 4,000-token window. A model capable of holding an entire chart in memory could reason across the full history in one pass, improving accuracy on complex clinical questions, coding decisions, and evidence synthesis. By naming the project and publishing the architectural direction in July 2022, the company staked an early claim in what would later become known as the long-context race.
This announcement arrived at a moment when most of the AI industry was still focused on scaling model parameters rather than memory. Production systems topped out at modest context lengths, and retrieval-augmented generation (RAG) was emerging as the standard workaround. IFORELS took the contrarian view that true long-context capability would become a fundamental differentiator, especially in high-stakes domains like medicine where missing even a single detail from years earlier could affect care or reimbursement. The research carried a timestamp that would prove prescient: eighteen months before Sefirot.ai shipped as a commercial product and well ahead of the industry’s broader embrace of million-token context windows.
Chief Operational Officer’s (COO) operator background and deep experience with complex, regulated data flows informed the direction. His prior work in enterprise systems integration and healthcare-adjacent IT operations had shown him how fragmented, longitudinal information creates real operational friction. The Sefirot project embodied the same disciplined, long-term thinking that defined IFORELS from its earliest days — investing in foundational infrastructure rather than chasing immediate consumer trends. By July 2022, the company had already moved beyond its 2021 product sprint and was methodically building the technical capabilities that would power its future healthcare AI leadership.
The Sefirot announcement, though quiet at the time, stands as a clear milestone. It demonstrated IFORELS’s willingness to pursue ambitious research aligned with its mission while maintaining focus on practical, enterprise-grade outcomes. The work laid essential groundwork for later breakthroughs, including the 40-million-token internal test and the eventual commercial launch of Sefirot.ai. In doing so, it reinforced the company’s reputation for technical foresight and execution excellence in the rapidly evolving field of AI infrastructure.





