  • March 14th, 2026

    There’s a specific kind of paralysis that sets in when architects and senior developers start talking about "knowledge representation." We tend to visualize monolithic semantic graphs, reasoners churning through inference chains, and the promise of a perfectly modeled world. The default tool for this is often OWL (Web Ontology Language), the heavyweight champion of the [...]

  • March 13th, 2026

    The way we navigate complex codebases is undergoing a fundamental shift. For years, the dominant paradigm has been linear and manual: a developer opens a file, searches for a function, traces a call, jumps to a definition, and repeats. It is a meticulous, often tedious process that relies heavily on human short-term memory and IDE [...]

  • March 12th, 2026

    No, our robot will not coo at your dog. But the analogy is more engineering than poetry. We build a self-control layer for robots. Not the brain — the thing that keeps the brain from doing something stupid. Partenit sits between the robot's decision-making (whether that's a classical planner, an RL policy, or an LLM [...]

  • March 12th, 2026

    Most of us have been there. You're deep in a new feature, and you need to understand how a specific service handles authentication. Your IDE's global search is screaming with hundreds of matches for the word "token." You find a file, but it's a legacy implementation. You find another; it's a test mock. Finally, after [...]

  • March 11th, 2026

    The Uncomfortable Truth About Your Knowledge Base

    If you’ve ever worked in customer support engineering, you know the feeling. It’s 2 AM, the pager has gone off, and a Tier 1 agent is staring at a blinking cursor in a chat window. They know the customer is angry, but they don’t know if the issue [...]

  • March 10th, 2026

    There’s a peculiar obsession in the current LLM landscape with the "needle in a haystack" problem. We treat context windows like a cargo hold—if we just make it bigger, we can cram more in without consequence. But as anyone who’s optimized database queries or managed memory-constrained embedded systems knows, throwing hardware at a memory problem [...]

  • March 9th, 2026

    Graph Retrieval-Augmented Generation (GraphRAG) has emerged as a powerful paradigm for enhancing the factual grounding and reasoning capabilities of Large Language Models. By structuring external knowledge into a graph—nodes representing entities and edges representing relationships—systems can traverse complex semantic spaces to retrieve precise context before generating an answer. However, this architectural shift from flat document [...]

  • March 9th, 2026

    Adding a safety and decision layer to robot AI

    Most robotics engineers eventually encounter the same frustrating situation. A robot behaves unexpectedly during a test run. The team begins the usual investigation: logs are replayed, controller outputs are inspected, trajectories are plotted and reviewed frame by frame. And yet, even after all that effort, one [...]

  • March 8th, 2026

    The Hidden Cost of Waiting

    There is a specific kind of dread that settles in when you open a project repository and see a folder named experiments or research_spikes. It’s usually full of Jupyter notebooks, half-finished scripts, and a README.md that hasn’t been touched in six months. This is the graveyard of good ideas—concepts that [...]

  • March 7th, 2026

    Every engineering team that has seriously deployed a Retrieval-Augmented Generation (RAG) system eventually hits the same wall. It happens around the third month, usually after the initial excitement of connecting a vector database to a large language model (LLM) has worn off. The system works beautifully on the demo dataset, but in production, it starts [...]

  • March 6th, 2026

    When I first started building AI systems for large organizations, I made a naive assumption. I thought that if the model produced the right answer, the client would be happy. I was wrong. Enterprise sales cycles don't end with a demo that wows the room; they end in a compliance review meeting where someone from [...]

  • March 5th, 2026

    When we talk about building robust systems that leverage Large Language Models (LLMs) for complex, multi-step reasoning, we inevitably stumble into the territory of Recursive Language Models (RLMs). These are architectures designed to break down a problem, generate sub-tasks, execute them, and then synthesize the results—often repeating this cycle until a solution converges. While the [...]

  • March 4th, 2026

    When we think about building reliable systems—whether it's a retrieval-augmented generation (RAG) pipeline, a validation framework for machine learning models, or even a complex software module—we often gravitate toward rigid rule sets. We define strict ontologies, write exhaustive validation rules, and hope that our system adheres to them perfectly. But in practice, the world is [...]

  • March 4th, 2026

    Ivan Gubochkin, Iuliia Gorshkova, Pavel Salovskii

    Abstract: Aggregate FPS figures tell you whether a neural network meets a real-time threshold, but they reveal nothing about where the time is actually spent. When optimising an edge AI deployment, the question that matters is not “how fast is the whole model?” but “which layers dominate latency, and [...]

  • March 3rd, 2026

    The Perils of Greedy Search in Semantic Space

    When we first start building systems that attempt to answer complex questions by retrieving information from a corpus, we often fall into a trap of simplicity. The standard retrieval pipeline—take a user question, embed it, find the nearest neighbor in the vector database, and stuff that context [...]

  • March 2nd, 2026

    Compliance has traditionally been treated as a static documentation problem. Teams write policies, store them in PDFs or wikis, and then rely on human interpretation during audits or incident reviews. This approach breaks down in modern software environments where regulations change frequently, systems are distributed across clouds, and the cost of manual verification scales poorly. [...]

  • March 1st, 2026

    Every startup, by its very nature, begins with a chaotic burst of potential. It’s a collection of brilliant minds, nascent ideas, and frantic energy, all orbiting a single, burning question. In the early days, this chaos is a feature, not a bug. It allows for rapid pivoting and creative leaps. But as the company grows, [...]

  • February 28th, 2026

    There is a specific kind of fatigue that settles in after the third hour of staring at PDF tabs. You have twelve papers open, a blinking cursor in a notes app, and a growing suspicion that the paper you need is buried in the stack you just skimmed. You remember a graph, a specific equation, [...]

  • February 27th, 2026

    The modern landscape of artificial intelligence, particularly within the domain of Large Language Models (LLMs), is moving at a velocity that borders on the disorienting. We are witnessing a shift from monolithic, closed-system models to more dynamic, agentic architectures. Two acronyms have come to dominate the discourse around these practical applications: RAG (Retrieval-Augmented Generation) and [...]

  • February 26th, 2026

    There’s a particular rhythm to the way research agendas evolve in different parts of the world. In Silicon Valley, the dominant narrative often revolves around "scaling laws"—the idea that if you throw enough compute and data at a model, it will inevitably become more capable. The focus is on emergent properties, on pushing the boundaries [...]

  • February 25th, 2026

    It’s a peculiar moment to be mapping the intellectual geography of Knowledge Graphs and Large Language Models. If you’ve been in the trenches of NLP research over the last few years, you’ve felt the tectonic shift. We moved from the era where knowledge graphs (KGs) were the dominant paradigm for structured reasoning to the LLM [...]

  • February 24th, 2026

    For years, building a knowledge graph (KG) felt like a distinct, often tedious discipline. You wrote hand-crafted rules, wrestled with brittle regex patterns, and leaned heavily on complex ontologies that required PhD-level patience to maintain. The goal was to transform unstructured text into a structured web of entities and relationships, but the process was labor-intensive [...]

  • February 23rd, 2026

    If you’ve spent any time wrestling with large language models on tasks that require genuine multi-step reasoning, you’ve likely felt the friction. We push the model to "think step-by-step," but often, that thinking is a flat, linear process. It’s a single pass of reasoning, maybe with a few self-corrections, but it lacks the ability to [...]

  • February 22nd, 2026

    There’s a particular frustration that settles in when you’re trying to coax a large language model into following a complex, multi-step policy. You write a prompt that meticulously details the rules, edge cases, and required outputs. You feed it a document that contains the necessary data. The model responds, and on the surface, it looks [...]

  • February 21st, 2026

    There is a specific kind of cognitive dissonance that occurs when you are staring at a graph visualization of a knowledge graph generated by a system like GraphRAG, and you ask it a question that requires both a helicopter view and a microscope. You want to see the forest, but you also need to know [...]

  • February 20th, 2026

    Let’s be honest: most production AI systems today are just wrappers around a vector store and a large language model. They’re brittle. They hallucinate. They lose context. And when you try to bolt on "reasoning," you often end up with a chain of brittle prompts that feels more like magic than engineering. But there’s a [...]

  • February 19th, 2026

    When building systems that need to remember things, we often reach for the nearest vector database. It’s the default choice, the tool that promises to solve retrieval with a single API call. But I’ve been thinking a lot lately about the friction that appears when these systems scale beyond simple Q&A. The smooth surface of [...]