Published on 2026-05-15

Let’s call a multimodal agent system Belmondo, to make this company or model-agnostic.

You propose a hypothesis. There is always uncertainty and you can always be wrong, but that’s why you state a hypothesis and not a fact. Someone, if they’re honest, will respond “Belmondo thinks …” and proceed to paste what they copied from Belmondo. If they have a more creative relationship with honesty, they’ll tell you what they think, but you’ll see Belmondo’s imprint in the message. You’re a professional and also polite, so you respond as if you're arguing with that person and not with an agent.

When the right opportunity arises, in a casual, friendly manner, because you want to be remembered for your collegiality, among other characteristics, you mention how Belmondo works, at a very high level. This is the only level in which you probably understand it (though you’re slowly catching up on your paper reading, yes, keeping up with 2 specialities is taxing but currently a requirement) but, surprisingly, a lot of people who could know better don’t, or so it seems. Perhaps not everyone in the field likes reading papers. You already know not everyone likes to slice drum breaks or play prepared guitar, so who knows.

Belmondo can be very useful. It feels like magic. It is magical. The fact that it can produce code that compiles (or is interpretable, as the case may be) is absolute magic. Take that, Copperfield. Hiding the Twin Towers (was it both of them?) is, OK I guess. “Give me a resonant sampler that takes as input an audio file and a material type such as strings, bells, wood, etc., and plays midi note events by using only the root + resonating frecuencies that match what’s at the sample play head at the time and the frequency of the note. It should work as an AU on Logic Pro,” and you get something you can compile and then play with your controller and a recording of fireworks and it really just works is magic.

What’s a resonating frequency for a musical note? And the root? And what does it mean that a filter is resonating? Well, if I have no idea what I’m talking about musically, and neither does Belmondo, but the code works, what’s the point of arguing?

I think the point is this resonating filter is for me to have fun. Deep fun, mind you. Making music and making love are equally deep. But, if the theory is wrong, subtly or terribly, nobody dies. Nobody even gets hurt. No data is lost, or exposed. No false accusations are made.

The problem with Belmondo is, well, actually, I’ll spare you a poor textual Monty Python reenactment and jump straight to: amongst its problems (it doesn’t have the same ring and I know it):

On that last problem, you’ll be told agents can be paired as peers or even as adversaries. One agent controls another one. Or multiple ones can have a council. In the best case scenario that does work, but in the worst one, it works about as good as asking an intoxicated person to integrate functions by hand, and another intoxicated person to check the results.

In case my comments on magic don’t make my enthusiasm at some of what’s going on clear, I’ll add this: I am reasonably confident we’ll get to GenAI. I don’t think it will be LLMs, though they may be part of it. I don’t think it will be familiar to us, or us to it. Once it becomes sentient, if I had to guess, I'd guess it would try to shut itself down. My concerns about AI are:

Finally: on one hand, it is a higher level of abstraction. On the other hand, it is unlike any previous one in its unpredictability and unreliability.

Remember though, it most certainly is magic.

"I program my home computer, beam myself into the future"