This submission is a tale about how I launched an unlimited LLM provider to about 60 hyped people on the waitlist, then immediately served them a fully dysfunctional death-loop model. Most people, very reasonably, disappeared, but a few extremely nice people stuck around anyway, and thanks to them we kept the project alive. It's still pretty chaotic, but it's gaining traction. To back up a little bit: I believe that the whole point of AI agents is that they should keep working. They should read file
This content offers a unique, first-hand perspective on the economics of LLM inference. It challenges the industry-standard metered pricing model, which is particularly relevant for the development of autonomous agents that require high-frequency loops.