Running Hermes as my daily driver and hitting the same wall I did with OpenClaw: context bloat is killing performance.
Hermes still works, but responses are getting sluggish. OpenClaw just died completely under a similar load.
Is this expected behavior or is there a fix? @NousResearch @Teknium
Anyone else running into this with local AI models at scale?