Sunday, March 30, 2025
Austere Instincts
Long context LLMs
Around mid-2024 I tried running feeding Hong Kong court cases to long context (local) LLMs and see how they fared.
Didn't work well. Although most of them claimed to support long contexts, they kind of just failed (got repetitive, etc.).
Since I wasn't really using them for important stuff I just set it aside.
[ Fast forward an eternity later (i.e. a couple months) ]
I'm pleasantly surprised to find that more recent models (as ancient as Llama-3.2 3B) actually performed pretty well on such long contexts. Llama-3.2 3B was actually the worst of the bunch, and apparently the recent gemma-3 models did really well.
The only issue is that gemma-3 27B is a bit slow.
Didn't really bother to check whether the difference was due to model performance or llama.cpp bugs. I suspect more of the former than the latter (Llama3.1 8B occasionally glitches out too).
Anyway, that's kinda good news, maybe I actually can create a comprehensive database of HK court case summary and commentary....
Sunday, March 23, 2025
Fear
Fear isn’t just fear
It is the logical invitation of possibility
The imagining of madness requires madness itself.
Sunday, March 16, 2025
What would God do? Solving the Paradox of Tolerance and why "evil" or "bad things" happen in the world.
Friday, March 14, 2025
Objective Function of Intelligence
Wednesday, March 12, 2025
Timescales
A reply I made on HN that is generally applicable (even devoid of context):
Apparently there's a quote attributed to Bill Gates: "people overestimate what they can do in one year and underestimate what they can do in 10 years."
People overestimate the changes that could happen within a couple years, and totally underestimate the changes that would happen in decades.
Perhaps it's a consequence of change having some kind of exponential behavior. The first couple years might not feel like anything in absolute terms, but give it 10 or 20 years and you'll see a huge change in retrospect.
IMHO, I don't think anyone needs to panic now, changes happen fast these days but I don't think things are going to drastically change in these ~2-3 years. But the world is probably going to look very different in 20 years, and in retrospect it will be attributed to seeds planted in these couple years.
In short I think both camps are right, except on different timescales.