> The right way might be to fight AI slop with AI enforced guard rails.
Whenever I tried to develop guardrails with LLMs, I found that they are much better at "cheating" than a human: getting around the guardrails by creating the ugliest hacks imaginable.
Mostly works for me. In most of my projects I have some guardrails for what to do around testing, deploying, etc., and it seems to work. You are right that LLMs are good at avoiding work and finding loopholes to do so. But generally, if you ask Codex "hey, look at my GH PRs and label the ones that don't meet the contributor guidelines with 'slop'", it might do a decent enough job. Maybe add a skill that spells out the criteria. Maybe set up OpenClaw or similar to do this every morning and then give you the list of PRs it will auto-close after you say the word.
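A rough sketch of that morning job, assuming the real `gh` CLI (its `pr list --json` and `pr edit --add-label` subcommands do exist), with the actual quality judgment stubbed out as a trivial, hypothetical heuristic — in practice the model or a skill file would supply the criteria:

```rust
use std::process::Command;

// Placeholder criteria (made up for this sketch): a PR is "slop" if it has
// an empty description or a WIP title. In the workflow above this judgment
// would come from the LLM, not a hardcoded check.
fn violates_guidelines(title: &str, body: &str) -> bool {
    body.trim().is_empty() || title.to_lowercase().contains("wip")
}

// Shells out to the real `gh` CLI; assumes it is installed and
// authenticated. JSON parsing is elided to keep the sketch short.
#[allow(dead_code)]
fn label_slop_prs() {
    let out = Command::new("gh")
        .args(["pr", "list", "--json", "number,title,body"])
        .output()
        .expect("is gh installed?");
    let json = String::from_utf8_lossy(&out.stdout);
    // ... parse `json`, and for each PR number `n` failing the check:
    // Command::new("gh").args(["pr", "edit", &n.to_string(),
    //                          "--add-label", "slop"]).status().ok();
    let _ = json;
}

fn main() {
    // label_slop_prs() would run from the scheduled morning job;
    // here we just exercise the stub criteria.
    println!("{}", violates_guidelines("WIP: refactor everything", ""));
    // prints "true"
}
```

The auto-close step would be the same pattern with `gh pr close`, gated on your confirmation.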
There are even simpler problems, like the rating system: there's no guarantee the driver won't see what I rated him, so I won't report him.
There are ways to report a driver who has been sexually inappropriate with a woman, but somehow they just don't get kicked out of the driver network.
Also, just a simple example: Uber's engineering blog is full of posts about how they rewrote their app in native Android, then web, then native again, but nothing about how to solve the real problems people experience when riding with them.
It just feels like they view Uber as a simple logistics problem where drivers/riders are interchangeable, and less like Tinder, which tries to match people with similar scores and kicks out the worst.
The main problem I see is adding features slowly instead of shipping automatic rewrites.
I remember adding lifetimes to some structs and then wanting to use generics and self-references with lifetimes, because that made sense, and then it didn't work because that composition of features was not yet part of Rust.
Another thing: there are annotations for lifetimes in function signatures, but not inside function bodies, where a lot of inference magic happens that makes lifetimes really hard to understand and work with. Once the borrow checker finally started giving me errors, I got lots of lifetime errors that had never been shown before.
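A minimal Rust sketch of that asymmetry (the names here are made up): the lifetime `'a` has to be written out in the struct and the method signature, while the borrows inside `main` get inferred lifetimes that never appear in the source until the borrow checker complains about them:

```rust
// A struct borrowing data must name its lifetime parameter.
struct Parser<'a> {
    input: &'a str,
    pos: usize,
}

impl<'a> Parser<'a> {
    fn new(input: &'a str) -> Self {
        Parser { input, pos: 0 }
    }

    // 'a must be named here so callers know the returned slice
    // borrows from `input`, not from `self`.
    fn rest(&self) -> &'a str {
        &self.input[self.pos..]
    }
}

fn main() {
    let text = String::from("hello world");
    let rest;
    {
        let mut p = Parser::new(&text);
        p.pos = 6;
        // The lifetime of `p`'s borrow of `text` is inferred here;
        // nothing in this block names it.
        rest = p.rest();
    } // `p` is dropped, yet `rest` stays valid: it borrows `text`, not `p`.
    println!("{rest}"); // prints "world"
}
```

Change `rest(&self) -> &'a str` to the elided `rest(&self) -> &str` and the return value becomes tied to the (invisible) borrow of `self` instead, and this exact `main` stops compiling — which is the kind of error that only surfaces late.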
Rust should add these features but take out the old ones, with a guaranteed automatic upgrade path.
This is just not true; people get promoted for delivering impact, whether the solution is complex or simple.
The best engineer I know at working with huge, complex systems in a big company usually starts with a complex solution; then, once he understands what he wants to achieve, he thinks backwards and reimplements it as the smallest possible code change to the already complex system.
There are exceptions and geniuses to every rule. In general, however, a simple solution will be much more difficult to argue a promotion around, even if you make a ton of impact. You may get a top rating and a slightly larger bonus, but not a promotion.
Every large company has a promotion ladder full of words that basically come down to "complex": "drive a year-long initiative", "multiple teams", and "large complex task with multiple components" are all examples I've seen.
Yeah, that large-company promo thing drives me nuts. Perpetual gaslighting: "meh... that was too easy lol". Yeah, thanks. Often stuff isn't easy, but it's hard to explain why if the solution turned out to look easy.
What's funny is that you can dance through the hoops for 3-5 years for a promo, or grind LeetCode for 100 hours and get it by jumping ship.
I'm here to support both of your statements. This is absolutely true from orgs the size of FAANG down to startups, because I've worked at both. Sure, smooth talkers get promoted, but so do smart people who make things work better by simplifying.
Sure, and actually the open models are already good enough to do that; it's not like any company could stop an organization that can collect the data from doing this.
I don't really understand this reasoning, actually: if OpenClaw usage goes up and a service (OpenAI, it looks like) gets lots of usage data for personal-assistant use, they can optimize to make it better for people who get a $200 subscription just because of that use case.
For anybody who thinks it's about Trump vs. some other administration: it's not. Both AI surveillance of all people and using AI for automated warfare were simply bound to happen.
The only question is whether the safety work on these models was really done well enough to protect people and be a net positive force in the world.
I guess if they were safely trained to do more good than bad (as Dario and SamA have said), there wouldn't even be a need for the contract terms.
It would/will be extremely irresponsible to put non-deterministic and fallible models in charge of weapons. We are not close to having solved the problem of ensuring AI pursues good outcomes.
I agree completely. Anybody who uses the models extensively knows they can do something amazing for one prompt and something awful for another. But I also know that wars are unfortunately real, that countries have real enemies, and that they don't want a limited model.
Drones automatically targeting and killing people, with a thinking model guessing whether someone is a Russian or a Ukrainian, is probably a red line.
Elon Musk already refused to let Starlink be used for remote killing, but at some point all of these technologies will be nationalized, as they are too important not to be.