Hacker Newsnew | past | comments | ask | show | jobs | submit | influx's commentslogin

At what point do these companies stop releasing models and just use them to bootstrap AGI for themselves?

Plausibly now. "As we wrote in the Project Glasswing announcement, we do not plan to make Mythos Preview generally available"

I remember when they didn't plan to give LLMs internet access for the same safety reasons.

Right now these models are basically good for automation, not innovation. Things like Karpathy's "auto research" where you use the model to automate your hyperparamter sweeps etc. The researcher/engineer decides what experiments they want to run, and builds an LLM harness to automate it, and the bottleneck remains the compute to run these experiments at scale.

Moving beyond LLMs to AGI, not just better LLMs, is going to require architectural and algorithic changes. Maybe an LLM can help suggest directions, but even then it's up to a researcher to take those on board and design and automate experiments to see if any of the ideas pan out.

Companies are already doing this, but they are never going to stop releasing/selling models since that is the product, and the revenue from each generation of model is what helps keep the ship afloat and pay for salaries and compute to develop the next generation.

The endgame isn't "AGI, then world domination" - it's just trying to build a business around selling ever-better models, and praying that the revenue each generation of model generates can keep up with the cost to build it.


Fictional timeline that holds up pretty well so far: https://ai-2027.com/

Welp, that was a scary read.

"So far" is two entries: "AI companies build bigger datacenters" and "AI is being used for AI research with modest success".

I think it is naive to think the government (US or China most probably) will just let some random company control something so powerful and dangerous.

I think it is naive to think that artificial super intelligence will be controlled by anyone.

If it is smarter than all humans combined at everything why would any humans collectively control the ai?

All the ants in your backyard still make no decisions vs you


You'd probably listen to those ants if they put you in a harness and had a little ant-sized remote control that could just, you know, turn you off.

Depending how long they wait to press that button, they might be surprised how little happens when they do.

Isn't the U.S. government at least completely asleep at the wheel or captured by the very same "random" companies? I realize the administration got all pissy with Anthropic but it sounds like the gov and gov contractors are still using their models.

Yeah but they still (at least to public knowledge) do not posses anything that could be called AGI. But as these capabilities increase they'll probably get an offer they can't refuse sooner or later.

When the benchmarks actually mean something

Can LLMs be AGI at all?

What can a SOTA LLM not answer that the average person can? It's already more intelligent than any polymath that ever existed, it just lacks motivation and agency.

And has ADHD, but yeah, I'm fairly convinced that AGI is already here.

My understanding is no. But the definition of AGI isn’t that well defined and has been evolving, making the assessment pretty much impossible

Can an LLM program real AGI faster than a human?

Good question. I would guess no - but it could help you build one. Am I mistaken?

They could help you build an AGI if someone else has already built AGI and published it on GitHub.

I see this statement all the time and it's just strange to me. Yes, the LLMs struggle to form unique ideas - but so do we. Most advancements in human history are incremental. Built on the shoulders of millions of other incremental advancements.

What i don't understand is how we quantify our ability to actually create something novel, truly and uniquely novel. We're discussing the LLMs inability to do that, yet i don't feel i have a firm grasp on what we even possess there.

When pressed i imagine many folks would immediately jest that they can create something never done before, some weird random behavior or noise or drawing or whatever. However many times it's just adjacent to existing norms, or constrained by the inversion of not matching existing norms.

In a lot of cases our incremental novelties feel, to some degree, inevitable. As the foundations of advancement get closer to the new thing being developed it becomes obvious at times. I suspect this form of novelty is a thing LLMs are capable of.

So for me the real question is at what point is innovation so far ahead that it doesn't feel like it was the natural next step. And of course, are LLMs capable of doing this?

I suspect for humans this level of true innovation is effectively random. A genius being more likely to make these "random" connections because they have more data to connect with. But nonetheless random, as ideas of this nature often come without explanation if not built on the backs of prior art.

So yea.. thoughts?


I really love Andrej Karpathy's take on LLMs as being instead of intelligence or sentience, a kind of cortical tissue.

It should be clear from working with LLMs over the past 4 years that they are not consciousness.

Andrej's appearance on the Dwarkesh podcast is great.


To be clear i agree with you, my question is more pointed at us - i'm not sure we have a good understanding of conciousness, nor that we are as we seem. Given how prone to hallucinations we are, how our subtle hormones can drastically alter what we perceive as our intelligence, self identity, etc.

I'm not convinced LLMs are anything amazing in their current form, but i suspect they'll push a self reflection on us.

But clearly i think humans are far more Input-Output than the average person. I'm also not educated on the subject, so what do i know hah.


No I think that’s accurate. They seem more like an oracle to me. Or as someone put it here, it’s a vectorization of (most/all?) human knowledge, which we can replay back in various permutations.

LLMs and human intelligence overlap, but they are not the same. What LLMs show is that we don't need AGI to be impressed. For example, LLMs are not good playing games such as Go [1].

[1] https://arxiv.org/abs/2601.16447


I don't see why not, especially with computer use and vision capabilities. Are you talking about their lack of physical embodiment? AGI is about cognitive ability, not physical. Think of someone like Stephen Hawking, an example of having extraordinary general intelligence despite severe physical limitations.

They already do, but not the way you said, the always have an internal model that is better and use themselves, they release based on competition.

I would assume somewhere in both the companies there's a Ralph loop running with the prompt "Make AGI".

Kinda makes me think of the Infinite Improbability Drive.


Probably right now because they're keeping it for themselves?

Weird how Claude Code itself is still so buggy though (though I get they don't necessarily care)

It isnt that weird. Just look at the gemini-cli repo. Its a gong show. The issue is that LLMs can be wrong sometimes sure but more that all the existing SDL were never meant to iterate this quickly.

If the system (code base in this case) is changing rapidly it increases the probability that any given change will interact poorly with any other given change. No single person in those code bases can have a working understanding of them because they change so quickly. Thus when someone LGTM the PR was the LLM generated they likely do not have a great understanding of the impact it is going to have.


why_not_both.gif

It will arrive in the same DLC as flying cars.

Now, I guess. They aren't releasing this one generally. I assume they are using it internally.

I mean, guess why Anthropic is pulling ahead...? One can have one's cake and eat it too.

Most people who ran AdSense were extremely careful not to look at the ads on their own site, because Google might flag them for intentionally inflating clicks or views.

When I was a teen, like, 15 years ago or so, my Google Ads account got permanently banned because I made the mistake of clicking on my own ad. :)

You know the whole point of the space race was to prove that we could send ICBMs to the USSR right?

> the whole point of the space race was to prove that we could send ICBMs to the USSR right?

No, it wasn’t. The real world seldom has single causation. Some people supported Apollo as a messaging exercise. Most had other reasons.

And in any case, there are easy ways to demonstrate ICBM competence. Pyongyang isn’t going to the Moon to prove it can bomb Alaska.


Do most SaaS actually delete or do they just store a tombstone in the database for your account?

If I understand GDPR and “the Right to be forgotten” properly, then yes - they would have to actually delete the information.

Edit: at least when it comes to PII, which I presume should include photos of you, or any personal detail of you. The content you may have posted there up until then - that might be a different story


Aren't they trying to do a Valve with their store?


You’d be surprised how many features the Python runtime adds each release. It’s not trivial for tooling to keep up with language changes.


I would have thought most of those would have been moved to async Python by now.


async python still uses a single thread for the main loop, it just hides non blocking IO.


The capital class?


They could have, but it seems unlikely they targeted one or two repos and probably cloned thousands or more.


Imagine the outrage here, when a company credit card expires and the cloud provider terminates all their instances, deletes all your storage and blob backups?


That does happen, it's just usually not when the card expires, but when the follow up billing emails get ignored for some period.

This is one of the reasons people have suggested using a different provider for backups.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: