Hacker Newsnew | past | comments | ask | show | jobs | submit | crorella's commentslogin

underrated comment, this is going to be the main differentiator going forward, the more powerful and versatile harness the more the models will be able to achieve and better/more advanced products will come out of it.


* Parallel agents: Run multiple coding tasks at once so you can move faster across projects

* Isolated workspaces: Each task runs in its own worktree so changes stay organized and conflict-free

* Review changes: Inspect diffs and proposed edits before applying them to your codebase

* Stay in flow: Switch between tasks without losing context or progress

* Automate workflows: Reuse repeatable tasks with configurable skills

* Works with your tools: Open changes in your editor and integrate with your existing setup

* Track progress: Follow what each agent is doing in real time

* Windows sandboxing: Run agent mode inside an OS-level sandbox, with bounded permissions and explicit approval for escalation

* Ship faster: Move from idea to working code with less overhead


I am sad to know about this, Dan Simmons had a mind blowing amount of imagination and the ability to turn that into interesting and imaginative books that expanded my imagination when I read them.

I loved Hyperion cantos, Illium and then non sci-fi books like A Winter Haunting and Summer of night (which I read in the wrong order lol).

I am also happy to read that he was a great person overall and a great teacher. May he rest in peace.


Welcome :D



The variety of tasks they can do and will be asked to do is too wide and dissimilar, it will be very hard to have a transversal measurement, at most we will have area specific consensus that model X or Y is better, it is like saying one person is the best coder at everything, that does not exist.


Yea, we're going to need benchmarks that incorporate series of steps of development for a particular language and how good each model is at it.

Like can the model take your plan and ask the right questions where there appear to be holes.

How wide of architecture and system design around your language does it understand.

How does it choose to use algorithms available in the language or common libraries.

How often does it hallucinate features/libraries that aren't there.

How does it perform as context get larger.

And that's for one particular language.


The thrill of competition


Same here! I think it would be good if this could be made by default by the tooling. I've seen others using SQL for the same and even the proposal for a succinct way of representing this handoff data in the most compact way.


It’s like having 3 coins and users preferring one or the other when tossing it because one coin gives consistently more heads (or tails) than the other coin.

What is better is to build a good set of rules and stick to one and then refine those rules over time as you get more experience using the tool or if the tool evolves and digress from the results you expect.


<< What is better is to build a good set of rules and

But, unless you are on a local model you control, you literally can't. Otherwise, good rules will work only as long as the next update allows. I will admit that makes me consider some other options, but those probably shouldn't be 'set and iterate' each time something changes.


what I had in mind when I added that comment was for coding, with the use of .md files. For the web version of chats I agree there is little control on how to tailor the way you want the agent to behave, unless you give a initial "setup" prompt.


At this rate, in a few months we will have probably some high quality shorts entirely generated by this.


It's funny you mention this, I was just thinking this other day we may eventually be in a future where a group hangout party could look like this:

1. Goes to friends' place 2. Usual drinks, whatever gets you going activity 3. Each person writes a prompt 4. Chain them together 5. Watch the resulting movie together

That sounds hilarious and I can't wait to try


I'm vaguely reminded of the excellent Jackbox game Tee Fury, in which players submit slogans for T shirts and "art" separately. Players then get to choose from a few options for slogans and designs to make T shirts which are voted on by the group.

I have fond memories of laughing until I was in tears when playing with a group of friends over drinks during the lockdowns in 2020. Something about the process just naturally results in hilarity (especially if you're in a group where you can be offensive).

It's like exquisite corpse for t-shirts. Or, in your case, shorts.


T shirt game is the best jackbox game!

Whenever one of my friend groups is gathered we always make it a point to do an exquisite corpse story on a piece of paper while we’re inebriated in some way xD Video version will be wild


It's seriously so good, in fact it's so good that every other Jackbox game is vaguely disappointing because nothing is half as fun as Tee Fury lol.


In a few months, we'll have some high quality deep-fakes used to ruin people's personal lives.


We'll see. I think we'll see a high quality feature film first though, shorts are notoriously difficult to pull off.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: