Hacker Newsnew | past | comments | ask | show | jobs | submit | barefootford's commentslogin

How long does it take to run this? How much is automated vs manual evals?


20-30 mins and 10-20 dollars. It's fully automated evals with Gemini 3 Flash as a judge, but I verified manually a lot of them and it grades the outputs reliably.


Really impressive work guys! It seems like YC has funded a few companies attacking this but I think you all might have the best approach so far. Behind the scenes is the agent just editing using text/annotated timelines? I feel like the move is probably text for roughcut/narrative, then a vlm for digesting the initial roughcut, then adding broll and fixing timing issues. Feel free to steal my FCP xml generator. https://github.com/barefootford/buttercut


happy that you liked our approach! also, i think it's a better idea to just give agent these tools and let it figure out its course of actions than giving it a specific workflow to work on - it seems like the world keeps reminding us the bitter lesson [http://www.incompleteideas.net/IncIdeas/BitterLesson.html] more frequently these days

will definitely check the XML exports, ty :)


Theoretically I agree, but practically without guidance agents aren't really able to edit video ATM. Without hand holding Claude will just call ffmpeg and look at a few frames.


yeah we just ask a lot more questions to user to begin with


Sorta wish they would ship the last watches before announcing more that are months away. I’m pretty sure I preordered 9 months ago and still have nothing to show for it.


They're announcing now so that you can switch your preorder if you want. I got my Pebble 2 Duo a while ago and have been wearing it ever since.


Also ordered a pebble 2 duo, and haven't been wearing it ever since.


I think CES might have had something to do with this.


I switched from structured outputs on OpenAI apis to unstructured on Claude (haiku 4.5) and haven't had any issues (yet). But guarantees are always nice.


I've wanted to build a video editing agent this year but I kept putting it off because I thought it would take too long. Having simple Claude Skills as markdowns + a little bit of Ruby made this a lot easier than I expected.

Behind the scenes we transcribe audio, analyze frames, and then build a combined/combed transcript of all the footage. Claude can then build rough cuts by just laying out clips with the correct timestamps. After a yaml rough cut is built, a Ruby library generates basic XML for Final Cut or Premiere.


The design taste on almost all these humanoid bots seems off. If I'm going to have some AI-powered alien in my house I want it to be and appear harmless, helpful, maybe even cute? These look creepy and dystopian.


I look forward to trying this out. Any benchmarks or demos on how long it actually takes to restore? I ended up cooking my own boring S3 backup because previously litestream took 20 minutes to restore something like 1000 rows. It felt extremely unoptimized. How long does restoration take today?


This is so nice. This feels like Postico for sqlite. Definitely switching from TablePlus to this.


gpt doesn't respect masks


Correct. Have tried this without much success despite OpenAI's claims.


Hey I’ve been trying to get Claude code to generate Final Cut xml myself. Mostly I just have a Claude.md with the FCP xml reference and some guidelines. What does this do differently?


Well, the format is incredibly complex. You have to generate the XML correct in every last detail or FCP will crash when you import it. I learned the hard way you need a robust validator system and not just tests alone. Before XML goes out the door you need a last line of defense to catch problems. The go version of cutlass is pretty good at this now and python is catching up.


The idea here (I think) is that it (1) can be a program with code used to generate the XML and (2) is typed.

Just by virtue of being a go program it enables even more sophisticated validation and automation if you want to implement it.


hehe now I feel funny working on the python version too. It depends on my mood, yeah sometimes I want go. But sometimes I want a little python.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: