barefootford · 2026-03-03T22:26:09 1772576769

How long does it take to run this? How much is automated vs manual evals?

mateianghel · 2026-03-03T22:30:54 1772577054

20-30 mins and 10-20 dollars. The evals are fully automated, with Gemini 3 Flash as a judge, but I manually verified a lot of them and it grades the outputs reliably.

barefootford · 2026-02-26T20:36:15 1772138175

Really impressive work guys! It seems like YC has funded a few companies attacking this but I think you all might have the best approach so far. Behind the scenes is the agent just editing using text/annotated timelines? I feel like the move is probably text for roughcut/narrative, then a vlm for digesting the initial roughcut, then adding broll and fixing timing issues. Feel free to steal my FCP xml generator. https://github.com/barefootford/buttercut

sxmawl · 2026-02-26T21:07:09 1772140029

happy that you liked our approach! also, i think it's a better idea to just give the agent these tools and let it figure out its own course of action than to give it a specific workflow to follow - it seems like the world keeps reminding us of the bitter lesson [http://www.incompleteideas.net/IncIdeas/BitterLesson.html] more frequently these days

will definitely check the XML exports, ty :)

barefootford · 2026-02-27T01:49:41 1772156981

Theoretically I agree, but practically without guidance agents aren't really able to edit video ATM. Without hand holding Claude will just call ffmpeg and look at a few frames.

sxmawl · 2026-02-27T02:27:55 1772159275

yeah we just ask the user a lot more questions to begin with

barefootford · 2026-01-05T22:18:34 1767651514

Sorta wish they would ship the last watches before announcing more that are months away. I’m pretty sure I preordered 9 months ago and still have nothing to show for it.

modeless · 2026-01-05T22:51:54 1767653514

They're announcing now so that you can switch your preorder if you want. I got my Pebble 2 Duo a while ago and have been wearing it ever since.

barefootford · 2026-01-06T00:55:35 1767660935

Also ordered a pebble 2 duo, and haven't been wearing it ever since.

apparent · 2026-01-05T23:06:20 1767654380

I think CES might have had something to do with this.

barefootford · 2025-11-14T19:24:17 1763148257

I switched from structured outputs on OpenAI apis to unstructured on Claude (haiku 4.5) and haven't had any issues (yet). But guarantees are always nice.
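For anyone going the unstructured route, here is a minimal sketch of the kind of check that stands in for the missing guarantee: pull the first JSON object out of a free-form reply and confirm it has the keys you need. The function name `parse_reply` and the required keys are hypothetical, not part of any vendor API.

```python
import json

def parse_reply(text, required=("title", "start", "end")):
    """Extract the first JSON object from free-form model text and check its keys.

    The key names are placeholders for illustration only.
    """
    decoder = json.JSONDecoder()
    idx = text.find("{")
    while idx != -1:
        try:
            # raw_decode parses one JSON value starting at idx and ignores trailing text.
            obj, _ = decoder.raw_decode(text, idx)
        except json.JSONDecodeError:
            # Not valid JSON from this brace; try the next one.
            idx = text.find("{", idx + 1)
            continue
        missing = [key for key in required if key not in obj]
        if missing:
            raise ValueError(f"missing keys: {missing}")
        return obj
    raise ValueError("no JSON object found")
```

On a `ValueError` the caller can retry the request, which is roughly the fallback you give up the need for when the API enforces a schema.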

barefootford · 2025-11-10T23:47:22 1762818442

I've wanted to build a video editing agent this year but I kept putting it off because I thought it would take too long. Having simple Claude Skills as markdowns + a little bit of Ruby made this a lot easier than I expected.

Behind the scenes we transcribe audio, analyze frames, and then build a combined/combed transcript of all the footage. Claude can then build rough cuts by just laying out clips with the correct timestamps. After a yaml rough cut is built, a Ruby library generates basic XML for Final Cut or Premiere.
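To make the "lay out clips with the correct timestamps" step concrete, here is a rough sketch in Python (the actual project uses Ruby, and this is not its code). The rough cut fields (`file`, `start`, `end`) and the output element names are made up for illustration; real Final Cut or Premiere XML is far more involved.

```python
import xml.etree.ElementTree as ET

# Hypothetical rough cut: each entry names a source file plus in/out points in seconds.
rough_cut = [
    {"file": "interview.mov", "start": 12.0, "end": 20.5},
    {"file": "broll.mov", "start": 3.0, "end": 7.0},
]

def build_timeline(clips):
    """Lay clips end to end and return a simplified (non-FCP) XML element."""
    timeline = ET.Element("timeline")
    offset = 0.0  # where the next clip lands on the timeline
    for clip in clips:
        duration = clip["end"] - clip["start"]
        if duration <= 0:
            raise ValueError(f"clip {clip['file']} has non-positive duration")
        ET.SubElement(timeline, "clip", {
            "src": clip["file"],
            "offset": f"{offset:.3f}",
            "start": f"{clip['start']:.3f}",
            "duration": f"{duration:.3f}",
        })
        offset += duration
    return timeline

print(ET.tostring(build_timeline(rough_cut), encoding="unicode"))
```

The point is that the model only has to produce the small list of clips; the running timeline offset and the XML serialization are handled by ordinary deterministic code.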

barefootford · 2025-10-09T23:03:15 1760050995

The design taste on almost all these humanoid bots seems off. If I'm going to have some AI-powered alien in my house I want it to be and appear harmless, helpful, maybe even cute? These look creepy and dystopian.

barefootford · 2025-10-03T20:03:46 1759521826

I look forward to trying this out. Any benchmarks or demos on how long it actually takes to restore? I ended up cooking my own boring S3 backup because previously litestream took 20 minutes to restore something like 1000 rows. It felt extremely unoptimized. How long does restoration take today?

barefootford · 2025-08-25T20:43:08 1756154588

This is so nice. This feels like Postico for sqlite. Definitely switching from TablePlus to this.

barefootford · 2025-08-04T20:28:28 1754339308

gpt doesn't respect masks

icelancer · 2025-08-04T20:48:00 1754340480

Correct. Have tried this without much success despite OpenAI's claims.

barefootford · 2025-06-29T16:27:40 1751214460

Hey I’ve been trying to get Claude code to generate Final Cut xml myself. Mostly I just have a Claude.md with the FCP xml reference and some guidelines. What does this do differently?

fcpguru · 2025-06-29T19:53:31 1751226811

Well, the format is incredibly complex. You have to generate the XML correctly in every last detail or FCP will crash when you import it. I learned the hard way that you need a robust validator system, not just tests. Before the XML goes out the door you need a last line of defense to catch problems. The Go version of cutlass is pretty good at this now and the Python version is catching up.
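As a rough illustration of the "last line of defense" idea (not cutlass's actual validator), a pre-export check could look something like the sketch below. It assumes a simplified convention where elements point at resources through a `ref` attribute that must match some `id` in the same document; a real validator would check much more than this.

```python
import xml.etree.ElementTree as ET

def validate(xml_text):
    """Return a list of problems; an empty list means the document passed."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as exc:
        # Malformed XML: nothing else is worth checking.
        return [f"not well-formed: {exc}"]
    problems = []
    # Collect every id declared anywhere in the document.
    ids = {el.get("id") for el in root.iter() if el.get("id") is not None}
    # Every ref must resolve to one of those ids.
    for el in root.iter():
        ref = el.get("ref")
        if ref is not None and ref not in ids:
            problems.append(f"<{el.tag}> references unknown id {ref!r}")
    return problems
```

Running this on the final string right before it is written to disk catches dangling references even when the unit tests for the generator code all pass.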

parentheses · 2025-06-29T18:47:47 1751222867

The idea here (I think) is that it (1) can be a program with code used to generate the XML and (2) is typed.

Just by virtue of being a go program it enables even more sophisticated validation and automation if you want to implement it.

fcpguru · 2025-06-29T22:08:59 1751234939

hehe now I feel funny working on the python version too. It depends on my mood, yeah sometimes I want go. But sometimes I want a little python.