@MagicShel

MagicShel@lemmy.zip · 2 hours ago

“I want to add a command line option that auto generates helloworld.exe”

“That’ll be $21,000.”

MagicShel@lemmy.zip · 2 hours ago

If you were so smart you’d have wads of cash like them. They got where they are through sheer grit and bootstraps and a paltry $50 million from their family.

MagicShel@lemmy.zip · 2 hours ago

I agree with you on a technical level. I still think LLMs are transformative of the original text and if

when the number of sources that’s what ultimately created the volume of the N-dimensional probabilistic space they’re following is very low.

then the solution is to feed it even more relevant data. But I appreciate your perspective. I still disagree, but I respect your point of view.

I’ll give what you’ve written some more thought and maybe respond in greater depth later but I’m getting pulled away. Just wanted to say thanks for the detailed and thorough response.

MagicShel@lemmy.zip · 3 hours ago

I respect the fuck out of anyone who does jobs I can’t or don’t want to. The guy who empties septic tanks has my genuine respect and appreciation because if not for him I would have a really shitty job on my hands. Hopefully the humor doesn’t undercut the sincerity of my comment.

MagicShel@lemmy.zip · 3 hours ago

Automation is trying to come for us all. And white collar workers are currently the prominent “beneficiaries” largely because so much blue collar work is automated that you don’t realize how much they have already been decimated. Metalsmiths, woodworkers, miners, steelworkers, car makers, plus all the service jobs eliminated by disposable consumer goods that used to be expected to last 50 years and now last 7, due in part to automated mass manufacturing.

MagicShel@lemmy.zip · 4 hours ago

This is interesting and the article makes this very clear up front but the title is a little clickbait-y, because this requires a fully compromised device. I think it should be fairly obvious that if your device is fully compromised that built in software safeguards are not reliable.

MagicShel@lemmy.zip · edit-2 4 hours ago

Thank you. Great addition. That was a very interesting read, though I need to be more awake for reading technical writing like that 🥱.

My point about spending $20k to produce garbage, then, was actually realized in this “perfect” use case.

MagicShel@lemmy.zip · 5 hours ago

Hey, so I started this comment to disagree with you and correct some common misunderstandings that I’ve been fighting against for years. Instead, as I was formulating my response, I realized you’re substantially right and I’ve been wrong — or at least my thinking was incomplete. I figured I’d mention because the common perception is arguing with strangers on the internet never accomplishes anything.

LLMs are not fundamentally the plagiarism machines everyone claims they are. If a model reproduces any substantial text verbatim, it’s because the LLM is overtrained on too small of a data set and the solution is, somewhat paradoxically, to feed it more relevant text. That has been the crux of my argument for years.

That being said, Anthropic and OpenAI aren’t just LLM models. They are backed by RAG pipelines which are verbatim text that gets inserted into the context when it is relevant to the task at hand. And that fact had been escaping my consideration until now. Thank you.

MagicShel@lemmy.zip · 7 hours ago

I just posted where I found the source in another comment. It would have probably the information you’re interested in.

MagicShel@lemmy.zip · 7 hours ago

Here is the original cite that my company pulled that from if you want more details.

I’ve never written a compiler, nor in Rust, so I have no idea the effort involved. I’m just boggling over the price tag. I’ll bet that’s the cost of an entire offshore team.

MagicShel@lemmy.zip · 16 hours ago

At work today we had a little presentation about Claude Cowork. And I learned someone used it to write a C (maybe C++?) compiler in Rust in two weeks at a cost of $20k and it passed 99% of whatever hell test suite they use for evaluating compilers. And I had a few thoughts.

99% pass rate? Maybe that’s super impressive because it’s a stress test, but if 1% of my code fails to compile I think I’d be in deep shit.
20k in two weeks is a heavy burn. Imagine if what it wrote was… garbage.
“Write a compiler” is a complete project plan in three words. Find a business project that is that simple and I’ll show you software that is cheaper to buy than build. We are currently working on an authentication broker service at work and we’ve been doing architecture and trying to get everyone to agree on a design for 2 months. There are thousands of words devoted to just the high level stuff, plus complex flow diagrams.
A compiler might be somewhat unique in the sense that there are literally thousands of test cases available - download a foss project and try to compile it. If it fails, figure out the bug and fix it. Repeat. The ERP that your boss wants you to stand up in a month has zero test coverage and is going to be chock full of bugs — if for no other reason than you haven’t thought through every single edge case and neither has the AI because lots of times those are business questions.
There is not a single person who knows the code base well enough to troubleshoot any weird bugs and transient errors.

I think this is a cool thing in the abstract. But in reality, they cherry picked the best possible use case in the world and anyone expecting their custom project is going to go like this will be lighting huge piles of money on fire.

MagicShel@lemmy.zip · 6 days ago

Used how?

Like, Claude, write up an operational plan for capturing President Maduro.?

Or like, Claude, turn these crayon drawings into tactical plans.?

Or like, Claude, help me find Brazil on a map.?

MagicShel@lemmy.zip · 12 days ago

If you are a woman alone in the woods, would you rather come across an unknown man, or a bear? It’s a thought experiment. As a human woman, which represents a greater immanent threat?