I have a similar claude story (much less money though), with the IRS R&D tax credit. The auditing firm initially said we qualify for $0. But then I had claude analyze past R&D reports and our expenses and it found the problem. The auditor had miscategorized our company.
So claude drafted an email even pointing to the right Internal Revenue Code (IRS Law), and specify why we fall under a specific category. The auditor got back to me two days later admitting their mistake and said our company now qualifies for $8k in tax credits. And a few months ago, it identified items in our AWS that saved us $250 a month (paying for itself).
So now I joke that even if I have a claude max plan, I've still come out ahead financially.
By my estimation (guess) you won't actually need to spend that much because the models are already getting a point where they don't need to get a whole lot better to be extremely helpful across many domains.
And it looks like those very helpful capabilities will continue to transfer to smaller models as well, as architectures and training regimes continue to refine.
I can fairly easily imagine a world where the only people needing to spend a lot of money on models are those that are using them to solve truly novel problems. The rest of us will get plenty of use at reasonable costs for the typical day-to-day helpful stuff.
[sci-fi “AGI” scenario] What if those with elite model access philosophize in a way us mere mortals can’t understand, so the elites have to prechew the ideas for us to bring them to our level, and they control the narrative?
In reality now, curious about social implications generally. Does this go beyond problem solving? Maybe the intelligence per token you get via your free library card/membership is insufficient to compete with peers in dating/employment/etc. markets, thus puts you at disadvantage.
To be fair regarding taxes it should be that you get all exemptions by default and the other side telling/justifying you why you don't qualify, instead of you not getting the things you should because you're not sure how to interpret the law or don't know this or that rule exists. Taxes shouldn't be that difficult, and the US version of it seems to be behind even the one I have here in europe (where my taxes are done "for me" for anything non business owner related). If the government is calculating your taxes anyway, they should just give the number to you instead of asking you a number and you better have the same as us or you're guilty of something.
Feels like a system that is deliberatly made to be more punishing for those who can't afford the help or the education to figure things out.
I would agree with you on most situations (like 1040 personal income taxes especially).
But in this particular tax credit, there's no way for the gov to know automatically what percentage of payroll was spent in qualified R&D expenses, since it's day to day business operations. Which is why we are _forced_ to hire an outside firm and pay them thousands of dollars (when Claude did an even better job), just to analyze how much of our time qualified as R&D expenses.
The problem I have is that I am forced to have to find a firm to do this, and most firms won't even work with companies as small as ours. So then we're stuck and losing out on years of R&D tax credits at the moment, when I really don't need them anymore, to be honest.
Claude Code is really good at stuff like this. The other day I tried to recover some images from an SD card that had gone bad. I used GetDataBack to recover files, but they appeared to be malformed and didn't open in image viewers.
I tasked Claude to analyze the files and figure out what's going on, and eventually we figured out that each file had a custom metadata header + thumbnail + actual image concatenated. I had it write a python script and was able to recover all the images with their metadata. It's nothing a human couldn't have figured out, but it was definitely WAY faster than doing it myself.
I've also used Claude in the past to figure out how to break into routers with locked down firmware. It's great at suggesting and trying different approaches.
I have a friend that just picked up a new consulting job resurrecting an ancient Windows desktop application. No source control, no tests. And it's spread out over a dozen different folders with names like "_old", "_new" and "dates". Claude's doing a tremendous job in getting him to grips with what is actually happening in the application, what's relevant, what's not, what's different. I think it's literally saving him days and days at work.
if your friend has access to the binary and can pull it out to different box, they might get a lot out of a ghidra mcp -> https://github.com/LaurieWired/GhidraMCP
I'm not well versed at reverse engineering binaries or interpreting C/assembly so ghidra MCP has been an absolute gamechanger for helping me write tools. Once my project is complete, I plan to learn how to do the analysis myself manually and have cc guide me along the way.
> I have a friend that just picked up a new consulting job resurrecting an ancient Windows desktop application. No source control, no tests. And it's spread out over a dozen different folders with names like "_old", "_new" and "dates".
That doesn't sound very impressive. Not being tracked with a version control system is fixed instantly with a git init, git add ., git commit .no AI required.
Covering the app with tests is also something that requires no AI. At most, coding agents can generate characterization tests in broad sweeps, but we are talking about a delta between hand rolling and vibe-coding of a couple of days.
Where LLM shines is helping developers build up an understanding of what is in place. Running /explain on a codebase can quickly provide you with a high level summary of what's in place.
The relevancy here is that he's denied the git history, versioning, branches, implicit documentation that even bad source control practices would have given him.
That's what the comment is saying. In normal repositories, version control acts as a record of the momentum of the direction the product was taking. If it's just "_old" and "_new," the developer has to read and understand both, which I think is going to be far more time consuming than your estimation.
I did EXACTLY that last night. Was doing by hand for about an hour and got to a point where I didn’t feel competent anymore and asked Claude to take from where I was.
5 minutes later I had almost 3 hours of important footage recovered.
I'm sure data recovery companies are pretty pissed that slightly esoteric data recovery abilities are becoming more accessible for average software devs. They were charging an arm and a leg to remote in and run scripts.
They still have two important moats: (1) expensive hardware tools (even stuff like SATA write blockers are kind of expensive for what they are), spare hard drive collections to swap failed PCBs, etc and (2) the "nobody got fired for hiring us" edge similar to how everyone calls in Crowdstrike/Mandiant after an incident. If a suit-level manager finds out customer data was lost, they are going to want to call in an expert so they can immediately tell the customer they did, not have the same internal team try to figure it out.
As an aside to #1: The cool thing is in modern times the hardware tools have come down stupidly cheap in price. Even SD card recovery is (vaguely) in the right skilled hands in a pseudo-professional home lab these days.
A lot of "Claude Code is best at X" claims are probably user-selection bias.
The people saying it are often exclusively Claude Code users, not people who are actively benchmarking Claude Code against Gemini CLI, OpenAI Codex, GitHub Copilot, and other agent harnesses on the same tasks.
The claim may still be true for certain scenarios, but the evidence is usually anecdotal, not comparative.
When I hear "claude code one-shotted X" and X is a novel problem, I mentally substituted "the agentic harness that I tried one-shotted X," since that's what they're saying.
Getting any smart model to take a look at the task is the sort of lift that the speaker is usually pointing to.
> Bitcoin trader recovers $400,000 using Claude AI after getting 'stoned' and losing wallet password 11 years ago — bot tried 3.5 trillion passwords before decrypting an old wallet backup
Man. I wish I had a lost wallet worth a quarter of that even, technically didn't need Claude for this, just needed any password cracking software.
Explaining your life to an llm, then having it generate permutations of passwords to try does sound like it would work a decent percentage of the time.
A large percentage of passwords aren't a random string of characters but a memorable word + memorable number. There's existing projects that basically do the same, and 3.5 trillion doesn't really make it clear if one of those wouldn't have worked as well, but I can see it having an above random chance to guess a password.
Way back in the day when Bitcoin first came about, I once idly contemplated spending some time and money on it just because it was a very cool technology. At the time it was a bit of a hassle because you had to mine your own.
Then I was especially tempted years later after running into the MtGox booth at CES, and seeing how convenient it had become. I remember asking a guy at the booth if Satoshi was really still anonymous or if any insiders knew about him, and he said "no" but was bit surprised I knew about Satoshi. I guess Bitcoin was still quite niche then even amongst a technical crowd.
I considered buying a few bucks worth of bitcoin then for lulz, but I thought that money was better spent on beer lol.
I've never really regretted spending that money on beer rather than bitcoin, because I knew that even if I did, it would 100% have been on MtGox and I would have lost it in the hack anyway, which would have been even more bitterly frustrating.
I was making a long edit in a crappy wiki UI and my browser froze. It would have taken a long time to redo, hours.
I didn't want to take the chance of force closing and losing everything. I used claude code to extract my text out of the browser internals and filesystem objects.
Many crypto wallets use a key derivation function (KDF) to add an amount of computation (and memory usage) per password tried - to mitigate brute force of weak passwords.
The increase in compute (decrease in brute-force cost) combined with price increases in many crypto tokens means brute-forcing old wallets can become worth it years after passwords were forgotten.
And of course even smaller, local AI models can now easily write optimized scripts to brute-force any given KDF function.
The compute power needed use to be of the order of 5s per password try.
So it effectively mitigate brute force back them, you need a absurd compute power to crack them.
Moore law did its thing, now you can do it with a lot less computer power.
A wallet is just private keys of some specific public keys on the blockchain that have unspent output (UTXO). None of what’s described in this article involves the blockchain, only the storage and protection of the private keys on a local computer.
You can imagine that in your example, you didn’t change the locks on a house, but rather you put the house keys in a secure lock box and you changed the locks on this box.
Changing the locks on a house in this case means transferring from an old wallet to a new wallet and then abandoning the old wallet. That’s exactly what the OP is trying to do. It’s just that you need the original key to do it.
The other day, I asked Claude to track down the leaked Claude Code source so I could study it. It refused, saying “given who made me, I’ll pass.” It gave me some pointers on how to find it myself, which worked.
There isn’t that much of a difference between “help me crack this bitcoin wallet” and “help me crack this executable.”
I don’t exactly have a solid point, just some general observations. First, I think we’ll see AI more and more simply refuse to do any kind of forensics, as forensics becomes more powerful. Second, that implies local models will become more valuable, since they’re the only ones willing to do that kind of work.
I once got myself banned from Claude by researching barbiturates, since they’re connected with suicide. So my third observation is that we’ll see an uptick in people getting punished for trying to do things with AI that people don’t usually do. (Luckily the unban form worked.)
Someone downthread asked “how’d he convince Claude the coins weren’t stolen?” Which is an interesting question, because presumably some people trying to crack a wallet have stolen it. So I guess the fourth observation is that the exact framing you approach an AI with will become more important. There was the classic “do this or I’ll cut off my arm,” which worked a year ago. But in the future it will be more like “hopefully the AI believes my story, or else I’ll get into trouble.”
It’s good there are multiple AI vendors, or else it’d get real dystopian real fast when the de facto AI’s policy becomes something you have no way of working around.
Feels less like "ai cracked crypto" and more like having an insanely patient technical friend sitting next to you for 12 hours doing digital archaeology.
I spent a couple of days mining many years ago and got 2 bitcoins. At the time, they weren't worth the electricity they cost to mine and over time I lost the wallet and all information related to it.
You are better than me. Back when you could mine BTC with the CPU, i had about 2 coins. I found it useless and silly and deleted my wallet at some point :)
>they weren't worth the electricity they cost to mine...I'd love to mine a bitcoin a day on my PC now...
you can!... but they wouldn't be worth the electricity now either. the cost of mining (amortization of hardware costs plus electricty) is the value of bitcoin. if bitcoins are a bargain to mine, more people will mine them thereby reducing rewards.
should you have mined more back then if you had magical perfect knowledge of the future? no: they weren't worth the electricity.
instead you should have bought more of them back then.
On a purely technical level - cool. But I still cannot get over the impression that even in this case LLMs show us how they are mainly useful to grifters. I mean, 5 Bitcoins worth 400K USD. Why? What intrinsic value does Bitcoin deliver? It's like trading for monopoly money.
"the user dumped their entire college computer files in Claude in a last-gasp effort."
Claude has limits that would make this simple statement be much more complicated-
Via Claude "So the chat upload file size limit is actually 500MB per file (not 30MB as many third-party sources claim - those appear to be outdated). The 20-file-per-chat cap and the 30MB-per-file limit in Projects remain consistent across plans.
The real constraint at any subscription level remains the context window - how many tokens Claude can hold in memory at once during a conversation. "
Yesterday I had ChatGPT walk me through fixing my single-node k3s cluster. It required rebuilding the sqlite database (while skipping a few corrupted records), then clearing the containerd cache, and then finally deleting a somehow-corrupted Secret record, and then recreating it.
Without it I would have given up way earlier, but the infinite patience to keep slurping in error messages and continue to troubleshoot really worked out.
Yeah Claude is really, really good there. You tell it the distro and the problem and it will solve it. Saved me a lot of pain when it came to swapping out an encrypted boot drive and was good about emphasizing the order of operations required for what I would consider a higher risk/complexity situation.
Claude found a file on the computer that the wallet owner had not found. Claude didn't crack a password or do anything magic, it just searched for a file that the wallet owner had not thought to search for before.
So, where the wallet owner had previously only tried to access /Users/example/wallet.dat, Claude thought, "why don't I check if there is another wallet.dat file elsewhere on the system?" which it did.
The outcome is the same, it is great that Claude tried something that the wallet owner hadn't tried, but this is more an example of how dumb humans can be rather than how smart Claude is.
The trillions of passwords are a red herring and unrelated to the solve.
Claude found an old wallet and then ran btcrecover on that. The question is why the user could not find an old wallet with any numbers of Unix tools himself.
Since we are dealing with Anthropic, the entire story could be staged of course.
Claude ran a ctrl+f on his file system. Groundbreaking. Insane the dude hadn't figured this out for himself considering a few years salary was just sitting there.
> The bot uncovered an old backup wallet file that it successfully decrypted, while also uncovering a bug in the password configuration that was preventing recovery up to that point.
I know that we're all experts in archaic backup mechanisms and the encryption systems they used, but I think this qualifies as doing more than Ctrl+F
I've tried Claude Code with another LLM, it's very good at doing tasks and figuring things out. So this made me wonder, even though we know how good Claude models is, maybe the true value is in the harness now?
So claude drafted an email even pointing to the right Internal Revenue Code (IRS Law), and specify why we fall under a specific category. The auditor got back to me two days later admitting their mistake and said our company now qualifies for $8k in tax credits. And a few months ago, it identified items in our AWS that saved us $250 a month (paying for itself).
So now I joke that even if I have a claude max plan, I've still come out ahead financially.
This is no joke; for better or worse, I see a day when I'm paying a lot more for this and it will be a bargain.
And it looks like those very helpful capabilities will continue to transfer to smaller models as well, as architectures and training regimes continue to refine.
I can fairly easily imagine a world where the only people needing to spend a lot of money on models are those that are using them to solve truly novel problems. The rest of us will get plenty of use at reasonable costs for the typical day-to-day helpful stuff.
In reality now, curious about social implications generally. Does this go beyond problem solving? Maybe the intelligence per token you get via your free library card/membership is insufficient to compete with peers in dating/employment/etc. markets, thus puts you at disadvantage.
Feels like a system that is deliberatly made to be more punishing for those who can't afford the help or the education to figure things out.
But in this particular tax credit, there's no way for the gov to know automatically what percentage of payroll was spent in qualified R&D expenses, since it's day to day business operations. Which is why we are _forced_ to hire an outside firm and pay them thousands of dollars (when Claude did an even better job), just to analyze how much of our time qualified as R&D expenses.
The problem I have is that I am forced to have to find a firm to do this, and most firms won't even work with companies as small as ours. So then we're stuck and losing out on years of R&D tax credits at the moment, when I really don't need them anymore, to be honest.
I tasked Claude to analyze the files and figure out what's going on, and eventually we figured out that each file had a custom metadata header + thumbnail + actual image concatenated. I had it write a python script and was able to recover all the images with their metadata. It's nothing a human couldn't have figured out, but it was definitely WAY faster than doing it myself.
I've also used Claude in the past to figure out how to break into routers with locked down firmware. It's great at suggesting and trying different approaches.
That doesn't sound very impressive. Not being tracked with a version control system is fixed instantly with a git init, git add ., git commit .no AI required.
Covering the app with tests is also something that requires no AI. At most, coding agents can generate characterization tests in broad sweeps, but we are talking about a delta between hand rolling and vibe-coding of a couple of days.
Where LLM shines is helping developers build up an understanding of what is in place. Running /explain on a codebase can quickly provide you with a high level summary of what's in place.
5 minutes later I had almost 3 hours of important footage recovered.
https://blog.acelab.eu.com/pc-3000-flash-spider-board-adapte...
A lot of "Claude Code is best at X" claims are probably user-selection bias.
The people saying it are often exclusively Claude Code users, not people who are actively benchmarking Claude Code against Gemini CLI, OpenAI Codex, GitHub Copilot, and other agent harnesses on the same tasks.
The claim may still be true for certain scenarios, but the evidence is usually anecdotal, not comparative.
Getting any smart model to take a look at the task is the sort of lift that the speaker is usually pointing to.
TBF the real breakthrough was finding this, though no doubt they couldn't have recovered without Claude
Man. I wish I had a lost wallet worth a quarter of that even, technically didn't need Claude for this, just needed any password cracking software.
A large percentage of passwords aren't a random string of characters but a memorable word + memorable number. There's existing projects that basically do the same, and 3.5 trillion doesn't really make it clear if one of those wouldn't have worked as well, but I can see it having an above random chance to guess a password.
Thank you MtGox.
Then I was especially tempted years later after running into the MtGox booth at CES, and seeing how convenient it had become. I remember asking a guy at the booth if Satoshi was really still anonymous or if any insiders knew about him, and he said "no" but was bit surprised I knew about Satoshi. I guess Bitcoin was still quite niche then even amongst a technical crowd.
I considered buying a few bucks worth of bitcoin then for lulz, but I thought that money was better spent on beer lol.
I've never really regretted spending that money on beer rather than bitcoin, because I knew that even if I did, it would 100% have been on MtGox and I would have lost it in the hack anyway, which would have been even more bitterly frustrating.
A few of pints of beer >> years of regret.
Whew, that brings me back!
I still think about the Bitcoin my buddy paid me for his half of a pizza ~15 years ago... worth 6 figures now haha.
Everyone who had coin in Mt.Gox lost it during a hack. A portion of that was returned to the users who had a loss about a year ago.
The first pizza anybody bought that way cost 10,000 bitcoin, over $billion.
I didn't want to take the chance of force closing and losing everything. I used claude code to extract my text out of the browser internals and filesystem objects.
The increase in compute (decrease in brute-force cost) combined with price increases in many crypto tokens means brute-forcing old wallets can become worth it years after passwords were forgotten.
And of course even smaller, local AI models can now easily write optimized scripts to brute-force any given KDF function.
Moore law did its thing, now you can do it with a lot less computer power.
In the physical world, I can't imagine too many people being happy that old keys to your house still work even after you've changed the locks.
Can someone more informed, help me understand how this worked and why it's ok.
I'm genuinely wanting to become more informed & better understand.
You can imagine that in your example, you didn’t change the locks on a house, but rather you put the house keys in a secure lock box and you changed the locks on this box.
Changing the locks on a house in this case means transferring from an old wallet to a new wallet and then abandoning the old wallet. That’s exactly what the OP is trying to do. It’s just that you need the original key to do it.
The other day, I asked Claude to track down the leaked Claude Code source so I could study it. It refused, saying “given who made me, I’ll pass.” It gave me some pointers on how to find it myself, which worked.
There isn’t that much of a difference between “help me crack this bitcoin wallet” and “help me crack this executable.”
I don’t exactly have a solid point, just some general observations. First, I think we’ll see AI more and more simply refuse to do any kind of forensics, as forensics becomes more powerful. Second, that implies local models will become more valuable, since they’re the only ones willing to do that kind of work.
I once got myself banned from Claude by researching barbiturates, since they’re connected with suicide. So my third observation is that we’ll see an uptick in people getting punished for trying to do things with AI that people don’t usually do. (Luckily the unban form worked.)
Someone downthread asked “how’d he convince Claude the coins weren’t stolen?” Which is an interesting question, because presumably some people trying to crack a wallet have stolen it. So I guess the fourth observation is that the exact framing you approach an AI with will become more important. There was the classic “do this or I’ll cut off my arm,” which worked a year ago. But in the future it will be more like “hopefully the AI believes my story, or else I’ll get into trouble.”
It’s good there are multiple AI vendors, or else it’d get real dystopian real fast when the de facto AI’s policy becomes something you have no way of working around.
I'd love to mine a bitcoin a day on my PC now...
But lets be honest - when BTC hit 100 bucks, we would have cashed it out thinking we were geniuses.
you can!... but they wouldn't be worth the electricity now either. the cost of mining (amortization of hardware costs plus electricty) is the value of bitcoin. if bitcoins are a bargain to mine, more people will mine them thereby reducing rewards.
should you have mined more back then if you had magical perfect knowledge of the future? no: they weren't worth the electricity.
instead you should have bought more of them back then.
https://www.coinbase.com/learn/crypto-glossary/what-is-bitco...
Claude has limits that would make this simple statement be much more complicated-
Via Claude "So the chat upload file size limit is actually 500MB per file (not 30MB as many third-party sources claim - those appear to be outdated). The 20-file-per-chat cap and the 30MB-per-file limit in Projects remain consistent across plans. The real constraint at any subscription level remains the context window - how many tokens Claude can hold in memory at once during a conversation. "
I guess the user simply pointed Claude Code at a local folder containing all the backups and files, and Code went through them via find/ls/etc
Without it I would have given up way earlier, but the infinite patience to keep slurping in error messages and continue to troubleshoot really worked out.
Claude found a file on the computer that the wallet owner had not found. Claude didn't crack a password or do anything magic, it just searched for a file that the wallet owner had not thought to search for before.
So, where the wallet owner had previously only tried to access /Users/example/wallet.dat, Claude thought, "why don't I check if there is another wallet.dat file elsewhere on the system?" which it did.
The outcome is the same, it is great that Claude tried something that the wallet owner hadn't tried, but this is more an example of how dumb humans can be rather than how smart Claude is.
The trillions of passwords are a red herring and unrelated to the solve.
Since we are dealing with Anthropic, the entire story could be staged of course.
I know that we're all experts in archaic backup mechanisms and the encryption systems they used, but I think this qualifies as doing more than Ctrl+F
Also, it is right there in the article.
So I guess he might be glad he didn’t figure it out earlier.