- new
- past
- show
- ask
- show
- jobs
- submit
Mine is a daily bash cronjob that fetches a text-based database and uses grep to build an nftables-apply script with all the IPs for the blocked ASNs. I keep meaning to share it, but it's embarrassingly messy I haven't had time to clean it up...
Anyhoo, now you mention it this is the tack I am going to take in my own network, thanks!
No shit.
Of course, the excuse doesn't even apply: the offense of the tech bosses is not training these models (they had that declared legal the second it became clear only the big companies would be training big models), the offense of all these tech companies is running a piracy site. Taking copyrighted works, storing them, reproducing them and then publishing the results to third parties, in many cases for payment, and organizing this practice knowingly, willingly as a company. Paying others to help them do it. This is the worst copyright offense one can possibly commit. It is what one public prosecutor referred to in the Nappster case as "organizing a criminal cartel to violate criminal law on a huge scale".
Tech bosses weren't sued for downloading, in other words, they were sued for uploading. For asking payment for publishing copyrighted works, without any money going to the authors.
When Kim Dotcom did that, in the words of the US Attorney general, this is "charges of criminal copyright infringement, racketeering, and money laundering" (you see, getting paid for criminal activity is money laundering, a charge that was also made against teenagers selling warez cds)
ChatGPT tells me, unaware and unwilling to discuss the INCREDIBLE unfairness, that in the US, first-time offenders can face up to 5 years in prison, while repeat offenders can face up to 10 years PER OFFENCE. ChatGPT is unwilling to discuss it.
The courts are also unwilling to discuss this, but no worries! New technicality: only a public prosecutor gets to ask ...
Dario Amodei wilfully committed large scale copyright infringement, as did all the tech bosses from Musk to Bezos ... and "strangely" nobody in any court even mentioned how much 10 years times 500,000 is, despite systematically threatening that punsihment repeatedly in the cases against teenagers.
Note that the law is extremely clear that company management IS NOT shielded if ordering criminal actions (violating criminal law, as opposed to violating a contract). In that case, company management carries full criminal culpability, INCREASED from if they did it themselves. Of course, this is only ever applied for refusing to pay tax or court fees.
If the law were applied alike and fairly to individuals and tech bosses, Amodei would have to be VERY lucky the human race still exists by the time his corpse leaves prison.
What do you think the outcome of tightening fair use is going to be? Do you think its going to be most effectual against these big evil AI companies we are meant to fear? Or is it going to end up putting more individual creators on the end of Disneys pitchforks?
Like if you support creating a gun to kill a monster, that's great. But you need to understand that weapons rarely only target the person you want them to. And its unlikely that any bill that specifically targets a certain size or profit margin is going to make it all the way into law without being generalised to the approval of large IP holders.
Its much much (much) better to look at this as an opportunity to erode IP laws for everyone, than to make them worse and hope that your particular enemies are the only ones that are affected.
>That doesn’t mean I’m behind industrialized narcotic production on such a huge scale that it that it starts to distort the economy, and companies looking for new ways to add methamphetamine to every goddamn product.
Thats such a non sequitur. This isnt a weed legalisation argument, its "Do we make IP worse for everyone, because you dont like some people benefiting from fair use".
Meta used allegedly stolen copyrighted materials to train a model they shared for free with the whole world. Is this a recreational use?
It is the same playbook everytime. We dont have to be naive and pretend meta is doing something for other peoples benefit.
Are you unable to access this page?
https://www.llama.com/llama-downloads/
Or this one?
https://lmstudio.ai/models/meta/llama-3.3-70b
>It is use to build a monopoly
How?
>We dont have to be naive and pretend meta is doing something for other peoples benefit.
Meta benefits from the current war of open model competition, but we also benefit from it. In particular, participating in all this makes it hard for them to pull the ladder up when the market changes. They will have to justify why whatever new hotness is better than these existing models already on our hard drives.
Don't they? They release the llama model weights, they do things like this:
https://www.opencompute.org/wiki/Open_Rack/SpecsAndDesigns
They also make significant contributions to Linux and are the originators of popular open source projects like zstd and React.
They make their money from selling ads, not selling licenses.
But I hear you. One of my biggest tells that someone can't be reasoned with is when they resort to whataboutism without any consideration for how 2 situations can actually be different even if there is some commonality. It's a powerful bad faith argument technique. When that style of argument comes up I nod my head and walk away. Some people are just doomed.
'"They then copied those stolen fruits"
How are these fruits "stolen" if they still have what was allegedley stolen?
Dowling v. United States, 473 U.S. 207 (1985): The Supreme Court ruled that the unauthorized sale of phonorecords of copyrighted musical compositions does not constitute "stolen, converted or taken by fraud" goods under the National Stolen Property Act
And even if, arguendo, sure its stolen. The purpose of copyright is to "To promote the Progress of Science and useful Arts, by securing for limited Times to Authors and Inventors the exclusive Right to their respective Writings and Discoveries"
And you would be hard pressed to prove that LLM's haven't advanced the arts and sciences, so at bare minimum transformative, ie fair use.'
If you write a book and I take it and embed its knowledge into my product that is so pervasive that no one needs to buy your book any more (and I don't even credit you so no one knows where that knowledge came from), to you really still have what was stolen? And I didn't even buy a copy of your book to copy it.
The trouble with this analogy is that it proves too much.
Suppose you write a book, and so does someone else, but they have better marketing than you and then people in the market for that genre buy theirs instead of yours. Let's even stipulate that the existence of their book actually lowers your sales, because people who want that kind of book already bought theirs by the time they find out about yours and then some people don't have time to read or can't afford to buy both.
Notice that we haven't yet said a word about the contents of either book. They could be completely independent and they've never even heard of you or your book -- they "didn't even buy a copy of your book to copy it". All we know is that they're the same genre and the existence of theirs is costing you sales. By that logic all competition would thereby be "stealing", and that can't be right.
Which implies that you don't have a property right to the customers.
People on this thread need to focus on what "derivative" and "fair use" mean and understand both are measured on a somewhat fuzzy spectrum, subject to interpretation.
In a perfectly fair world AIs/MLs could vacuum up all human knowledge, fair and square. (In an ideal world, they would do that adhering to polite opt-in/opt-out agreements with copyright holders. We can dream). Input isn't theft.
On output, two magic genies would stand at the gate, the Derivative Genie and Fair Use Genie and review anything spat out by the AI/ML. If it crossed agreed upon thresholds the Genies would bar the gates and issue a stern warning to prompt again (or maybe the AL/ML would auto-adjust the prompt and try again).
So, if your prompt asked for a 300-word poem about thrash metal mosh pit dancing and it spat out a poem where 85% of it match one of the handful of available mosh pit poems in its database, the Derivative Demon would block the output and raise an alarm.
On the other hand, if you asked for a line by line analysis of a famous mosh pit dancing poem (by name) or maybe asked for a satirical spoof of said poem, the Fair Use Demon would overrule the Derivative Demon and give the output a pass.
That's as fair as this could get, especially if you add one more thing: An Appeals Court (maybe corporate, maybe 3rd party, maybe state run) with a Settlement Pool. If a copyright holder could prove the Genies let pass something they shouldn't, the AL/ML would fix that. If real damage is done, the creator would get a settlement from the pool.
The point is that the Input Genie is out of the bottle. Creators just look foolish trying to squeeze it back in. Better, they should focus on making the output Genies and the Appeals process as effective and fair as possible for everyone.
I'm not sure how this plays out legally, but it certainly seems unethical
Would it be fair for Greece to do retroactive term extensions all the way back to Plato and then sue anyone who copies the idea of having a university or uses the Platonic solids or distributes religious texts that incorporate the dualistic theory of the soul?
The attempt to distinguish them is through copying, but that's the part that isn't depriving anyone of anything.
Facts are not copyrightable. Only your particular way of expressing those facts is copyrightable.
> Theft [...] is the act of taking another person's property or services without that person's permission or consent with the intent to deprive the rightful owner of it. --- https://en.wikipedia.org/wiki/Stealing
This isn't a court of law. We don't have to talk like lawyers. If you replaced "theft" with "copyright infringement" in the comment you had such a problem with, what meaningfully changes besides we all have about five additional brain cells?
The obvious difference that copyright is subject to fair use and various other limitations that personal property isn't.
maybe you should look up the definition of property, which is a set of legally recognized rights over a thing, typically including:
* possession (what you're focusing on)
* use
* exclusion
* transfer
The last 3 seem like they have been breached, in legally that's theft.
This can even extend to stealing physical property.
Depending on local laws, stealing a car may not actually be theft if the defendent can prove they intended to return it before the owner got home from work, though it would certainly be considered theft in the colloquial sense of the term, and they would still be guilty of a lesser offense like civil and/or criminal conversion.
I doubt there's even one place where the law works like that.
In a lot of places, that's how it works. A key element of theft is the intent to permanently deprive someone of property.
This is why joyriding isn't classified as auto theft and is instead a lesser offense. It's because joyriding is an intent to temporarily deprive, while GTA is an intent to permanently deprive.
In some jxns (the UK is one), there is a tort called trespass to goods, and an example of this would be "stealing" someone's property to deliver to another location for them to use there. The tort of conversion is similar: interference with someone's property right to treat it as your own (silent as to length of time).
Getting punched in the face also violates rights, yet isn't murder. Murder is specifically about dying.
People seem to think what ai is today is theft. If enough people agree, it will be theft. Big companies dont like this and push the other way. An objectiveness doesnt exist here. It is too wiggly
With theft, the entire damage is the deprivation. It could be an heirloom or some other object that may have been entrusted to you, something that can never be replaced, memorabilia of loved ones. Something that you may have needed in your posession to survive (e.g. a car to go to your job).
With a given copyright violation, the damage is that maybe[1] you made less profit than you could have. The potential for profit is not property. Profit isn't guaranteed.
[1] The loss is not certain, because there's no guarantee that the ones consuming the copyrighted content could have even afforded it.
Also social media profile pics. Great way to get faces for deep fake ads. Most people are just 1 phone call away from being voice cloned. Our likeness isn't all that important either if you think about it.
Maybe meta will clone your writing style and sign into your meta account and message your friends telling them about this awesome new product. Meta owns the account and you uploaded data to it.
Don't be naïve. A corporation would tear the flesh from your body if it meant a better quarterly earnings report.
I wouldn't even go that far. Its an entirely new product. Its like the guy who sold you the keyboard demanding royalties for the software you built.
That the person who wrote the book couldn't predict a new use case for the book in training LLMs, is irrelevant. The book isn't in the LLM. Its not being sold with the LLM. Its one of billions of tools used to create the LLM.
People try and sell this as the AI companies extracting value from the poor little IP holders like Disney. Its maddening. That content is your cultural heritage. It already belongs to you, just some idiot has been granted a lifetime of exclusive exploitation. An LLM is trained on data you already own. Disney et al wants to exploit the new technology to extract even more money out of stuff created often decades ago.
At absolute worst its reverse engineering, which was supposed to be fair use protected in the US but apparently that's been somewhat eroded.
An LLM is essentially a lossy compression of the training data. The book absolutely is in there, it’s just mangled to the point of unrecognizability.
When large quantities of source material are replicable by prompting its a bug not a feature.
>The LLM Wouldnt be here without the copyrighted works
Google wouldn't be here if it hadn't scraped every copyrighted website and used them to form a searchable graph of the internet but we only hear complaints about them when they reproduce entire news articles.
What makes you think you are entitled to tell people what they can and cant do with data they purchased (or otherwise acquired) from you. Extremely honest question. I just cant put myself in your shoes.
Like if I had written anything useful I would be overwhelmingly flattered that my content be considered so worthy for inclusion.
Your profile suggests that you are a philosopher. Did you get into philosophy hoping to exploit the publishing industry to the extent that you can squeeze every cent out of your thoughts, and deny their potential uses downstream?
Its actually crazy how bad things are, I am usually keen on capitalism and exclusivity, but the whole thing with LLMs, I see people pushing hard to tighten the grip of intellectual property. I see people making 50 cents a month on Kindle Unlimited suddenly shocked that someones LLM generated output might be ever so slightly influenced by weights ever so slightly influenced by their work, seemingly thinking they might get some big payday out of it.
Give me a tiny little wedge of understanding of your thought process. Your book is right now, doing a greater social good on your behalf than me running around and removing all the trash from my neighborhood, and the benefits of that social good are going to accrue long after you and I are gone. Your work is now going to live on, in a very tiny way, in these systems forever. I am honestly envious.
If anything, I would be trying to get bad writing removed from LLM training data. Things that I dont want to influence others. But as a potentially honest promoter of your work, you want it removed?
Whats the number? If not 1:1 exactly what you charge for the book, what do you think the proper compensation for slightly influencing training weights you should receive?
Hundreds of years of copyright law. I bought a copy of Windows, but I’m not allowed to modify that data with a cracker and sell a bootleg DVD of it.
I should edit to clarify that I’m not a big fan of Lars Ulrich or Disney, but I don’t think we’re going to get a win here for the recreational IP pirates. What’s more likely is that we’ll end up with some Frankenstein law that favors both Mikey Mouse and OpenAI, and you and I will neither get free movies nor the ability to earn a living off of our creative labor.
But in abstract you should absolutely be able to modify and sell windows.
And before you give me an analogy about how someone could listen to Pink Floyd and then produce works inspired by their influence yada yada: Someone is a human being with human rights, and if we're going to start pretending that training an LLM is in any way analogous to human consumption and creativity, and not an industrial process that encodes input data into a digital artifact, then let's start by saying LLMs have human rights and cannot be owned by a company that charges for access to them.
Yep and so far it looks like the issue with the meta case is they didnt pay for the book. Not that they used it in training data.
>in the most charitable interpretation, using them to create derivative works.
Yeah in the same way I use a hammer to create a derivative table.
>Someone is a human being with human rights, and if we're going to start pretending that training an LLM is in any way analogous to human consumption and creativity.
I dont care about that. Its simply a tool being built using existing tools. Like using a jigsaw to make a step ladder.
Let's not sane-wash what they did here, they didn't just 'forgot to pay for the books', they deliberately and illegally downloaded and used material that wasn't theirs to use.
If you or I did that, we would be jailed or sued into destitution. In a fair world we either should change copyright laws (allowing for anyone to freely pirate all media), or Zuckerberg needs to go to jail.
Yes. Forgot is your word.
But lets face it, there wouldn't be a case to answer for if they had paid retail for each book, torn them up and scanned them and trained on that data.
>Zuckerberg needs to go to jail.
I am comfortable with that but would prefer updating copyright.
It’s called a copyright notice. Same as a license. If you’re running a commercial business you can’t legally just take that piece of work and reuse it. Pick any book off your shelf and pretty well every one of them will have words to the effect of:
All rights reserved. No part of this publication may be reproduced, distributed, or transmitted in any form or by any means, including photocopying, recording, or other electronic or mechanical methods, without the prior written permission of the publisher, except in the case of brief quotations embodied in critical reviews and certain other noncommercial uses permitted by copyright law. For permission requests, write to the publisher, addressed "Attention: Permissions Coordinator," at the address below.
Same as every piece of commercial software has a license which has to be abided by. Same as use of Meta’s service has terms and conditions which HAVE to be agreed to.
So yeah they’re free to break that license but they’re also free to be sued by IP holders for breaking it at scale.
The items they call out around training the models (and attempting to claim that each subsequent model generation should count as an additional instance of infringement) seem far less grounded in the current court interpretations of AI training.
I am not a fan of US copyright law, but if I torrented millions of books, I would be facing a felony charge in criminal court and a (with statutory damages as high as $150,000 per title in cases of willful infringement) multi-billion dollar lawsuit in civil court.
In my opinion, this has nothing to do with whether or not AI training is transformative and this fair use, and everything to do with whether or not the laws apply to everyone equally. If Facebook isn't forced to pay billions and elect a sacrificial executive to serve prison time, then I will remain angry.
That is not what this case is about. It is more about the illegal violation and piracy of copyrighted content done by Meta for commercial use and Zuck knew they were doing it.
Why did Anthropic settle [0] with a multi-billion dollar payout to authors after commercializing their LLMs that was trained off of copyrighted content that was illegally obtained and kept without the authors permission?
There's a reason why they (Anthropic) did not want it to go to trial. (Anthropic knew they would lose and it would completely bankrupt them in the hundreds of billions.)
AI boosters will do anything to justify the mass piracy and illegal obtainment of copyrighted material for commercial use (not research) which that is not fair use in the US. There is no debate on this. [0]
[0] https://images.assettype.com/theleaflet/2025-09-27/mnuaifvw/...
The original work is not replicated identically, why would we replicate a work when it can be more easily seen in original or replaced with an alternative options online. We use AI to produce new outputs to new situations. We already have had drives and networking for plain copying.
Or anything to defend on Meta. If they go out of business, humanity profits.
Elsevier at least works within the (admittedly broken) system, Meta does not.
Not even going to all GPL stuff, that in a better world should have screwed all the slop companies
I've wondered what the legalese justification for letting liability evaporate as it does so often with corps. So far the reasons I'm left with are 'shrugs' and 'the relevant provision (seemingly? apparently?) simply don't apply', neither of which are any good.
I was going to make a joke about how we should attach magnets to Aaron Swartz' corpse, since that'd make for a pretty potent energy source, given how fast he must be spinning. But honestly, I think he would have seen this sort of thing coming, given how his case was handled and how things really haven't gotten any better.
In the years since the basis of the case has been forgotten and replaced with an assumption about piracy, but it was a case about unlawful access.
The law doesn't see it that way, but it is not the ground truth.
This does not comfort me.
All the Aaron Schwartzes of the future could freely share scientific papers with the world.
RICO specifically cites "criminal infringement of a copyright" as laid out in 18 U.S. Code § 2319. If the CEO tells his employees to download hundreds of thousands of works illegally in order to carry out his money-making scheme, how is that not organized crime even if (dubiously) LLM training on the material is fair use?
-----
RICO: https://www.law.cornell.edu/uscode/text/18/part-I/chapter-96
Definitions: https://www.law.cornell.edu/uscode/text/18/1961
> As used in this chapter — (1) “racketeering activity” means (A)[...]; (B) any act which is indictable under any of the following provisions of title 18, United States Code: [...], section 2319 (relating to criminal infringement of a copyright),[...]
18 U.S. Code § 2319 - Criminal infringement of a copyright: https://www.law.cornell.edu/uscode/text/18/2319
-----
edit:
> 18 U.S. Code § 1962 - Prohibited activities
> (c) It shall be unlawful for any person employed by or associated with any enterprise engaged in, or the activities of which affect, interstate or foreign commerce, to conduct or participate, directly or indirectly, in the conduct of such enterprise’s affairs through a pattern of racketeering activity[...].
https://www.law.cornell.edu/uscode/text/18/1962
From the lawsuit:
“Meta — at Zuckerberg’s direction — copied millions of books, journal articles, and other written works without authorization, including those owned or controlled by Plaintiffs and the Class, and then made additional copies of those works to train Llama,” the suit says. “Zuckerberg himself personally authorized and actively encouraged the infringement. Meta also stripped [copyright management information] from the copyrighted works it stole. It did this to conceal its training sources and facilitate their unauthorized use.”
WTF
The rate at which they were spidering and scraping was so far beyond what any other supposedly legit spider was doing, it seemed like the logical explanation.
But a multi-billion dollar corporation downloading millions of copyrighted creative works so that they can reshape the entire labor market by training a new type of artificial intelligence model on that data set? Meh, sounds like Silicon Valley disruption, give the man a medal!
I doubt Meta has deleted their local copy though ...
it is wrong to advocate for everyone to be treated equally unjustly. better to advocate for the removal of the bad laws/structures
I’m not calling for its use as a “retributive tool”. Just that it be applied evenly.
court precedent is a useful tool of advocacy
No, we should apply them equally to Mark Fucking Zuckerberg (which is decidedly not retributive, however much you want to make an emotional appeal) until such time as they are repealed as laws. It’s not really that complicated.
According to comments here that was totally deserved. You should not mess with copyright.
The question to answer is, did it happen and if so is this copyright infringement (not covered by fair use), not which company official authorized it.
Who gave permission to Anthropic/ClosedAI to scan hundreds of thousands of books to feed to their systems which they commercially sell. Why is this the new normal. Even GitHub a month ago was like if you don't opt out we will read even your private repos for AI training.
Tech is turning into next level BS, I don't know if it always was like this but this has pierced even the very bottom.
When you think about the objectives and constraints on the table, and how disproportionately light penalties imposed on large corporations can be, if you can muster any kind of crappy argument, doing absolutely zero licensing is the no-brainer clear win. You get all of the material. You avoid a massive cost. Then the tech friendly Federal courts of the Trump administration will interpret all of the laws as far as possible in your favor and impose the lightest penalties they reasonably can.
It's a no brainer. License none of it, it's more data, it's cheaper, it's easier, the win is blinding. But if you license, you pay so much, if you use anything you didn't license you've tipped your hand on cognizance of guilt, blahblah. The contrast is stark.
Like the fine means nothing to meta, and they'll still be the beneficiaries of their infringement.
In this current state, you really just need to have enough money to bypass this lawsuit and be on your way.
> Authors have sued AI companies for copyright infringement before - and lost.
So, basically nothing will come out of this
So not are these publishers rightfully pissed, Meta didn’t even give them the $6.99 for each epub to begin with. They’ve stolen the whole thing as part of this “fair use” campaign to destroy human authorship free of even the most basic remuneration.
It’s also that Library Genesis was one of the best things on the internet until it came out that Meta had scraped it, at which point it became harder and harder to access. So not only did they pirate, their doing so made it harder for everyone else to enjoy piracy too.
I'd much rather prosecution focus on Zuck's more serious crimes against privacy and civilization as a whole. But maybe this is a small start?
That's not revenue.
No accountability for rich people has funny patterns like this.
Personally, I would be happy if AI companies are what finally take down intellectual monopoly (intellectual property). I know being anti-intellectual-monopoly isn't a common view, but i don't see average people thinking it is so important—as you can see by the huge increases in piracy recently. Could be wrong about this, I haven't done research on public opinion about copyright.
Honestly, this whole case could be great. Either copyright loses, good for us. Or Zuckerberg loses, also good for us.
I would say that copyright loses is better for society than Zuckerberg loses because, my wish for Zuckerberg to lose is from hatred, while my wish for copyright to be abolished is from my wish to help humanity.
Even Supreme Court justices[1] have said the case for copyright is thin.
[1] (before he became a justice) https://en.wikipedia.org/wiki/The_Uneasy_Case_for_Copyright
I see the ability for trillion dollar companies to wash their hands of any wrong doing for stealing all the intellectual property of the world as incredibly dangerous for innovation in the US.
So for me I would rather see reform then an abandonment. There wouldn't be much of anything to pirate if there was no incentive or protection to create in the first place.
In my mind the biggest threat the average person faces would be billionaires who can operate with impunity. Rather then a fine for pirating a book or a movie. It would be nice if the fines were proportionate to the value of the item... I digress
Anyway, the point is - there will be no justice until the citizens of the united states demand it.
This article doesn’t even remotely itemize all of Trumps corruption, but it’s long and extremely damning.
I would hope that anyone still supporting this administration reads this article and does some introspection on why. I’m guessing that ship probably sailed 6 years ago, though.
Also that there are over 2,000 emails with Peter Thiel. Or maybe the part where Sergey Brin was helping Epstein shop for an aircraft carrier (also after conviction). Honestly it was incredibly revealing that none of these people care that he raped kids. I would love to see the Trump files which were withheld but clearly thats never gonna happen.
Anyway, congrats to everyone involved on the MAGA golden age!
Do you have any evidence that files related to Trump were held back? I don’t believe that’s the case.
He’s mentioned in many of the files. I found it particularly interesting that Trump was an FBI informant that worked with the government to get Epstein convicted.
Have you done more than Trump has done to stop human trafficking? If so, please be specific.
And thank you. I’m really happy that Trump was elected. I found this year’s tax credits for social security income, overtime, and car payments on American vehicles to be especially great. Most favored nation drug pricing was also a really impressive achievement!
Trump was Epstein's close friend for many years. There are many photos of Trump partying with Epstein and young women.
It is impossible that Trump did not know what Epstein was up to. At a minimum, he knew about tremendous abuse of children and did nothing to stop it, despite being a wealthy and powerful man. At worst, he fully participated.
Fuck you and your willingness to be misled.
History is an ordered list. Yes, Trump was friends with Epstein for a number of years. But then what happened?
Yes, Trump knew Epstein was a creep. The documentation indicates that he did do something about it.
You can yell at me and be rude all you’d like though. This is the internet after all.
It's extremely clear that Trump did not do that. He continued the friendship.
What would you do in that situation? I'd burn that friend to the ground, legally, socially, every way I could. I certainly wouldn't still be hanging around with him, talking about how cool he is, etc.
Your standards for human behavior suck.
My standards for human behavior are high. So high that I’m disappointed when people join in on slandering a person that doesn’t deserve it.
I’m sure it’s nice to imagine yourself as a brave person who would punish Epstein more than Trump did, but I guess we’ll never know if that’s true or not. I’d wager you’d probably just go along with whatever provided you the most social affirmation.
BTW we haven't even talked about the 100000 other things that make Trump a terrible person. Even if a timeline could be produced that fully exonerates Trump regarding Epstein (it can't), you'd still have so many other reasons he's a trash person.
You are not a good judge of character.
I’m a really good judge of character. When folks don’t think for themselves, and instead join in on unfounded slander campaigns, it usually doesn’t speak well of their character.
I won’t join in. I don’t mind being insulted for it by random strangers on the internet. It’s a small price to pay for advocating for clear thinking and decent morals. If just one person questions their assumptions, then I’ve done my part.
Trump is the most corrupt president in US history, by a few orders of magnitude.
Trump lies continuously, obviously, ridiculously. Hell, how many times has the Iran war "ended" in the last 2 months, per Trump?
You do not have decent morals if you excuse Trump.
Fuck off with your calm fascist game.
Trump's been a known sex pest, liar, and cheat since the 80s. He hasn't changed in the decades since.
Not a lot of people were listening to it until he started to be considered a serious Presidential candidate.
It's literally not. Like, I personally was aware of some of the allegations before he ran for President. And not even the time in 2016, the time he made an effort for the Reform Party nomination in 2000 and bailed early on in the process.
> I’m not sure what more I can tell you.
You can make up whatever fiction you like, the fact is that allegations of sexual and other impropriety against Trump existed (but got far less attention) long before he was a major Presidential candidate, but got a whole lot more attention when he became that.
Trump was made fun of for lying, self promotion and boosting via pseudonyms (eg: "John Barron") since the early 1980s at the very least.
I guess you did not have the experience of seeing his lying sex pest escapades spread across newspaper and tabloid headlines intermittently through the 80s and 90s like a lot of us did.
But sure, no one said aaaaaanything like that about Trump before 2016, no sir!
Ivana Trump accused Donald Trump of rape in 1990 btw.
You should also review the code of conduct for this website and learn to communicate in good faith if you expect to ever be taken seriously. Until then, I hope things get better for you man.
Check yourself in somewhere or go for a walk. You’re deeply unwell.
As for human trafficking in general, Trump has objectively enabled ICE, the FBI and others to aggressively target that sort of behavior.
As for slavery, the US is one of the countries that fought a war to end that thousands of years old practice.
Anyway, I like Trump, and by all accounts, that goes against the grain. You’re going expressly WITH the grain. That doesn’t indicate that had you been born in the 1600’s you’d have spoken out about slavery. You’re a with the grain guy. And with the grain guys will go along with whatever the moral consensus of the times are.
- Mark Zuckerberg
I make an exception for Mark because he never changed his attitude or behavior, and if anything, he simply doubled down on it. I will never, ever, ever, EVER trust him to be a moral or ethical person.
Tired of the double standard that CEOs get away when bad things happen (because they can’t be everywhere all the time) but all the benefits when the company makes a great profit (because they’re personally driving results!).
The difference is how easy they are to sue. Good luck suing Chinese companies over this, but suing American companies is much easier.
Corporations believe in copyright so if they "break" it they should get punished for breaking rules they made up themselves.
Generally the law should be more strict for corporations than for real people.
edit: People downvoting can you argue why you disagree? I do think it's fair for the law to be more strict on the powerful rather than on the powerless.
Until we progress as a society to the point that we can put this system behind us we should at least fight to make enforcement uniform. In fact, uniform enforcement is probably a good starting point for arguing for abolition, as the pain of that enforcement is felt by proles and elites alike.
If this was you or me, we would be in prison for decades and have a fine in the millions. Time for these people to feel consequences.
As someone said, they will probably settle for around 6 billion, that is the same as say a $100 fine for us.
I'm all for strong justice, but you want to imprison an executive for decades for copyright violations?
Ah, found it:
>In April 2023, a 54-year-old programmer named Gary Bowser was released from prison having served 14 months of a 40-month sentence. Good behaviour reduced time behind bars, but now his options are limited. For a while he was crashing on a friend’s couch in Toronto. The weekly physical therapy sessions, which he needs to ease chronic pain, were costing hundreds of dollars every week, and he didn’t have a job. And soon, he would need to start sending cheques to Nintendo. Bowser owes the makers of Super Mario $14.5m (£11.5m), and he’s probably going to spend the rest of his life paying it back.
I'm not even a tiny bit supportive, but there is precedent.
https://www.theguardian.com/games/2024/feb/01/the-man-who-ow...
Why should Zuckerberg be exempt?
Zuckerberg saying anything about copyright infringement is irrelevant to the actions Meta has taken in consuming and promoting the practice, and he should face criminal liability.
Zuckerberg may be CEO, majority shareholder, and on the board of Meta, but he didn't break copyright law, Meta did. So if there were to be a consequence, Meta would pay out the fine. Not sure how you jail a company.
Now, in a company with a real corporate governance structure, the board would look at the loss incurred by said fine, look at Zuckerberg, and immediately fire him for causing the loss. However, like I said before, Zuck's in charge of Meta, so that's not going to happen, and the fine is unlikely to be enough to drastically impact the company's profitability enough to sink his shares, which are the main repository of his wealth. So if he thinks he can make himself richer violating copyright law in the future, he will likely direct Meta to do so.
TL;DR, in the famous words of Bender from Futurama, "Hooray, the system fails again!"
I'm still stuck on how Z telling Meta (or the relevant people at Meta, whatever) to go out there and do illegal shit doesn't make a court say that he's functionally done said illegal shit, or at least encouraged the company to do, and that he should thus be liable for that. It's not like there's much plausible deniability here. It'd be one thing if the lower ranks thought it'd be fine and did it of their own accord. It's quite another for Z to tell people to go nuts doing illegal shit.
The DMCA makes facilitation of copyright infringement illegal. Telling people to do copyright infringement is surely facilitation of copyright infringement. Surely then, Z having broken the DMCA is a fairly open and shut case, modulo calculating the damages. But apparently not?
I don't even play one on TV.
I wonder if, somehow, you could use or extend RICO statutes to cover this sort of thing.
You jail the CEO and the others will stand up and take note.
"But they'll complain" who gives a fuck.
What you'd need is something that either removes that protection past a certain amount of value, or, to tell entities like Meta - which are basically sole proprietorships with window dressing - that they're not entitled to the protection of incorporation if they don't enact a real corporate governance model.
Unless you have an SBA loan. Then the suing party can't get blood from a stone, but the federal government sure can.
> the fine is unlikely to be enough to drastically impact the company's profitability enough to sink his shares
You lack imagination :-) but you've identified both the problem and the solution.
It could be possible to construct a legalistic jail for a company whereby if it has committed the type of crime that a human could be jailed for, then it could be frozen for the duration, say ten years, and all its assets, shareholder funds, contracts, everything were frozen and impounded.
Of course this seems completely ludicrous because it’s so “out there” but it’s worth having the thought experiment. Things like “corporate manslaughter” really have few consequences for the corporation itself - if it was actually jailed for twenty years and shareholders and officers left frozen out and on pause, then it might be the kind of punishment that really counted for something.
In a way, they are. Those two groups are often the same due to incentives packages. The money lost to the fine is money not put in the earnings-per-share or R&D or whatever. That's the opportunity cost of paying the fine.
The problem from the "discourage bad corporate behavior" standpoint is that you can generate enough money from breaking the law to cover the fine and make more money than you would have had you not broken the law. Or maybe it's not a fine. Maybe it's a judgment from a civil case. Same issue.
You need to greatly increase the financial consequences to both clean out all gains the shareholders could have made from the illicit behavior and make it harder for the company to be competitive in the marketplace going forward.
I always heard that criminals should be thrown in jail, it's time we started doing it to the real criminals.
Now, I personally find the idea of imprisoning people for copyright offenses horrific, but I don't think it's remotely insane that someone else might come to that conclusion, given that we broadly accept it as a society.
[0] https://www.ussc.gov/sites/default/files/pdf/research-and-pu...
Fines don't do anything to deter bad behavior. Either:
* The company pays
* They pay and the company mysteriously increases next year's comp / grants a "loan" / etc
* D&O insurer pays
In all three cases the money comes out of the shareholders' hides. It provides zero personal deterrence. The payoff matrix, as seen by a sociopath, makes it rational to always defect against the common good.
The only punishment that can really focus attention is physical imprisonment in a facility they can't choose.
SOX did this for financial reporting and gee shucks it turned out executives can follow the law after all!
They stole the life's work of millions of people.
In less civilized times, they likely would have been drawn and quartered by strong horses, and had their limbs drug to the 4 corners of the continent as a warning to anyone else that would consider doing it again.
Consider the case of someone who gets banned but Facebook keeps collecting money on their business account. Or consider the case of Facebook's video metrics scandal, or... whatever. It's a little fuzzy translating how much value equates to how much stock price equates to how much real-world is-this-useful-to-me but it does matter when FB is accused of marketing (Aaron Greenspan, thinkcomp, has brought this up, in his 2019 testimony to UK parliament) advertising to more people in a region or country than actually physically exist
So fraud builds on itself, you have more fraud money to pay lawyers to try to defend you in fraud cases
The title is clickbait at it's worst. The situation around copyright and AI is stock standard "CEO makes a decision in an area that is clear as mud".
(Copied from a comment of mine written more than three years ago: <https://news.ycombinator.com/item?id=33582047>)
Learning from copyrighted content is legal - for both humans and AI. If Meta is in hot water for anything, it's piracy and/or storage of copyrighted material.
I know there's a complaint that AI can verbatim repeat that work. But so can human savants. No one is suing human savants for reading their books.
Producing copyrighted material, of course. Training on copyrighted material... I just don't see it.
EDIT: Making a perfectly valid point, but it's unpopular, so down I go.
A machine training on all copyrighted materials in the world for commercial purposes at an industrial scale makes it disproportionate.
If a company hired hundreds of savants, then it would be illegal for them to read books?
I don't follow.
And even if we grant that those savants are also very skilled at creating "market substitutes" based on their training that are capable of competing with the original works, their maximum creative output would only be a relatively small number of new works, because they can only work at human speed.
Can you cite something in the copyright laws themselves that suggest this scale distinction?
The one million savants are humans, not machines. Humans get more rights automatically in our world today. That's the moral reason for why your example is not the same. The legal stuff will be worked out in the courts and legislatures of every country in the next 5 years.
This principle is quite universal and can be found in many places, including the US constitution and US (supreme) court decisions, many international jurisdictions, treaties and conventions.
I don't understand why it should be allowed for one savant to study and answer questions about one book, but wrong for a company to hire one million savants to answer questions about one million books.
And I'm asking where in the law or case law this is supported.
Sarah Silverman as the most prominent example.
The AI won't even know where the page of text it's seeing came from, and people will avoid your book as they can just ask the AI. So you make less money. (Talking about specialized technical books here.)
Suppose they did, and some guy was filling stadiums regularly to hear him recite an entire audio book. That would probably get the attention of someone's lawyers.
If it's illegal for AIs it should be illegal for humans, too. Is that really what you're arguing? It should be illegal for savants to read books?
Read a book, that's fine. Write a book, that's fine. Read a book and then write a book that is 99.9% the same as the book that you read and sell it for profit without a license from the original author, that's infringement.
That's what all these lawsuits are about - it's the training not the reproduction. I already agreed in my first comment that the reproduction is off limits.
In this case, it appears that Meta torrented illegal copies of the work to do the training. Obviously that's bad. But conflating that with training itself doesn't follow.
Pirating content is illegal, regardless of if it is to train an LLM.
Usage of LLMs trained on unlicensed content (basically all of them) might or might not be illegal.
Using any method to reproduce a copyrighted work by using that original as input in a way that supplants the market value of the original is probably illegal.
At least that is my rudimentary understanding.
I don't think anyone thinks that all training is a copyright violation if all the training data is licensed. For example a LLM trained on CC0 content would be fine with basically everyone.
The problem is that training happens on data that is not licensed for that use. Some of that data also is pirated which makes it even clearer that it is illegal.
If you supplant the value of the original with the original as input then you probably have some legal questions to answer.
It's a "rules for thee and not for me" argument.
the distinction isn't particularly clear cut with an open source model. If it is able to reproduce copyright protected work with high fidelity such that the works produced would be derivative, that's like trying to get around laws against distribution of protected works by handing them to you in a zip file.
It's a kind of copyright washing to hand you the data as a binary blob and an algorithm to extract them out of it. That wouldn't really fly with any other technology.
And that's really where a lot of the value is mind you, these models are best thought of as lossily compressed versions of their input data. Otherwise Facebook ought to be perfectly fine to train them on public domain data.
That seems very possible to me, and undermines the "training is copyright violation" argument. It's not the training, it's the output.
If you’re struggling to comprehend that a person reading a book is different then you’re a bad bot.
How about then to grant AI all other rights, for example, to allow voting?(sarcasm)
Just from a rational argumentation point of view. Clearly if a law is written saying as much, then sure. But there is no such copyright law like that yet.
Correct. Because until very recently there was no need.
Yes it's very different. Humans need to eat, sleep, and pay taxes. You also have to pay them competitive wages.
There's nothing in the law to support your argument either. The law however does say, very unambiguously, that copying without permission isn't allowed . There aren't exceptions for "training" just because it's superficially similar to a human activity (reading a book). A human isn't allowed to hand-copy Harry Potter. Even if they bought all the Harry Potter books.
> "81.7TB"
https://en.wikipedia.org/wiki/United_States_v._Swartz
> "approximately 70 gigabytes"
How are these fruits "stolen" if they still have what was allegedley stolen?
Dowling v. United States, 473 U.S. 207 (1985): The Supreme Court ruled that the unauthorized sale of phonorecords of copyrighted musical compositions does not constitute "stolen, converted or taken by fraud" goods under the National Stolen Property Act
And even if, arguendo, sure its stolen. The purpose of copyright is to "To promote the Progress of Science and useful Arts, by securing for limited Times to Authors and Inventors the exclusive Right to their respective Writings and Discoveries"
And you would be hard pressed to prove that LLM's haven't advanced the arts and sciences, so at bare minimum transformative, ie fair use.
Royalties are owed and continuously owed as these models are deployed and doing inference. How is it any different to paying a small pittance to someone every time a song is played?
The LLaMA models were released openly. Copies exist everywhere in the world. You aren't going to be able to charge someone for running `llama.cpp`; a court order ceases to have practical relevance at that point.
"I made enough copies for everyone" isn't a valid defense for copyright infringement.
Second, royalties are not required to cite a source.
Can you imagine how disastrous it would be to everything from news reporting to scientific publishing if that was the case?
I don’t get why the training process doesn’t count as any other form of transformation but then I’m not a lawyer.
Also I believe performing covers is legal
The previous infringement case with Anthropic said that while training an AI was transformative and not itself an infringement, pirating works for that purpose still was definitely infringement all by itself. The settlement was $1.5bn, so close to $3k for each of the 500k they pirated, so if Zuckerberg pirated "millions" (plural) it is quite plausible his settlement could be $6bn.
[1] See, e.g. https://en.wikipedia.org/wiki/Oink%27s_Pink_Palace#Legal_pro...
Cory Doctorow wrote a nice summary of the Zuckerstreisand book by Sarah Wynn-Williams.
"First, Facebook becomes too big to fail.
Then, Facebook becomes too big to jail.
Finally, Facebook becomes too big to care."
That Mark never fails to deliver.
> Afterwards, Kaplan gives her a negative performance review because she was "unresponsive" to his emails and texts while she was dying in an ICU.
Holy shit.
24 songs and was at one point $80k per song, almost 20 years ago. Let's let Zuck off with an even 100k per infringement.
[0] https://w2.eff.org/IP/P2P/riaa-v-thepeople.html
It just seems obvious to me that a profit seeking venture should be held to a higher standard when it comes to infringing on the property rights of other companies and individuals, especially if they seek to enforce their own.
Those kids weren’t hypocritically enforcing their own property rights and making employees sign ndas while downloading shit from tpb.
The hypocrisy is what has at least me upset
It’s currently just as bad but in a different way, imho.
The ability for labels (or whoever owns the rights) to wantonly invoke automated DMCA copyright strikes and demonetization on social media channels like YouTube is borderline criminal to me.
Their lobby did a great job getting them more than they deserved (specifically with regards to the facilitation of capricious invoking of DMCA), but the abuse of the rules limits the growth of the creator economy in very unhealthy ways.
The difference in scope here should be obvious.
We can similarly punish drug dealers while not punishing drug users. In fact it's already policy in large parts of the USA.
"Thats such a non sequitur. This isnt a weed legalisation argument, its "Do we make IP worse for everyone, because you dont like some people benefiting from fair use"."
Of course some kids are going to be charged for this kind of shit, it's still a rules for thee but not for me world, the 'not for me' folks are just a hell of a lot more brazen about it.
Those students are not Zuckenberg. They will not be treated as Zuckenberg. The legal theories that apply to them dont apply to Zuckenberg and vice versa. They do not have money to mount defense and if they do, they will be in debt till the end of their lives.
His constant violation of people's privacy is also horrendous and worthy of condemnation, but that's not directly related to the copyright infringement matter. It's a separate issue.
But we all know it’ll be a slap on the wrist for Meta and nothing will change.
Anything less than this means it's rules for thee and not for me. Laws cease to have meaning when people realize and internalize the idea that they are just tools of the elite to keep the poors in line instead of proper instruments of justice that apply to everyone equally. That's an extremely dangerous thing for the public to realize and internalize, for obvious reasons.
"We were just following the rules" got people justifiably hanged not so many years ago. There must be principle behind what it is you would enforce, or you're not one of the good guys. If you give a shit about "currently illegal", I won't spend any more time listening to or worrying about what you think should be legal.
And when you're targeting someone / something with unlimited lawyers, you'd better have ironclad evidence that exactly that happened in exactly the way the claim is written.
But untreated (at the time, no penicillin) syphilis turned him into a mental pre-teen after his release, so I guess the universe serves some justice where the laws of the land do not.
It's pretty clear that Meta wasn't about profit, given that no amount of "sunk cost" could explain what happened. That had more to do with self-aggrandizement and his belief that he was some sort of digital messiah that would get to usher humanity into another world.
He bought the best protection around for breaking the law.
IIRC, Facebook's cash is more like $81-82 billion.
This common argument to not take market cap valuations seriously doesn't hold.
True, Meta as an entire entity is not liquid. A forced sale in entirety would produce a massive reduction in compensation. But that is a highly unlikely and contingent reduction.
It is also true that if you have Meta's equivalent in cash, the value of the cash is likely to drop, while the value of Meta likely to grow, over any appreciable time. In that sense, $X cash is worth much "less" than the $X market cap.
These seeming contradictions are the result of different practical tradeoffs in structures of wealth. Not because market caps reflect misleading or overstated accounting.
Or do you mean a greater vs. lesser market cap? As compared to what?
If market cap was intrinsic value underlying itself, the business would be irrelevant. That is a circular “origin” of value that even novice investors would want to sell out of. That doesn’t work.
Success that matters for investors isn’t evidenced by a high market cap. But by a market cap not keeping up with business growth. I.e. shares becoming undervalued. By actual/predicted growth increasing faster than cap, or cap falling faster than actual/predicted downturns.
(I think, someone please correct me of I'm wrong?)
Yes, there are specialized products catered to billionaires. But those aren't getting them better rates than someone with a $200k portfolio (Zuck is not conventionally a less risky borrower than the Options Clearing Corporation!). They exist to work around the fact that some borrowers can't just casually liquidate their stock on the open market, let alone at face value. By all accounts these products are more expensive than retail.
Mostly this is an expensive (but maybe still less expensive than taxes, depending on the rate environment—it's more of a no-brainer in ZIRPland) way to diversify out of a single-stock portfolio without selling by adding leverage. At Zuck's age, it's still very unlikely to make sense to borrow instead of sell to spend. He's been known to pay real taxes in the past, they just look small relative to his imputed wealth growth because rich people don't spend a lot relative to their wealth growth because they, quite by definition, have a lot of wealth.
Any asset a bank is willing to take is collateral has the same issue, it’s just very pronounced in this instance.
If you take your idea at face value, anyone who borrows against their property to renovate/upgrade would be up for tax.
Also, you'd totally gut retail home equity lending as collateral damage, with disastrous social policy consequences.
It's similar to a reverse mortgage. Say Fred and Wilma own a house worth $4MM with no mortgage on it. With a reverse mortgage a bank will lend them $2MM. Fred and Wilma make no payments and continue to live in their house, spending the $2MM while the interest on that loan just increases the amount they owe the bank. After both Fred and Wilma have passed away the house is sold and the proceeds are used to pay back the outstanding loan. If there's still money left over, it goes to their heirs. If the sale comes up short, the bank loses money, which is why these reverse mortgages are typically less than 50% of the value of the house and they typically have higher interest rates than conventional mortgages. From Fred and Wilma's point of view, they can use the value of their house now, while continuing to live in it. They essentially spend their children's inheritance.
There's some interesting exceptions, like how Musk has managed to sell Tesla shares totalling more or less as much as the business itself has made in total lifetime revenue; but even then, Musk's theoretical net worth is very different from how much he could get if he was forced to sell all his shares suddenly.
Owner-CEOs like Musk and Zuckerberg get all the effects of such randomness, but the only examples I can think of such people getting into billion-dollar legal troubles tend to be examples which go on to sink their companies completely, so I'm not sure what impact a fine of "merely" 10% of cash reserves would do to investor confidence as expressed in share price. And this is not the only legal case Meta's facing right now.
MacKenzie Scott (Jeff Bezos' ex wife) show it can be turned into real money. As of December 2025 She had given away $7.1 billion in 2025 charitable donations, and $26.3 billion since 2019.
In reality there is the ability to execute on the shares to turn them into real money.
Jeff Bezos holds less than 10% of Amazon stock himself. Which is a huge amount of money, and a not insignificant amount of which can be turned into "real" money and even with some decline is still a phenomenal amount.
In that same time period the stock valuation has more than doubled.
You have a house? You can sell it next month for a certain price, sell it tomorrow for a bit less.
You own every house in your town? You can still sell a few for “full price” but liquidation of all of them is going to be a shock to the market.
I sincerely doubt that Meta's share price would crash as a result of Zuckerberg getting an expensive judgement.
If he did the right thing, then we should all support his choice to use it under fair use.
Freedom means that the state shouldn't punish a public benefactor.
It makes me furious to see programmers fighting against an open source hero.
If it was closed source for Meta profit, I understand.
But they gave it away free, so it infuriates me that people support damages for a public benefactor.
Churches and schools get free money from the government. We need a rule that open AI (not the company, I mean the actuality), can torrent whatever they want because it's for the public good.
Otherwise the rich companies win and can pay their sources and the small guys are screwed.
If Meta has to pay for their training data, they will need to profit from it and won't be able to offer it free.
Nobody in their right mind would ever support the publishers here.