Nvidia’s ISP piracy defense backfires as judge refuses to dismiss copyright lawsuit over more than 197,000 pirated books — scripts in NeMo Framework allegedly ‘have no other purpose’ than to speed up infringement

Published: 4 days ago (May 7, 2026 at 07:54 AM EDT)

2 min read

Source: Tom’s Hardware

Nvidia logo

U.S. District Judge Jon Tigar denied Nvidia’s request to dismiss a copyright infringement lawsuit. The case alleges that Nvidia’s AI‑powered NeMo Megatron Framework was used to facilitate the illegal downloading and preprocessing of copyrighted eBooks.

The lawsuit

Datasets involved:
- Bibliotik – a private eBook torrent tracker containing over 197,000 books.
- Books3 – a dataset that incorporated Bibliotik data.
- The Pile – an 800 GB collection that included Books3 and was used to train Nvidia’s large language models (LLMs).
Allegations: Specific scripts within the NeMo Megatron Framework were designed solely to speed up the acquisition and processing of the copyrighted material, giving them “no other purpose” than to facilitate infringement.
Judge Tigar’s reasoning: The court distinguished Nvidia’s situation from cases like Sony and Cox, noting that the scripts themselves, not the broader framework, were allegedly intended for infringing use.

Nvidia’s defense

Nvidia argued that the NeMo Megatron Framework has legitimate, non‑infringing uses and cited the Supreme Court’s Cox v. Sony decision, which held that service providers are not automatically liable for users’ piracy. The company claimed that, under precedent, merely providing a service to the public does not constitute copyright infringement.

Meta: Facing a lawsuit alleging the use of pirated material for training its models. Meta has argued that using such material is legal if the content is not directly “seeded” into its products.
- Source: Tom’s Hardware – Meta defends using pirated material
Google: Advocating for AI‑scraping to be treated as fair use, emphasizing the need for “copyright systems that enable appropriate and fair use” while allowing opt‑outs for data owners.
- Source: Tom’s Hardware – Google AI scraping as fair use

These cases illustrate the broader legal debate over whether AI developers can rely on existing copyright doctrines when training models on large, often uncurated datasets.

Nvidia’s ISP piracy defense backfires as judge refuses to dismiss copyright lawsuit over more than 197,000 pirated books — scripts in NeMo Framework allegedly ‘have no other purpose’ than to speed up infringement

The lawsuit

Nvidia’s defense

Related posts

AMD's excellent Radeon RX 9070 with 16 GB of VRAM hits all-time low pricing — PowerColor Hellhound variant is 23% off list price

MIT researchers revive 40-year-old triangular zipper concept now made possible by 3D printing, creates shape-shifting robots and deployable structures — 3D-printed 'Y-Zipper' turns floppy tentacles into rigid beams in seconds

Former Epic director is building a European rival to the Unreal and Unity game engines — 'The Immense Engine' dev sees opportunity for AI agents to 'do the work of ten or fifteen people'

China's Hanyuan-2 debuts as 'world's first' dual-core quantum computer — 200-qubit claims incredible power efficiency, but lacks critical performance benchmarks

The lawsuit

Nvidia’s defense

Related AI copyright cases

Related posts

AMD's excellent Radeon RX 9070 with 16 GB of VRAM hits all-time low pricing — PowerColor Hellhound variant is 23% off list price

MIT researchers revive 40-year-old triangular zipper concept now made possible by 3D printing, creates shape-shifting robots and deployable structures — 3D-printed 'Y-Zipper' turns floppy tentacles into rigid beams in seconds

Former Epic director is building a European rival to the Unreal and Unity game engines — 'The Immense Engine' dev sees opportunity for AI agents to 'do the work of ten or fifteen people'

China's Hanyuan-2 debuts as 'world's first' dual-core quantum computer — 200-qubit claims incredible power efficiency, but lacks critical performance benchmarks