Law.com Subscribers SAVE 30%

Call 855-808-4530 or email [email protected] to receive your discount on a new subscription.

Internet 'Data Scraping'

By Anthony J. Dreyer and Jamie Stockton
August 02, 2013

The proliferation of Internet access and mobile devices has led to an exponential explosion of content on the Web, creating a vast repository of “publicly available” information. This includes not only news, business and financial information, but also personal data, movie and restaurant reviews, concert ticket sales, flight information, and a virtually endless array of other categories. This same technological explosion, however, has made it far easier for third parties to extract this data for commercial sale and use ' and to do so for free and without authorization. This data extraction, commonly referred to as “scraping,” “crawling,” or “spidering” (collectively “scraping”), creates legal issues and concerns for both sides of this issue ' those who want to scrape, and those who want to protect against scraping of their websites. See, EF Cultural Travel BV v. Zefer, 318 F.3d 58, 60 (1st Cir. 2003) (“A scraper, also called a 'robot' or 'bot,' is nothing more than a computer program that accesses information contained in a succession of webpages stored on the accessed computer”); eBay v. Bidder's Edge, 100 F. Supp. 2d 1058, 1060 (N.D. Cal. 2000).

While it is possible to embed instructions on websites that inform the scraping software whether scraping is permitted (called “robot.txt” files), compliance with such instructions is voluntary. See, Bidder's Edge , at 1061.

Read These Next
Why So Many Great Lawyers Stink at Business Development and What Law Firms Are Doing About It Image

Why is it that those who are best skilled at advocating for others are ill-equipped at advocating for their own skills and what to do about it?

Bankruptcy Sales: Finding a Diamond In the Rough Image

There is no efficient market for the sale of bankruptcy assets. Inefficient markets yield a transactional drag, potentially dampening the ability of debtors and trustees to maximize value for creditors. This article identifies ways in which investors may more easily discover bankruptcy asset sales.

The DOJ's Corporate Enforcement Policy: One Year Later Image

The DOJ's Criminal Division issued three declinations since the issuance of the revised CEP a year ago. Review of these cases gives insight into DOJ's implementation of the new policy in practice.

A Lawyer's System for Active Reading Image

Active reading comprises many daily tasks lawyers engage in, including highlighting, annotating, note taking, comparing and searching texts. It demands more than flipping or turning pages.

Blockchain Domains: New Developments for Brand Owners Image

Blockchain domain names offer decentralized alternatives to traditional DNS-based domain names, promising enhanced security, privacy and censorship resistance. However, these benefits come with significant challenges, particularly for brand owners seeking to protect their trademarks in these new digital spaces.