Children's Internet Protection Act (CIPA) Ruling eBook

United States District Court for the Eastern District of Pennsylvania
This eBook from the Gutenberg Project consists of approximately 196 pages of information about Children's Internet Protection Act (CIPA) Ruling.

identify 120, it would have performed with a recall of 40%.  This would be analogous to a filter that underblocked 60% of the material in a category.  In automated classification systems, there is always a tradeoff between precision and recall.  In the animal-picture example, the recall could be improved by using a looser set of criteria to identify the dog pictures in the set, such as any animal with four legs, and all the dogs would be identified, but cats and other animals would also be included, with a resulting loss of precision.  The same tradeoff exists between rates of overblocking and underblocking in filtering systems that use automated classification systems.  For example, an automated system that classifies any Web page that contains the word “sex” as sexually explicit will underblock much less, but overblock much more, than a system that classifies any Web page containing the phrase “free pictures of people having sex” as sexually explicit.
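
The animal-picture tradeoff can be made concrete with a small sketch. The toy picture set and the "floppy_ears" feature below are assumptions invented for illustration, not data from the record; the strict rule is chosen so that it reproduces a recall of 40%.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Picture:
    species: str       # ground-truth label
    legs: int
    floppy_ears: bool  # hypothetical feature, used only for illustration


# Toy set (assumed): 5 dogs (2 with floppy ears), 3 cats, 2 birds.
PICTURES = (
    [Picture("dog", 4, True)] * 2
    + [Picture("dog", 4, False)] * 3
    + [Picture("cat", 4, False)] * 3
    + [Picture("bird", 2, False)] * 2
)


def precision_recall(pictures, is_dog):
    """Return (precision, recall) of the predicate `is_dog` on `pictures`."""
    selected = [p for p in pictures if is_dog(p)]
    hits = sum(1 for p in selected if p.species == "dog")
    total_dogs = sum(1 for p in pictures if p.species == "dog")
    precision = hits / len(selected) if selected else 1.0
    recall = hits / total_dogs if total_dogs else 1.0
    return precision, recall


# Strict criteria: four legs AND floppy ears. Loose criteria: four legs only.
strict = precision_recall(PICTURES, lambda p: p.legs == 4 and p.floppy_ears)
loose = precision_recall(PICTURES, lambda p: p.legs == 4)

print(f"strict: precision={strict[0]:.3f} recall={strict[1]:.3f}")
print(f"loose:  precision={loose[0]:.3f} recall={loose[1]:.3f}")
```

On this toy set the strict rule selects only dogs but finds 2 of the 5 (recall 40%, analogous to underblocking 60%), while the loose rule finds every dog but also selects the three cats, so its precision falls to 62.5%.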

This tradeoff between overblocking and underblocking applies not just to automated classification systems, but also to filters that use only human review.  Given the approximately two billion pages that exist on the Web, the 1.5 million new pages that are added daily, and the rate at which content on existing pages changes, if a filtering company blocks only those Web pages that have been reviewed by humans, it will be impossible, as a practical matter, to avoid vast amounts of underblocking.  Techniques used by human reviewers such as blocking at the IP address level, domain name level, or directory level reduce the rates of underblocking, but necessarily increase the rates of overblocking, as discussed above.  To use a simple example, it would be easy to design a filter intended to block sexually explicit speech that completely avoids overblocking.  Such a filter would have only a single sexually explicit Web site on its control list, which could be re-reviewed daily to ensure that its content does not change.  While there would be no overblocking problem with such a filter, it would have a severe underblocking problem, as it would fail to block all the sexually explicit speech on the Web other than the one site on its control list.  Similarly, it would also be easy to design a filter intended to block sexually explicit speech that completely avoids underblocking.  Such a filter would operate by permitting users to view only a single Web site, e.g., the Sesame Street Web site.  While there would be no underblocking problem with such a filter, it would have a severe overblocking problem, as it would block access to millions of non-sexually explicit sites on the Web other than the Sesame Street site.
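
The two extreme filters described above can be sketched in a few lines. The eight-site "Web", its host names, and the labels below are assumptions invented for illustration; real rates would depend on the actual Web and the actual control lists.

```python
# Toy Web (assumed): host name -> whether the site is in the targeted category.
TOY_WEB = {
    "explicit-a.example": True,
    "explicit-b.example": True,
    "explicit-c.example": True,
    "explicit-d.example": True,
    "sesame.example": False,
    "news.example": False,
    "library.example": False,
    "health.example": False,
}


def block_rates(web, is_blocked):
    """Return (overblock_rate, underblock_rate) for a blocking rule."""
    targeted = [site for site, flagged in web.items() if flagged]
    other = [site for site, flagged in web.items() if not flagged]
    # Overblocking: non-targeted sites that the rule blocks anyway.
    over = sum(1 for site in other if is_blocked(site)) / len(other)
    # Underblocking: targeted sites that the rule fails to block.
    under = sum(1 for site in targeted if not is_blocked(site)) / len(targeted)
    return over, under


# Filter 1: a control list holding a single reviewed site.
CONTROL_LIST = {"explicit-a.example"}
single_block = block_rates(TOY_WEB, lambda site: site in CONTROL_LIST)

# Filter 2: users may view only a single permitted site.
PERMITTED = {"sesame.example"}
single_allow = block_rates(TOY_WEB, lambda site: site not in PERMITTED)

print(f"single-site control list: over={single_block[0]:.2f} under={single_block[1]:.2f}")
print(f"single permitted site:    over={single_allow[0]:.2f} under={single_allow[1]:.2f}")
```

In this sketch the single-site control list overblocks nothing but misses three of the four targeted sites, and the single permitted site underblocks nothing but blocks three of the four other sites. On the real Web both error rates would approach 100% rather than 75%.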

Copyrights
Project Gutenberg
Children's Internet Protection Act (CIPA) Ruling from Project Gutenberg. Public domain.