skip to main content
article

Internet content filtering using isotonic separation on content category ratings

Published:01 February 2007Publication History
Skip Abstract Section

Abstract

The World Wide Web has enabled anybody with a low cost Internet connection to access vast information repositories. Some of these repositories contain information (e.g., hate speech and pornography) that is considered objectionable, especially for children to view. Several efforts---legal and technical---are underway to protect children and the generic public from accessing this type of content. We propose a technical approach utilizing a recently proposed technique called isotonic separation for filtering with content metadata if they satisfy monotone conditions. We illustrate this approach using a category rating method of PICS. In essence, we formulate the Internet content filtering problem as a classification problem on content metadata and report on experiments we conducted with the isotonic separation technique.

References

  1. Ahuja, R. K., Magnanti, T. L., and Orlin, J. B. 1993. Network Flows. Prentice-Hall, Englewood Cliffs, NJ.Google ScholarGoogle Scholar
  2. Balkin, J. M., Noveck, B. S., and Roosevelt, K. 1999. Filtering the Internet: A best practice model. Working paper, Yale Information Society Project, Yale Law School.Google ScholarGoogle Scholar
  3. Barlow, R. E., Bartholemew, D. J., Bremner, J. M., and Brunk, H. D. 1972. Statistical Inference under Order Restrictions. John Wiley & Sons, New York.Google ScholarGoogle Scholar
  4. Brickley, D. and Guha, R. V. 2004. RDF vocabulary description language 1.0: RDF schema. Tech. Rep. REC-rdf-schema-20040210, World Wide Web Consortium.Google ScholarGoogle Scholar
  5. Brickley, D. and Swick, R. R. 2000. PICS rating vocabularies in XML/RDF. Tech. Rep. REC-rdf-pics-20000327, World Wide Web Consortium.Google ScholarGoogle Scholar
  6. Brown, D. E., Corruble, V., and Pittard, C. L. 1993. A comparison of decision tree classifiers with backpropagation neural networks for multimodal classification problems. Pattern Recognition 26, 6, 953--961.Google ScholarGoogle ScholarCross RefCross Ref
  7. Caulkins, J. P., Ding, W., Duncan, G., Krishnan, R., and Nyberg, E. 2006. A method for managing access to web pages: Filtering by statistical classification (FSC) applied to text. Decision Support Systems 42, 1, 144--161. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Chandrasekaran, R., Ryu, Y. U., and Jacob, V. 1998. Breast cancer diagnosis using an isotonic separation approach. Working paper, Department of Information Systems and Operations Management, School of Management, The University of Texas at Dallas.Google ScholarGoogle Scholar
  9. Chandrasekaran, R., Ryu, Y. U., Jacob, V., and Hong, S. 2005. Isotonic separation. INFORMS J. Comput. 17, 4, 462--474. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Clark, P. and Boswell, R. 1991. Rule induction with CN2: Some recent improvements. In Proceedings of the Fifth European Working Session on Learning (EWSL-91). Springer, Berlin, 151--163. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Clark, P. and Niblett, T. 1989. The CN2 induction algorithm. Mach. Learn. 3, 4, 261--283. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Consumers Union. 2005. Filtering software: Better, but still fallible. Consumer Reports 70, 6 (June), 36--38.Google ScholarGoogle Scholar
  13. Cranor, L. F. 2002. Web Privacy with P3P. O'Reilly & Associates, Inc., Sebastopol, CA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Cranor, L. F., Langheinrich, M., Marchiori, M., Presler-Marshall, M., and Reagle, J. 2002. The platform for privacy preferences 1.0 (P3P1.0) specification. Tech. Rep. REC-P3P-20020416, World Wide Web Consortium.Google ScholarGoogle Scholar
  15. Evans, C., Feather, C. D. W., Hopmann, A., Presler-Marshall, M., and Resnick, P. 1997. PICSRules 1.1 specification. Tech. Rep. REC-PICSRules-971229, World Wide Web Consortium.Google ScholarGoogle Scholar
  16. Garfinkel, S. 1997. Web Security & Commerce. O'Reilly & Associates, Inc., Sebastopol, CA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Kiang, M. Y. 2003. A comparative assessment of classification methods. Decision Supp. Syst. 35, 5, 441--454. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Krauskopf, T., Miller, J., Resnick, P., and Treese, W. 1996. PICS label distribution label syntax and communication protocols, version 1.1. Tech. Rep. REC-PICS-labels-961031, World Wide Web Consortium.Google ScholarGoogle Scholar
  19. Manola, F. and Miller, E. 2004. RDF primer. Tech. Rep. REC-rdf-primer-20040210, World Wide Web Consortium.Google ScholarGoogle Scholar
  20. McBride, B., Wenning, R., and Cranor, L. 2002. An RDF schema for P3P. Tech. Rep. NOTE-p3p-rdfschema-20020125, World Wide Web Consortium.Google ScholarGoogle Scholar
  21. Miller, J., Resnick, P., and Singer, D. 1996. Rating services and rating systems (and their machine readable descriptions), version 1.1. Tech. Rep. REC-PICS-services-961031, World Wide Web Consortium.Google ScholarGoogle Scholar
  22. Murthy, S. K., Kasif, S., and Salzberg, S. 1994. A system for induction of oblique decision trees. J. Art. Intell. Res. 2, 1, 1--32.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Murty, K. G. 1992. Network Programming. Prentice-Hall, Englewood Cliffs, NJ. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Quinlan, J. R. 1986. Induction to decision trees. Machine Learning 1, 81--106. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Quinlan, J. R. 1993. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Ryu, Y. U., Chandrasekaran, R., and Jacob, V. 2004. Prognosis using an isotonic prediction technique. Manag. Sci. 50, 6, 777--785. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Ryu, Y. U. and Yue, W. T. 2005. Firm bankruptcy prediction: Experimental comparison of isotonic separation and other classification approaches. IEEE Trans. Syst., Man, and Cybernetics, Part A: Systems and Humans 35, 5, 727--737. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Wang, J. Z., Li, J., Wiederhold, G., and Firschein, O. 1998b. Classifying objectionable websites based on image content. In Interactive Distributed Multimedia Systems and Telecommunication Services, T. Plagemann and V. Goebel, Eds. Lecture Notes in Computer Science, vol. 1483. Springer-Verlag, Berlin, 113--124. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Wang, J. Z., Li, J., Wiederhold, G., and Firschein, O. 1998a. System for screening objectionable images. Comput. Comm. 21, 15, 1355--1360. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Internet content filtering using isotonic separation on content category ratings

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Internet Technology
          ACM Transactions on Internet Technology  Volume 7, Issue 1
          February 2007
          184 pages
          ISSN:1533-5399
          EISSN:1557-6051
          DOI:10.1145/1189740
          Issue’s Table of Contents

          Copyright © 2007 ACM

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 1 February 2007
          Published in toit Volume 7, Issue 1

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • article

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader
        About Cookies On This Site

        We use cookies to ensure that we give you the best experience on our website.

        Learn more

        Got it!