Learning URL patterns for webpage de-duplication
Authors:
Hema Swetha Koppula
Yahoo! Labs, Bangalore, India
Krishna P. Leela
Yahoo! Labs, Bangalore, India
Amit Agarwal
Picsquare.com, Bangalore, India
Krishna Prasad Chitrapura
Yahoo! Labs, Bangalore, India
Sachin Garg
Yahoo! Labs, Bangalore, India
Amit Sasturkar
Yahoo! Inc., Sunnyvale, CA, USA
2010 Article
Bibliometrics
· Downloads (6 Weeks): 5
· Downloads (12 Months): 34
· Downloads (cumulative): 305
· Citation Count: 7
Published in:
· Proceeding
WSDM '10
Proceedings of the third ACM international conference on Web search and data mining
Pages 381-390
ACM New York, NY, USA ©2010
ISBN: 978-1-60558-889-6
doi: 10.1145/1718487.1718535
Tags:
algorithms
decision trees
general
generalization
mapreduce
page importance
performance
search engines
search process
site-specific delimiters
webpage de-duplication