Google News is a free news aggregator provided and operated by Google Inc. It selects the most up-to-date information from thousands of publications using an automatic aggregation algorithm. Launched in September 2002, the service was labeled a beta test for over three years, until January 2006. The initial idea was developed by Krishna Bharat.
more from Wikipedia
Collaborative filtering
Collaborative filtering (CF) is a technique used by some recommender systems. The term has two senses, a narrow one and a more general one. In the more general sense, collaborative filtering is the process of filtering for information or patterns using techniques that involve collaboration among multiple agents, viewpoints, data sources, and so on. Applications of collaborative filtering typically involve very large data sets.
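As an illustration of the narrow sense (predicting what one user will like from the behavior of many users), here is a minimal user-based sketch. The function names, the click data, and the choice of Jaccard overlap as the similarity weight are all assumptions made for this example; it is not the method of any particular system.

```python
def jaccard(a, b):
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def recommend(target, clicks, top_n=3):
    """Rank items the target user has not seen, weighting each other
    user's items by how similar that user's history is to the target's.

    `clicks` maps a user name to the set of items that user clicked
    (hypothetical data layout chosen for this sketch).
    """
    seen = clicks[target]
    scores = {}
    for user, items in clicks.items():
        if user == target:
            continue
        weight = jaccard(seen, items)
        if weight == 0.0:
            continue  # users with no overlap contribute nothing
        for item in items - seen:
            scores[item] = scores.get(item, 0.0) + weight
    # Highest score first; ties broken alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [item for item, _ in ranked[:top_n]]


clicks = {
    "ann": {"a", "b", "c"},
    "bob": {"a", "b", "d"},
    "cat": {"x", "y"},
    "dan": {"b", "c", "d", "e"},
}
print(recommend("ann", clicks))
```

With this toy data, item "d" is suggested ahead of "e" because two similar users clicked it. Comparing every pair of users this way does not scale to very large data sets, which is one reason approximate techniques are used in practice.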
Personalization
Personalization involves using technology to accommodate the differences between individuals. Once confined mainly to the Web, it is increasingly becoming a factor in education, health care, television, and in both "business to business" and "business to consumer" settings.
Google
Google Inc. is an American multinational corporation that provides Internet-related products and services, including Internet search, cloud computing, software, and advertising technologies. Advertising revenues from AdWords generate almost all of the company's profits. The company was founded by Larry Page and Sergey Brin while both attended Stanford University. Together, Brin and Page own a stake of about 16 percent in the company.
Scalability
In electronics, scalability is the ability of a system, network, or process to handle a growing amount of work in a capable manner, or its ability to be enlarged to accommodate that growth. For example, it can refer to the capability of a system to increase total throughput under an increased load when resources (typically hardware) are added.
MinHash
In computer science, MinHash (or the min-wise independent permutations locality sensitive hashing scheme) is a technique for quickly estimating how similar two sets are. The scheme was invented by Andrei Broder, and initially used in the AltaVista search engine to detect duplicate web pages and eliminate them from search results. It has also been applied in large-scale clustering problems, such as clustering documents by the similarity of their sets of words.
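The idea can be sketched in a few lines: for each of several hash functions, keep only the minimum hash value over a set's elements, then estimate the Jaccard similarity of two sets as the fraction of positions where their minimum values agree. The sketch below is illustrative only; the helper names and the use of seeded SHA-1 digests as a stand-in for random permutations are assumptions of this example, not part of the original scheme's definition.

```python
import hashlib


def _hash(seed, item):
    """Seeded 64-bit hash; each seed stands in for one random permutation."""
    digest = hashlib.sha1(f"{seed}:{item}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def minhash_signature(items, num_hashes=128):
    """Return the minimum hash value of a non-empty set under each seed."""
    return [min(_hash(seed, item) for item in items)
            for seed in range(num_hashes)]


def estimate_jaccard(sig_a, sig_b):
    """Fraction of signature positions on which the two sets agree."""
    matches = sum(1 for a, b in zip(sig_a, sig_b) if a == b)
    return matches / len(sig_a)


def exact_jaccard(a, b):
    """Exact Jaccard similarity, for comparison with the estimate."""
    return len(a & b) / len(a | b)


doc_a = {"google", "news", "personalization", "scalable"}
doc_b = {"google", "news", "collaborative", "filtering"}
sig_a = minhash_signature(doc_a)
sig_b = minhash_signature(doc_b)
print("estimated:", estimate_jaccard(sig_a, sig_b))
print("exact:    ", exact_jaccard(doc_a, doc_b))
```

The signatures have a fixed length regardless of set size, so comparing two large sets costs the same as comparing two small ones; more hash functions give a tighter estimate at the cost of longer signatures.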
Probabilistic latent semantic analysis
Probabilistic latent semantic analysis (PLSA), also known as probabilistic latent semantic indexing (PLSI, especially in information retrieval circles), is a statistical technique for the analysis of two-mode and co-occurrence data. In effect, one can derive a low-dimensional representation of the observed variables in terms of their affinity to certain hidden variables, just as in latent semantic analysis. PLSA evolved from latent semantic analysis, adding a sounder probabilistic model.
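For reference, the model is commonly written as a mixture over a hidden variable. The symbols here are assumptions of this note rather than notation from the excerpt: d is a document (or user), w a word (or item), and z a latent class.

```latex
P(d, w) = \sum_{z} P(z)\, P(d \mid z)\, P(w \mid z)
        = P(d) \sum_{z} P(z \mid d)\, P(w \mid z)
```

The two forms are equivalent by Bayes' rule. The conditional distributions are usually fitted with the expectation-maximization algorithm.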
Latent semantic indexing
Latent semantic indexing (LSI) is an indexing and retrieval method that uses a mathematical technique called singular value decomposition (SVD) to identify patterns in the relationships between the terms and concepts contained in an unstructured collection of text. LSI is based on the principle that words that are used in the same contexts tend to have similar meanings.
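In matrix terms, the SVD step can be summarized as follows. The notation is an assumption of this note: X is the term-document matrix, and k is the number of retained dimensions.

```latex
X = U \Sigma V^{\top}, \qquad
X \approx X_k = U_k \Sigma_k V_k^{\top}
```

Here U_k and V_k hold the first k left and right singular vectors, and \Sigma_k holds the k largest singular values. Terms and documents are then compared in the resulting k-dimensional space rather than by exact word overlap.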