PROBABILISTIC APPROACHES TO BIG DATA PROCESSING IN COLLABORATIVE FILTERING BASED RECOMMENDATION SYSTEMS

Роман Анатолійович Руденський, Вікторія Володимирівна Руденська

Abstract


This article investigates the state and prospective of recommendation systems implementation under the conditions of Big Data development. The article reveals key problems that arise when collaborative fil-tering based recommendations are implemented and suggests approach to overcome these problems. The proposed approach is based on ideas of expanding original user-item rating matrix and implementing minhash trick to estimate Jaccard similarity measure. Ability to track users’ behavior and account for it while decision making revealed the new field related to market research, data and computer science that turned into online recommendations system. 

Keywords


recommendation systems; big data; collaborative filtering; Jaccard similarity; minhash

Full Text:

PDF

References


Harper, F. M. & Li, X. & Chen, Y. & Konstan, J. A. (2005). An economic model of user rating in an online recommender system, in User Modeling 2005, vol. 3538 of Lecture Notes in Computer Science, Springer, August 2005, 307–316,

Resnick, P. & Iacovou, N. & Suchak, M. Bergstrom, P. & Riedl, J. (1994) GroupLens: an open architecture for collaborative filtering of netnews,” in ACM CSCW ’94, 175–186.

Bennett J. & Lanning, S. (2007). The netflix prize, in KDD Cup and Workshop’07.

Herlocker, J. & Konstan, J. A. & Riedl, J. (2002). An empirical analysis of design choices in neighborhood-based collaborative filtering algorithms,” Information Retrieval, vol. 5, no. 4, 287–310.

Sarwar, B. M. & Karypis, G. & Kon-stan, J. A. & Riedl, J. T. (2000). Application of dimensionality reduction in recommender system — a case study,” in WebKDD 2000.

Brand, M. (2003). Fast online SVD revisions for lightweight recommender systems, in SIAM International Conference on Data Mining, 37–46.

Broder, Andrei Z. (1997). On the resemblance and containment of documents", Compression and Complexity of Sequences: Proceedings, Positano, Amalfitan Coast, Salerno, Italy, June 11-13, 1997 (PDF), IEEE, 21–29.

Broder, Andrei Z. & Charikar, Moses & Frieze, Alan M. & Mitzenmacher, Michael (1998), Minwise independent permutations, Proc. 30th ACM Symposium on Theory of Computing (STOC '98), New York, NY, USA: Association for Computing Machinery, 327–336

Leskovec, J. & Rajaraman, A. & Ullman, J.D. (2014). Mining of massive datasets. Cambridge University PressReferences , 476.


Refbacks

  • There are currently no refbacks.