July  2016, 1(2&3): 261-274. doi: 10.3934/bdia.2016008

Time aware topic based recommender system

1. 

Department of Electrical Engineering and Computer Science, York University, 4700 Keele Street, Toronto, Ontario M3J 1P3, Canada, Canada, Canada, Canada

Received  August 2016 Revised  October 2016 Published  November 2016

News recommender systems efficiently handle the overwhelming number of news articles, simplify navigations, and retrieve relevant information. Many conventional news recommender systems use collaborative filtering to make recommendations based on the behavior of users in the system. In this approach, the introduction of new users or new items can cause the cold start problem, as there will be insufficient data on these new entries for the collaborative filtering to draw any inferences for new users or items. Content-based news recommender systems emerged to address the cold start problem. However, many content-based news recommender systems consider documents as a bag-of-words neglecting the hidden themes of the news articles. In this paper, we propose a news recommender system leveraging topic models and time spent on each article. We build an automated recommender system that is able to filter news articles and make recommendations based on users' preferences. We use topic models to identify the thematic structure of the corpus. These themes are incorporated into a content-based recommender system to filter news articles that contain themes that are of less interest to users and to recommend articles that are thematically similar to users' preferences. Our experimental studies show that utilizing topic modeling and spent time on a single article can outperform the state of the arts recommendation techniques. The resulting recommender system based on the proposed method is currently operational at The Globe and Mail (http://www.theglobeandmail.com/).
Citation: Elnaz Delpisheh, Aijun An, Heidar Davoudi, Emad Gohari Boroujerdi. Time aware topic based recommender system. Big Data & Information Analytics, 2016, 1 (2&3) : 261-274. doi: 10.3934/bdia.2016008
References:
[1]

G. Adomavicius and A. Tuzhilin, Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions,, IEEE Transaction on Knowledge and Data Engineering, 17 (2005), 734. doi: 10.1109/TKDE.2005.99. Google Scholar

[2]

D. M. Andrzejewski, Incorporating Domain Knowledge in Latent Topic Models,, PhD thesis, (2010). Google Scholar

[3]

D. M. Blei, A. Y. Ng and M. I. Jordan, Latent dirichlet allocation,, The Journal of Machine Learning Research, 3 (2003), 993. Google Scholar

[4]

J. Bobadilla, F. Ortega, A. Hernando and J. Bernal, A collaborative filtering approach to mitigate the new user cold start problem,, Knowledge-Based System, 26 (2012), 225. doi: 10.1016/j.knosys.2011.07.021. Google Scholar

[5]

H. Borges and A. Lorena, A survey on recommender systems for news data,, in Smart Information and Knowledge Management (eds. E. Szczerbicki and N. Nguyen), (2010), 129. doi: 10.1007/978-3-642-04584-4_6. Google Scholar

[6]

R. Burke, Hybrid recommender systems: Survey and experiments,, User Modeling and User-Adapted Interaction, 12 (2002), 331. doi: 10.1023/A:1021240730564. Google Scholar

[7]

S.-H. Cha, Comprehensive survey on distance/similarity measures between probability density functions,, International Journal of Mathematical Models and Methods in Applied Sciences, 1 (2007), 300. Google Scholar

[8]

T.-M. Chang and W.-F. Hsiao, Lda-based personalized document recommendation,, Proceedings of the PACIS, (2013). Google Scholar

[9]

M. D. Ekstrand, J. T. Riedl and J. A. Konstan, Collaborative filtering recommender systems,, Journal of Foundations and Trends in Human-Computer Interaction, 4 (2011), 81. doi: 10.1561/1100000009. Google Scholar

[10]

T. Griffiths, Gibbs sampling in the generative model of latent dirichlet allocation,, Standford University, 518 (2002), 1. Google Scholar

[11]

T. L. Griffiths and M. Steyvers, Finding scientific topics,, Proceeding of the National Academy of Sciences of the United States of America, 101 (2004), 5228. doi: 10.1073/pnas.0307752101. Google Scholar

[12]

X. He, T. Chen, M.-Y. Kan and X. Chen, Trirank: Review-aware explainable recommendation by modeling aspects,, in Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, (2015), 1661. Google Scholar

[13]

G. Heinrich, Parameter estimation for text analysis,, , (). Google Scholar

[14]

M. D. Hoffman, D. M. Blei and F. R. Bach, Online learning for latent dirichlet allocation,, in NIPS (eds. J. D. Lafferty, (2010), 856. Google Scholar

[15]

D. Jurafsky and J. H. Martin, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition,, 1st edition, (2000). Google Scholar

[16]

Q. V. Le and T. Mikolov, Distributed representations of sentences and documents,, CoRR, (). Google Scholar

[17]

D. D. Lee and H. S. Seung, Algorithms for non-negative matrix factorization,, in Advances in Neural Information Processing Systems 13 (eds. T. K. Leen, (2001), 556. Google Scholar

[18]

G. Linden, B. Smith and J. York, Amazon.com recommendations: Item-to-item collaborative filtering,, IEEE Internet Computing, 7 (2003), 76. doi: 10.1109/MIC.2003.1167344. Google Scholar

[19]

C. D. Manning and H. Schütze, Foundations of Statistical Natural Language Processing,, The MIT Press, (1999). Google Scholar

[20]

A. K. McCallum, Mallet: A machine learning for language toolkit, 2002,, , (). Google Scholar

[21]

T. Mikolov, I. Sutskever, K. Chen, G. Corrado and J. Dean, Distributed representations of words and phrases and their compositionality,, CoRR, (). Google Scholar

[22]

B. N. Miller, I. Albert, S. K. Lam, J. A. Konstan and J. Riedl, Movielens unplugged: Experiences with an occasionally connected recommender system,, in Proceedings of the 8th International Conference on Intelligent User Interfaces, (2003), 263. doi: 10.1145/604045.604094. Google Scholar

[23]

D. Z. Mária Bieliková Michal Kompan, Effective hierarchical vector-based news representation for personalized recommendation,, Computer Science and Information Systems, (): 303. Google Scholar

[24]

F. Ricci, L. Rokach and B. Shapira, Recommender Systems Handbook, chapter Introduction to Recommender Systems Handbook,, Springer US, (2011). Google Scholar

[25]

S. Tuarob, L. C. Pouchard and C. L. Giles, Automatic tag recommendation for metadata annotation using probabilistic topic modeling,, in Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries, (2013), 239. doi: 10.1145/2467696.2467706. Google Scholar

show all references

References:
[1]

G. Adomavicius and A. Tuzhilin, Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions,, IEEE Transaction on Knowledge and Data Engineering, 17 (2005), 734. doi: 10.1109/TKDE.2005.99. Google Scholar

[2]

D. M. Andrzejewski, Incorporating Domain Knowledge in Latent Topic Models,, PhD thesis, (2010). Google Scholar

[3]

D. M. Blei, A. Y. Ng and M. I. Jordan, Latent dirichlet allocation,, The Journal of Machine Learning Research, 3 (2003), 993. Google Scholar

[4]

J. Bobadilla, F. Ortega, A. Hernando and J. Bernal, A collaborative filtering approach to mitigate the new user cold start problem,, Knowledge-Based System, 26 (2012), 225. doi: 10.1016/j.knosys.2011.07.021. Google Scholar

[5]

H. Borges and A. Lorena, A survey on recommender systems for news data,, in Smart Information and Knowledge Management (eds. E. Szczerbicki and N. Nguyen), (2010), 129. doi: 10.1007/978-3-642-04584-4_6. Google Scholar

[6]

R. Burke, Hybrid recommender systems: Survey and experiments,, User Modeling and User-Adapted Interaction, 12 (2002), 331. doi: 10.1023/A:1021240730564. Google Scholar

[7]

S.-H. Cha, Comprehensive survey on distance/similarity measures between probability density functions,, International Journal of Mathematical Models and Methods in Applied Sciences, 1 (2007), 300. Google Scholar

[8]

T.-M. Chang and W.-F. Hsiao, Lda-based personalized document recommendation,, Proceedings of the PACIS, (2013). Google Scholar

[9]

M. D. Ekstrand, J. T. Riedl and J. A. Konstan, Collaborative filtering recommender systems,, Journal of Foundations and Trends in Human-Computer Interaction, 4 (2011), 81. doi: 10.1561/1100000009. Google Scholar

[10]

T. Griffiths, Gibbs sampling in the generative model of latent dirichlet allocation,, Standford University, 518 (2002), 1. Google Scholar

[11]

T. L. Griffiths and M. Steyvers, Finding scientific topics,, Proceeding of the National Academy of Sciences of the United States of America, 101 (2004), 5228. doi: 10.1073/pnas.0307752101. Google Scholar

[12]

X. He, T. Chen, M.-Y. Kan and X. Chen, Trirank: Review-aware explainable recommendation by modeling aspects,, in Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, (2015), 1661. Google Scholar

[13]

G. Heinrich, Parameter estimation for text analysis,, , (). Google Scholar

[14]

M. D. Hoffman, D. M. Blei and F. R. Bach, Online learning for latent dirichlet allocation,, in NIPS (eds. J. D. Lafferty, (2010), 856. Google Scholar

[15]

D. Jurafsky and J. H. Martin, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition,, 1st edition, (2000). Google Scholar

[16]

Q. V. Le and T. Mikolov, Distributed representations of sentences and documents,, CoRR, (). Google Scholar

[17]

D. D. Lee and H. S. Seung, Algorithms for non-negative matrix factorization,, in Advances in Neural Information Processing Systems 13 (eds. T. K. Leen, (2001), 556. Google Scholar

[18]

G. Linden, B. Smith and J. York, Amazon.com recommendations: Item-to-item collaborative filtering,, IEEE Internet Computing, 7 (2003), 76. doi: 10.1109/MIC.2003.1167344. Google Scholar

[19]

C. D. Manning and H. Schütze, Foundations of Statistical Natural Language Processing,, The MIT Press, (1999). Google Scholar

[20]

A. K. McCallum, Mallet: A machine learning for language toolkit, 2002,, , (). Google Scholar

[21]

T. Mikolov, I. Sutskever, K. Chen, G. Corrado and J. Dean, Distributed representations of words and phrases and their compositionality,, CoRR, (). Google Scholar

[22]

B. N. Miller, I. Albert, S. K. Lam, J. A. Konstan and J. Riedl, Movielens unplugged: Experiences with an occasionally connected recommender system,, in Proceedings of the 8th International Conference on Intelligent User Interfaces, (2003), 263. doi: 10.1145/604045.604094. Google Scholar

[23]

D. Z. Mária Bieliková Michal Kompan, Effective hierarchical vector-based news representation for personalized recommendation,, Computer Science and Information Systems, (): 303. Google Scholar

[24]

F. Ricci, L. Rokach and B. Shapira, Recommender Systems Handbook, chapter Introduction to Recommender Systems Handbook,, Springer US, (2011). Google Scholar

[25]

S. Tuarob, L. C. Pouchard and C. L. Giles, Automatic tag recommendation for metadata annotation using probabilistic topic modeling,, in Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries, (2013), 239. doi: 10.1145/2467696.2467706. Google Scholar

[1]

Jose Carlos Camacho, Maria de los Santos Bruzon. Similarity reductions of a nonlinear model for vibrations of beams. Conference Publications, 2015, 2015 (special) : 176-184. doi: 10.3934/proc.2015.0176

[2]

Carine Lucas, Antoine Rousseau. Cosine effect in ocean models. Discrete & Continuous Dynamical Systems - B, 2010, 13 (4) : 841-857. doi: 10.3934/dcdsb.2010.13.841

[3]

Tracy L. Stepien, Hal L. Smith. Existence and uniqueness of similarity solutions of a generalized heat equation arising in a model of cell migration. Discrete & Continuous Dynamical Systems - A, 2015, 35 (7) : 3203-3216. doi: 10.3934/dcds.2015.35.3203

[4]

Anis Theljani, Ke Chen. An augmented lagrangian method for solving a new variational model based on gradients similarity measures and high order regulariation for multimodality registration. Inverse Problems & Imaging, 2019, 13 (2) : 309-335. doi: 10.3934/ipi.2019016

[5]

A. Alamo, J. M. Sanz-Serna. Word combinatorics for stochastic differential equations: Splitting integrators. Communications on Pure & Applied Analysis, 2019, 18 (4) : 2163-2195. doi: 10.3934/cpaa.2019097

[6]

Randall Dougherty and Thomas Jech. Left-distributive embedding algebras. Electronic Research Announcements, 1997, 3: 28-37.

[7]

Xiaoming Yan, Ping Cao, Minghui Zhang, Ke Liu. The optimal production and sales policy for a new product with negative word-of-mouth. Journal of Industrial & Management Optimization, 2011, 7 (1) : 117-137. doi: 10.3934/jimo.2011.7.117

[8]

José Gómez-Torrecillas, F. J. Lobillo, Gabriel Navarro. Convolutional codes with a matrix-algebra word-ambient. Advances in Mathematics of Communications, 2016, 10 (1) : 29-43. doi: 10.3934/amc.2016.10.29

[9]

Andrea Natale, François-Xavier Vialard. Embedding Camassa-Holm equations in incompressible Euler. Journal of Geometric Mechanics, 2019, 11 (2) : 205-223. doi: 10.3934/jgm.2019011

[10]

Thais Bardini Idalino, Lucia Moura. Embedding cover-free families and cryptographical applications. Advances in Mathematics of Communications, 2019, 13 (4) : 629-643. doi: 10.3934/amc.2019039

[11]

Vassilis G. Papanicolaou, Kyriaki Vasilakopoulou. Similarity solutions of a multidimensional replicator dynamics integrodifferential equation. Journal of Dynamics & Games, 2016, 3 (1) : 51-74. doi: 10.3934/jdg.2016003

[12]

A. M. Vershik. Polymorphisms, Markov processes, quasi-similarity. Discrete & Continuous Dynamical Systems - A, 2005, 13 (5) : 1305-1324. doi: 10.3934/dcds.2005.13.1305

[13]

Boran Hu, Zehui Cheng, Zhangbing Zhou. Web services recommendation leveraging semantic similarity computing. Mathematical Foundations of Computing, 2018, 1 (2) : 101-119. doi: 10.3934/mfc.2018006

[14]

Palle Jorgensen, Feng Tian. Dynamical properties of endomorphisms, multiresolutions, similarity and orthogonality relations. Discrete & Continuous Dynamical Systems - S, 2019, 12 (8) : 2307-2348. doi: 10.3934/dcdss.2019146

[15]

Agust Sverrir Egilsson. On embedding the $1:1:2$ resonance space in a Poisson manifold. Electronic Research Announcements, 1995, 1: 48-56.

[16]

Filippo Gazzola. Critical exponents which relate embedding inequalities with quasilinear elliptic problems. Conference Publications, 2003, 2003 (Special) : 327-335. doi: 10.3934/proc.2003.2003.327

[17]

Françoise Demengel, Thomas Dumas. Extremal functions for an embedding from some anisotropic space, involving the "one Laplacian". Discrete & Continuous Dynamical Systems - A, 2019, 39 (2) : 1135-1155. doi: 10.3934/dcds.2019048

[18]

Xumin Jiang. Isometric embedding with nonnegative Gauss curvature under the graph setting. Discrete & Continuous Dynamical Systems - A, 2019, 39 (6) : 3463-3477. doi: 10.3934/dcds.2019143

[19]

Adam Bobrowski, Adam Gregosiewicz, Małgorzata Murat. Functionals-preserving cosine families generated by Laplace operators in C[0,1]. Discrete & Continuous Dynamical Systems - B, 2015, 20 (7) : 1877-1895. doi: 10.3934/dcdsb.2015.20.1877

[20]

Patrick Fischer. Multiresolution analysis for 2D turbulence. Part 1: Wavelets vs cosine packets, a comparative study. Discrete & Continuous Dynamical Systems - B, 2005, 5 (3) : 659-686. doi: 10.3934/dcdsb.2005.5.659

 Impact Factor: 

Metrics

  • PDF downloads (7)
  • HTML views (0)
  • Cited by (0)

[Back to Top]