
Known conditions:
Set C: users
Set S: items that can be recommended
Utility function u: measures the usefulness of item s to user c
Matrix R: user-item rating matrix


For each user c∈C, choose s'∈S that maximizes the user's utility.
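
Stated as a formula (a restatement of the sentence above, assuming that u assigns each user-item pair a value from a totally ordered set such as the rating scale):

```latex
\forall c \in C, \quad s'_c = \arg\max_{s \in S} u(c, s)
```

In the example that follows, the known values of u are the entries of R, so the task reduces to estimating u for the user-item pairs that have no rating yet.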


C = {Alice, Bob, Cindy, David};
S = {K-PAX, Life of Brian, Memento, Notorious};
The recommender system predicts the unknown ratings (marked Φ) in the matrix R.

         K-PAX   Life of Brian   Memento   Notorious
Alice    4       3               2         4
Bob      Φ       4               5         5
Cindy    2       2               4         Φ
David    3       Φ               5         2
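
One simple way to fill in the Φ entries is a user-based neighborhood heuristic. The sketch below is only an illustration, not a method taken from these notes: it assumes cosine similarity over co-rated items as the user-user weight and a similarity-weighted average as the prediction. The names `similarity` and `predict` are made up for this example.

```python
import math

users = ["Alice", "Bob", "Cindy", "David"]
items = ["K-PAX", "Life of Brian", "Memento", "Notorious"]

# None stands in for the unknown rating Φ.
R = [
    [4, 3, 2, 4],
    [None, 4, 5, 5],
    [2, 2, 4, None],
    [3, None, 5, 2],
]

def similarity(a, b):
    """Cosine similarity over the items that both users have rated."""
    pairs = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not pairs:
        return 0.0
    dot = sum(x * y for x, y in pairs)
    norm_a = math.sqrt(sum(x * x for x, _ in pairs))
    norm_b = math.sqrt(sum(y * y for _, y in pairs))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def predict(R, c, s):
    """Similarity-weighted average of the other users' ratings of item s."""
    num = den = 0.0
    for other, row in enumerate(R):
        if other == c or row[s] is None:
            continue
        w = similarity(R[c], row)
        num += w * row[s]
        den += w
    return num / den if den else None

# Predict every (user, item) pair that has no rating in R.
predictions = {
    (users[c], items[s]): predict(R, c, s)
    for c, row in enumerate(R)
    for s, r in enumerate(row)
    if r is None
}
for (user, item), value in predictions.items():
    print(f"{user} / {item}: {value:.2f}")
```

Because all ratings are positive, every weight is positive and each prediction stays between the lowest and highest rating that the other users gave to the item.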


  1. Recommendation system (RS):
  2. Heuristic-based RS: Defines the utility function and validates its performance empirically.
  3. Model-based RS: Estimates the utility function by optimizing a performance criterion, such as the mean squared error.
  4. Content-based RS: The user is recommended items similar to the ones the user preferred in the past.
    1. Limited Content Analysis
    2. Overspecialization
    3. New User Problem
  5. Collaborative RS: The user will be recommended items that people with similar tastes and preferences liked in the past.
    1. New User problem
    2. New Item problem
    3. Sparsity
  6. Hybrid RS: These methods combine collaborative and content-based methods.
  7. Utility function: a function that measures the usefulness of item s to user c. In a content-based system, the utility function is defined as:
    u(c, s) = score(ContentBasedProfile(c), Content(s)).
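
The content-based utility in item 7 can be sketched as follows. This is a minimal illustration under stated assumptions: cosine similarity is chosen as the score, the profile is the average of the content vectors of past liked items, and the genre weights are invented for the example. None of these choices come from the notes.

```python
import math

def cosine(p, q):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * q.get(k, 0.0) for k, w in p.items())
    norm_p = math.sqrt(sum(w * w for w in p.values()))
    norm_q = math.sqrt(sum(w * w for w in q.values()))
    return dot / (norm_p * norm_q) if norm_p and norm_q else 0.0

def content_based_profile(liked_items, content):
    """Average the content vectors of the items the user preferred in the past."""
    profile = {}
    for s in liked_items:
        for k, w in content[s].items():
            profile[k] = profile.get(k, 0.0) + w / len(liked_items)
    return profile

def utility(profile, item_content):
    """u(c, s) = score(ContentBasedProfile(c), Content(s)), with cosine as the score."""
    return cosine(profile, item_content)

# Made-up genre weights, for illustration only.
content = {
    "K-PAX": {"sci-fi": 1.0, "drama": 0.5},
    "Life of Brian": {"comedy": 1.0},
    "Memento": {"thriller": 1.0, "drama": 0.5},
    "Notorious": {"thriller": 1.0, "romance": 0.5},
}

# A hypothetical user who liked only Memento.
profile = content_based_profile(["Memento"], content)
ranked = sorted(
    (s for s in content if s != "Memento"),
    key=lambda s: utility(profile, content[s]),
    reverse=True,
)
print(ranked)
```

The ranking also shows the overspecialization problem from item 4: an item that shares no features with the profile gets a utility of zero, however good it is.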



Dimensionality Reduction

    1. Abstract: Presents two experiments that reduce the dimensionality of recommender system databases. One compares the effectiveness of the system at predicting consumer preferences; the other compares its effectiveness at producing Top-N lists. The experiments suggest that dimensionality reduction has the potential to meet the challenges of recommender systems.
    2. Explore :
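
A rough sketch of what dimensionality reduction on a rating matrix can look like, using NumPy's SVD on the example matrix R from earlier in these notes. The preprocessing here (filling unknowns with the item mean, then subtracting the user mean) and the choice k=2 are assumptions made for the illustration, not a reproduction of the experiments summarized above. `low_rank_predict` is a hypothetical helper name.

```python
import numpy as np

# Example matrix R; np.nan stands in for the unknown rating Φ.
R = np.array([
    [4, 3, 2, 4],
    [np.nan, 4, 5, 5],
    [2, 2, 4, np.nan],
    [3, np.nan, 5, 2],
])

def low_rank_predict(R, k):
    """Fill unknowns with item means, then keep the top-k singular directions."""
    known = ~np.isnan(R)
    item_mean = np.nanmean(R, axis=0)
    filled = np.where(known, R, item_mean)          # column means broadcast over rows
    user_mean = filled.mean(axis=1, keepdims=True)  # per-user offset
    U, sigma, Vt = np.linalg.svd(filled - user_mean, full_matrices=False)
    approx = (U[:, :k] * sigma[:k]) @ Vt[:k, :]     # rank-k reconstruction
    return approx + user_mean

R_hat = low_rank_predict(R, k=2)
print(np.round(R_hat, 2))
```

The entries of `R_hat` at the Φ positions serve as predicted ratings. With k equal to the full rank, the function simply returns the mean-filled matrix, so the reduction only has an effect for smaller k.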



Collaborative filtering system

Heuristic-based Methods

Model-based Methods

  1. Learning collaborative information filters (ICML, 1998) (pdf)
    1. Abstract: Models the collaborative filtering task as a classification or regression problem in machine learning and applies SVD to reduce the dimensionality. The training data, a sparse matrix of user ratings, is converted to Boolean feature vectors, resulting in a matrix filled with zeros and ones.
    2. Explore :
  2. Empirical analysis of predictive algorithms for collaborative filtering (UAI, 1998) (pdf)
    1. Abstract: Uses cluster models and Bayesian network models. The model parameters are learned from the training data and then used to predict the unknown ratings.
       Naive Bayes: \Pr(C = c, v_1, v_2, \ldots, v_n) = \Pr(C = c) \prod_{i=1}^{n} \Pr(v_i \mid C = c)
    2. Explore :
  3. Using Probabilistic Relational Models for Collaborative Filtering (WebKDD 1999)(pdf)
    1. Abstract: Applies probabilistic relational models (PRMs), which are similar to entity-relationship models, to collaborative filtering.
    2. Explore :
  4. Eigentaste: A Constant Time Collaborative Filtering Algorithm (Information Retrieval Journal, 2001)(pdf)
    1. Abstract: Selects a gauge set of items (every valid user has rated all items in the gauge set) and applies PCA for dimensionality reduction, then clusters the users. A new user is assigned to the corresponding cluster and receives recommendations based on it.
    2. Explore : 1. The clustering method does not fit the multi-model case. 2. The gauge set may be hard to select. 3. When the gauge set is small, the recommendations are not accurate.
  5. Collaborative Filtering via Gaussian Probabilistic Latent Semantic Analysis (SIGIR, 2003)(pdf)
    2. Explore :
  6. Modeling User Rating Profiles for Collaborative Filtering (NIPS, 2003)(pdf)
    2. Explore :
  7. A Maximum Entropy Approach to Collaborative Filtering in Dynamic, Sparse, High-Dimensional Domains (NIPS, 2002)(pdf)
    2. Explore :
  8. Clustering Methods for Collaborative Filtering (Technical Report, 1998)(pdf)
    2. Explore :
  9. Probabilistic Matrix Factorization (PMF) (NIPS, 2008)(pdf): Introduction
    1. Abstract: PMF applies a probabilistic approach, with Gaussian assumptions on the known data and the factor matrices, to factor the rating matrix and predict the missing values. Experimental results show that PMF performs quite well.
    2. Superiority: Scales linearly with the number of observations and performs well on large, sparse, and imbalanced datasets.
  10. Bayesian probabilistic matrix factorization using Markov chain Monte Carlo (ICML, 2008)(pdf)
    1. Abstract: Presents a fully Bayesian treatment of the PMF model in which model capacity is controlled automatically by integrating over all model parameters and hyperparameters.
    2. Superiority: Higher prediction accuracy than PMF models.
  11. SoRec: Social Recommendation Using Probabilistic Matrix Factorization (CIKM, 2008)(pdf)
    2. Superiority: Scales linearly and performs well on large, sparse, and imbalanced datasets.
  12. Learning to Recommend with Social Trust Ensemble (SIGIR, 2009)(pdf)
    2. Superiority: Scales linearly and performs well on large, sparse, and imbalanced datasets.
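
For entry 9 in the list above, the Gaussian assumptions lead to a regularized squared-error objective. The symbols are introduced here for the formula and are not defined elsewhere in these notes: U_i and V_j are the latent factor vectors of user i and item j, I_{ij} is 1 if R_{ij} is known and 0 otherwise, and \lambda_U, \lambda_V are regularization weights determined by the noise and prior variances. Maximizing the log-posterior over U and V is then equivalent to minimizing:

```latex
E = \frac{1}{2} \sum_{i} \sum_{j} I_{ij} \left( R_{ij} - U_i^{\top} V_j \right)^2
    + \frac{\lambda_U}{2} \sum_{i} \lVert U_i \rVert^2
    + \frac{\lambda_V}{2} \sum_{j} \lVert V_j \rVert^2
```

Only the known ratings enter the first sum, which is why the cost of one pass over the data grows linearly with the number of observations.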

Content-based Methods


PMF: Training PMF (MATLAB)


  1. MovieLens :
  2. MovieData : Sample data that contains triplets (user_id, movie_id, rating).
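
For readers without MATLAB, a small Python sketch of PMF-style training on (user_id, movie_id, rating) triplets follows. It is not a port of the MATLAB code referenced here. The assumptions are: plain stochastic gradient descent on the regularized squared error, a global-mean offset, 0-based ids, and the 13 known ratings of the example matrix R as stand-in data. The function names and hyperparameter values are hypothetical.

```python
import random

# (user_id, movie_id, rating) triplets: the known ratings of the example
# matrix R, with 0-based ids (stand-in data for this sketch).
triplets = [
    (0, 0, 4), (0, 1, 3), (0, 2, 2), (0, 3, 4),
    (1, 1, 4), (1, 2, 5), (1, 3, 5),
    (2, 0, 2), (2, 1, 2), (2, 2, 4),
    (3, 0, 3), (3, 2, 5), (3, 3, 2),
]

def predict(U, V, mean, u, i):
    """Predicted rating: global mean plus the inner product of the factors."""
    return mean + sum(a * b for a, b in zip(U[u], V[i]))

def rmse(data, U, V, mean):
    """Root mean squared error over a list of triplets."""
    return (sum((r - predict(U, V, mean, u, i)) ** 2 for u, i, r in data) / len(data)) ** 0.5

def train_pmf(data, n_users, n_items, d=2, lr=0.02, lam=0.05, epochs=1000, seed=0):
    """SGD on squared error with L2 penalties (MAP estimate under Gaussian priors)."""
    rng = random.Random(seed)
    U = [[rng.gauss(0, 0.1) for _ in range(d)] for _ in range(n_users)]
    V = [[rng.gauss(0, 0.1) for _ in range(d)] for _ in range(n_items)]
    mean = sum(r for _, _, r in data) / len(data)
    data = list(data)  # local copy, so shuffling does not touch the caller's list
    for _ in range(epochs):
        rng.shuffle(data)
        for u, i, r in data:
            err = r - predict(U, V, mean, u, i)
            for f in range(d):
                uf, vf = U[u][f], V[i][f]
                U[u][f] += lr * (err * vf - lam * uf)
                V[i][f] += lr * (err * uf - lam * vf)
    return U, V, mean

U, V, mean = train_pmf(triplets, n_users=4, n_items=4)
print(f"training RMSE: {rmse(triplets, U, V, mean):.3f}")
for u, i in [(1, 0), (2, 3), (3, 1)]:  # the three unknown entries of R
    print(f"user {u}, movie {i}: {predict(U, V, mean, u, i):.2f}")
```

On a real dataset such as MovieLens, the same loop would run over the training triplets only, with the error measured on a held-out validation split to pick d, lr, and lam.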


