A Recommender System Based on the Alternating Least Squares Collaborative Filtering Algorithm

Acknowledgements
Abstract (Chinese)
ABSTRACT
1 Introduction
1.1 Background
1.2 Problem Definition
1.3 Research Goals
1.4 Research Contributions
1.5 Dissertation Outline
2 Related Work
2.1 Content-Based Recommender System
2.1.1 Pros and Cons of Content-Based Filtering
2.1.2 Examples of Content-Based Filtering Systems
2.2 Collaborative Filtering
2.2.1 Fundamental Assumptions of Collaborative Filtering
2.2.2 Examples of Collaborative Filtering Based Recommender Systems
2.3 Memory-Based Collaborative Filtering
2.3.1 User-Based Collaborative Filtering
2.3.2 Item-Based Collaborative Filtering
2.4 Model-Based Collaborative Filtering
2.4.1 Matrix Factorization-Based Algorithm
2.4.2 Clustering-Based Algorithm
2.5 Hybrid Recommender System
3 Applying Alternating Least Squares in Movie Recommender
3.1 Limitations of Existing CF Based Algorithms
3.2 Motivation
3.2.1 Scalability
3.2.2 Prediction Speed
3.2.3 Avoidance of Overfitting
3.3 Matrix Factorization
3.4 Alternating Least Squares (ALS)
3.5 Proposed Implementation
3.6 Predictive Accuracy Metrics
3.7 Alternating Least Squares Pseudocode
4 Big Data Technologies
4.1 Spark and Its Core Technology
4.1.1 Spark SQL
4.1.2 MLlib
4.1.3 GraphX
4.1.4 Spark Streaming
4.2 Resilient Distributed Datasets (RDDs)
4.3 Spark Development Language Support
4.4 Spark Deployment Options
4.5 Spark Storage Options
4.6 Spark API Overview
5 Experiments & Results
5.1 Experimental Dataset
5.2 Experimental Environment
5.3 Cluster Details
5.3.1 Cluster Specifications
5.3.2 Hadoop Distributed File System (HDFS)
5.4 Building a Spark Cluster
5.4.1 Install Spark on Master Node
5.4.2 Install Spark on Slaves (Worker Nodes)
5.4.3 Spark Web UI
5.5 Experimental Results
6 Conclusion
References
Author Profile and Research Achievements Obtained during the Study for a Master's / Doctoral Degree
Dataset for the Master's Thesis