CSC4170 Web Intelligence and Social Computing

[ Discussion Forum | Blogs ]

Breaking News

http://spreadsheets.google.com/ccc?key=0ApPqLXd5MIzkdGIwOGJsOXlsbHE1Mnd1QXA1eUc3Unc&hl=en

Extra Credit Assignments

2009-10 Term 1

| | Lecture I | Lecture II | Tutorial I | Tutorial II |
| Time | M5, 12:30 pm - 1:15 pm | T3-T4, 10:30 am - 12:15 pm | T11, 6:30 pm - 7:15 pm | TBA |
| Venue | ERB 706 | ERB 408 | SHB 507 | TBA |

The Golden Rule of CSC4170: No member of the CSC4170 community shall take unfair advantage of any other member of the CSC4170 community.

Course Description

This course introduces fundamental and applied computational techniques for studying the collaborative and collective intelligence that emerges from group behaviour on the Internet. Topics include, but are not limited to: web intelligence, web data mining, knowledge discovery on the web, web analytics, web information retrieval, learning to rank, ranking algorithms, relevance feedback, collaborative filtering, recommender systems, human/social computation, social games, opinion mining, sentiment analysis, models and theories about social networks, large graph and link-based algorithms, social marketing, monetization of the web, and security/privacy issues related to web intelligence and social computing.

Learning Objectives

Learning Outcomes

Learning Activities

  1. Lectures
  2. Tutorials
  3. Web resources
  4. Videos
  5. Quizzes
  6. Examinations

Personnel

| | Lecturer | Tutor | Tutor |
| Name | Irwin King | Tom Chao Zhou | Xin Xin |
| Email | king AT cse.cuhk.edu.hk | czhou AT cse.cuhk.edu.hk | xxin AT cse.cuhk.edu.hk |
| Office | Rm 908 | Room 114A | Room 101 |
| Telephone | 2609 8398 | | |
| Office Hour(s) | M10, Monday 4:30 to 5:30; T3, Tuesday 10:30 to 11:30 | Tuesday 15:30 to 16:30 | Tuesday 15:30 to 16:30 |

Note: This class will be taught in English. Homework assignments and examinations will be conducted in English.

Syllabus

The PDF files were created in Acrobat 6.0. Please obtain a compatible version of Acrobat Reader from Adobe.

Week Date Topics Tutorials Homework & Events Resources
1 7/9 Introduction to Web Intelligence and Social Computing
Web 2.0

01-Introduction.pdf
Tim O'Reilly on Web 2.0, The Economist, 20/3/2009
2 14/9 Introduction to Web Intelligence and Social Computing

Podcast090914-01
Regular Expressions Web Crawler Introduction to Social Networks
3 21/9 Social Networks-Theory
Graph Theory

02-SNA.pdf OLD!
02-SNA-01.pdf OLD!
02-SNA-02.pdf NEW!
Graph Visualization HW #1
(Due on or before 6:30 pm, Friday, 2 October, 2009)
grading-asgn1
SWT Theory
4 28/9 Graph Mining

03-GraphMining-01.pdf

Podcast090928-01
Podcast090929-01
Graph Mining Algorithms
hits.ppt
Generating Random Graphs
The Clique Algorithm
5 5/10 Link Analysis

04-LinkAnalysis-01.pdf NEW!

Podcast091006-01
PageRank, HITS, etc. HW #2
hw2 sample answer
grading-asgn2
HW Programming #1 HW Programming #1 Testcases

(Due on or before 6:30 pm, Monday, 19 October, 2009)
grading-programming
Introduction to Information Retrieval
6 12/10 Learning to Rank

05-Learning2Rank-01.pdf OLD!
05-Learning2Rank-02.pdf NEW!

Podcast091012-01
Podcast091013-01
PageRank Project Specification
Movie Dataset
7 19/10 Recommender Systems I

06-Recommender-01.pdf
Evaluation Methods
8 26/10 Recommender Systems II
Query Expansion

CIKM2008 Query Suggestion
QF/IQF HW #3
(Due: Monday, 23 November, 18:30)
9 2/11 Human Computation/Social Games

07-HumanComputation-01.pdf NEW!
humancomputation.ppt Guest Speaker
10 9/11 Crowdsourcing language model
11 16/11 Q&A
Virtual Communities

CSC4170-08-QandA.pdf
Wikis, Blogs, etc. HW #4
(Due: Friday, 4 December 2009, 18:30)
gradings asgn4
hw4 sample answer
12 23/11 Privacy and Security of Information
Education, Policy

09-Security.pdf NEW!
13 30/11 Wrap Up

Project Presentations
EduTech on Social Computing in Education

Class Project

Class Project Presentation Schedule

  1. TING KAM CHEUNG & MA MING CHAO
  2. YANG NGAI KEUNG
  3. YAU MING HIU & CHOW TSZ YEUNG
  4. LAM KA LOK
  5. TUNG WO HOU
  6. WONG YUK KI & TO KA CHUN
  7. LI WAI WA & TSO XIN
  8. TSANG HO KWAN & TANG CHI CHIU
  9. ZACK BUSH
  10. HO CHUN KIU
  11. LEE WING HUNG

Class Project Presentation Requirements

  1. Each group has 15 minutes in total: 12 minutes for the talk and 3 minutes for Q&A. The presentations will follow the order above. Because the class will run until all presentations are finished, let us know if your slot is unsuitable and we can change the order.
  2. The presentation does not include a demo. The demo is a separate process in two parts. The first part will be held during tutorial time on Dec. 1st, when every group should demo its program to the two tutors, who will advise you on revising it. The second part will be held on Wednesday, Dec. 16th, when Prof. King will check your program before the final submission of your code.
  3. Groups implementing graph algorithms should explain one algorithm in as much detail as possible in the presentation, giving an example with the structure of nodes, values, and your calculations. You also need to analyze the complexity of your algorithms and test whether they can be applied to large graphs. Other groups should focus on three aspects: the motivation for your idea, the detailed algorithms, and the justification of your methods compared with naive methods through experiments.

Examination Matters

Examination Schedule

| | Time | Venue | Notes |
| Midterm Examination (Written) | TBD | TBD | TBD |
| Midterm Examination (Programming) | TBD | TBD | TBD |
| Final Examination | 9/12/2009 (Wed.), 9:30 am to 11:30 am | Room 103, John Fulton Centre | The final examination covers all materials presented in the class. |

Written Midterm Matters

  1. The midterm will test your knowledge of the materials.
  2. Answer all questions in the answer booklet. Extra booklets will be available at the venue if needed.
  3. Write legibly. Anything we cannot decipher will be considered incorrect.

Grade Assessment Scheme

| Homework Assignments | Project Report | Project Presentation | Final Examination |
| 20% | 20% | 10% | 50% |

  1. Assignments (20%)
    1. Written assignment
    2. Optional quizzes
  2. Project (30%)
    1. Report (20%)
    2. Presentation (10%)
  3. Final Examination (50%)
  4. Extra Credit (There is no penalty for not doing the extra credit problems. Extra credit will only help you in borderline cases.)
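
As a rough illustration of how the weights above combine, the following Python sketch computes a weighted course total. The function name `course_total`, the 0-100 score scale, and the sample scores are assumptions made for illustration and are not part of the official scheme. Extra credit is left out because it is only said to help in borderline cases.

```python
# Weights taken from the Grade Assessment Scheme above (they sum to 100%).
WEIGHTS = {
    "assignments": 0.20,
    "project_report": 0.20,
    "project_presentation": 0.10,
    "final_examination": 0.50,
}


def course_total(scores):
    """Return the weighted total for component scores on a 0-100 scale.

    `scores` is a hypothetical dict keyed like WEIGHTS. How raw marks are
    actually scaled or curved is not specified in this document.
    """
    missing = set(WEIGHTS) - set(scores)
    if missing:
        raise ValueError(f"missing components: {sorted(missing)}")
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


# Made-up sample scores, for illustration only.
example = {
    "assignments": 80,
    "project_report": 70,
    "project_presentation": 90,
    "final_examination": 60,
}
print(round(course_total(example), 2))
```

Because the final examination carries half of the weight, a 10-point change in the examination score moves the total by 5 points, while the same change in the presentation score moves it by only 1 point.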

Required Background

  1. Pre-requisites
    1. CSC 1110 or CSC 1130, or its equivalent. (Not for students who have taken CSC 2520.)

Reference Books

Book Sources

  1. Academic & Professional Book Centre, 1H Cheong Ming Bldg., 80-86 Argyle St., Kowloon, 2398-2191, 2391-7430 (fax)
  2. Caves Books (H. K.), 4B Ferry St., G/F., Yaumatei, Kowloon, 2780-0987, 2771-2298
  3. Man Yuen Book Company, 45 Parkes Street, Jordan Road, Kowloon, Hong Kong, 2366-0594. Not very large; Asian edition books, fair prices, wide range, some at a 10% discount.
  4. Swindon Book Co. Ltd, 13-15 Lock Road, Tsim Sha Tsui, Kowloon, 2366-8001. One of the largest book stores in Hong Kong; the exchange rate is not favorable.
  5. Hongkong Book Centre, 522-7064. A branch of the Swindon book shop.

FAQ

  1. Q: What is the departmental guideline on plagiarism?
    A: If a student is found plagiarizing, his/her case will be reported to the Department Discipline Committee. If the case is proven after deliberation, the student will automatically fail the course in which he/she committed plagiarism. The definition of plagiarism includes copying of the whole or parts of written assignments, programming exercises, reports, quiz papers, mid-term examinations. The penalty will apply to both the one who copies the work and the one whose work is being copied, unless the latter can prove his/her work has been copied unwittingly. Furthermore, inclusion of others' works or results without citation in assignments and reports is also regarded as plagiarism with similar penalty to the offender. A student caught plagiarizing during tests or examinations will be reported to the Faculty Office and appropriate disciplinary authorities for further action, in addition to failing the course.
  2. Q: What is ACM ICPC?
    A: The Association for Computing Machinery International Collegiate Programming Contest. Teams from CUHK have done quite well in previous years. More information on the CSE programming team can be found at http://www.cse.cuhk.edu.hk/~acmprog.
  3. Q: What are some of the common mistakes made in online and real-time contests?
    A: There are a few common mistakes. Please check out this site for more information.

Resources

Social Networks-Theory Graph Theory

http://www.cs.purdue.edu/homes/neville/courses/aaai08-tutorial.html
http://cs.stanford.edu/people/jure/icml09networks/
http://www.ofcom.org.uk/advice/media_literacy/medlitpub/medlitpubrss/socialnetworking/report.pdf

Graph Mining

http://www.cs.cmu.edu/~deepay/mywww/papers/csur06.pdf
http://cs.stanford.edu/people/jure/talks/www08tutorial/
http://www.xifengyan.net/tutorial/KDD08_graph_partI.pdf
http://www.xifengyan.net/tutorial/KDD08_graph_partII.pdf

Link Analysis

http://analytics.ijs.si/events/Tutorial-TextMiningLinkAnalysis-KDD2007-SanJose-Aug2007/
http://www.sigkdd.org/explorations/issues/7-2-2005-12/1-Getoor.pdf
http://www.ncjrs.gov/pdffiles1/nij/grants/219552.pdf
http://delab.csd.auth.gr/~dimitris/papers/ENVO07LARskm.pdf

Learning to Rank

http://www2009.org/pdf/T7A-LEARNING%20TO%20RANK%20TUTORIAL.pdf
http://radlinski.org/papers/LearningToRank_NESCAI08.pdf
http://www.aclweb.org/anthology/P/P09/P09-5005.pdf
http://www.cse.iitb.ac.in/~soumen/doc/www2007/TutorialSlides.pdf

Recommender Systems

http://en.wikipedia.org/wiki/Recommender_system
http://www.deitel.com/ResourceCenters/Web20/RecommenderSystems/RecommenderSystemsTutorialsandWebcasts/tabid/1313/Default.aspx
http://www.computer.org/portal/web/csdl/doi/10.1109/TKDE.2005.99
http://www.springerlink.com/content/n881136032u8k111/
http://www.csd.abdn.ac.uk/~jmasthof/Publications/WPRSIUI07.pdf

Q & A

http://lml.bas.bg/ranlp2005/tutorials/magnini.ppt
http://tcc.itc.it/research/textec/topics/question-answering/Tut-Prager.ppt
http://en.wikipedia.org/wiki/Question_answering
http://trec.nist.gov/pubs/trec9/papers/webclopedia.pdf
http://domino.watson.ibm.com/library/CyberDig.nsf/papers/D12791EAA13BB952852575A1004A055C/$File/rc24789.pdf
http://www.umiacs.umd.edu/~jimmylin/publications/Lin_Katz_EACL2003_tutorial.pdf
http://answers.yahoo.com/
http://zhidao.baidu.com/
http://wenda.tianya.cn/wenda/
http://hk.knowledge.yahoo.com/

Human Computation/Social Games

http://www.gwap.com/gwap/
http://www.cs.cmu.edu/~biglou/

Opinion Mining/Sentiment Analysis

http://www.cs.uic.edu/~liub/FBS/opinion-mining-sentiment-analysis.pdf
http://www.cs.cornell.edu/home/llee/omsa/omsa-published.pdf
http://www.cs.cmu.edu/~wcohen/10-802/sentiment-sep-4.ppt

Visualization

Programming