登入

課程

學年/學期 Academic Year/Semester 100 學年 第 2 學期
課程名稱 Course Name
753863-001
(中 Ch.)網路搜索與探勘
(英 Eng.)Web Search and Mining
授課教師 Instructor 蔡銘峰
修別 Type of Credit 選修 學分數No. of Credits 3.0
備註 Note
N/A
課程目標 Course objectives
The goal of this course is: 1) to provide an overview of Web Search and Mining related research, 2) to systematically review the core research topics in the field, 3) to show case the most recent research progress, and 4) to give students enough training for doing research in the field and an opportunity to work on a research project.
課程大綱 Course Description
Part I: Web Search
• Evaluation
• Retrieval Model
• Language Model
• Link Analysis
• Web Crawling

Part II: Web Mining
• Classification
• Clustering
• Learning to Rank

Part III: Data-Intensive Information Processing
• Introduction to MapReduce
• MapReduce: the Programming Environment
教學方式 Teaching approach
The course will involve lectures by instructor, student presentations, and research projects on major research topics in Web Search and Mining related research. Students are expected to read quite a few research papers and present some of them at the class. There will be a midterm and a few assignments. Students are also required to finish a course project (group work is allowed and encouraged).
每週課程進度與作業要求【請詳述每週課程內容/授課方式與學生預習內容/學習活動/課後作業】
1. (1 week) Introduction: Goals and history of Web Search and Mining; IR vs. Web Search; DM vs. Web Mining.
2. (2 weeks) Web Search 1 - Ranking Evaluation; Probabilistic Information Retrieval
3. (2 weeks) Web Search 2 - Language Model for Information Retrieval
4. (2 weeks) Web Search 3 - Processing Text: Text statistics; Link Analysis
5. (1 week) Web Search 4 - Web Crawling
6. (1 week) Web Mining 1 - Classification and Naive Bayes
7. (2 weeks) Web Mining 2 - Supported Vector Machines; K Nearest Neighbor
8. (2 weeks) Web Mining 3 - Clustering: Flat clustering and Hierarchical clustering
9. (1 weeks) Web Mining 4 - Clustering: K-Means Clustering; Clustering and Search
10. (2 weeks) Data-Intensive Information Processing - Overview of Cloud Computing; Map Reduce; Hadoop
評量工具與策略、評分標準 Evaluation Criteria
Grading will be based on the following weighting scheme:
• Class participation: 10%
• Assignments: 30%
• Midterm exam: 30%
• Project: 30%
教學助理基本資料 Teaching assistant tasks
Grade assignments; Prepare assignments; Answer Questions
指定/參考書目 Textbook & references
(為維護智慧財產權,請務必使用正版書籍)
• Introduction to Information Retrieval, by C. Manning, P. Raghavan, and H. Schütze.
• Search Engines: Information Retrieval in Practice, by Bruce Croft, Donald Metzler, Trevor Strohman.
• Data-Intensive Text Processing with MapReduce, by Jimmy Lin and Chris Dyer.
• Hadoop: The Definitive Guide, by Tom White.
課程相關連結 Course related links
N/A
本課程附件 Course attachments
N/A
課程進行中,是否禁止使用智慧型手機、平板等隨身設備。

需經教師同意始得使用



 
學生自評核心能力填答率: 25% (4/16)
能力項目說明:
A.培養邏輯推理、獨立思考與創新能力 B.理解自然科學與數位科技
C.培養團隊合作的能力 D.具備有效的溝通表達能力
E.養成終身學習與自我提升能力 F.了解當代主題意識與具有國際視野
G.具有專業及道德責任的認知 H.具有藝術涵養
I.能接受多元文化與培養人文關懷之精神 J.擁有公民素養並能履行公民責任