This is a graduate-level class on Big Data Systems and Analytics. The syllabus covers a wide range of topics. The concrete course content may change every year to reflect cutting-edge technology and the state of the art in Big Data research and development.
You are encouraged to participate in class as much as feasible. In addition to attending, you are encouraged to ask questions, share your experiences with the class, and participate actively in class discussions. I value your input on course format, content coverage and composition, and evaluation criteria.
Course Structure: There are no required textbooks for this course. Course notes and course readings will form the central material for the course. Class readings will consist of materials either handed out in class or made available via the class WWW page. Material not presented in any readings or handouts may also be covered in class, so your attendance is important. The course will consist of two 75-minute lectures per week with weekly reading and homework assignments.
Homework Assignment
There are five homework assignments. For each assignment, you can choose either the reading based or the programming based homework. All five assignments are individual homework. Discussion is allowed, but you must produce the homework deliverable independently. You are encouraged to complete your assignment by incorporating your own experience with the issues you address. You can certainly use what you learned in class as background knowledge.
Programming based Homework
Programming homework is designed to help students gain a better understanding of the course materials through hands-on exercises. Each programming assignment typically consists of 1-3 coding or experimentation tasks, ranging from systems and applications to algorithms. For example, students may be asked to write a given program, or to extend an existing software package with some additional functionality. Students may also be asked to download a particular software package, run it on some given dataset(s), and report the execution results and a runtime performance comparison across several use case scenarios.
Reading based homework
Reading based homework is also designed to strengthen the understanding of the course materials by encouraging students to read reference materials. Each reading based assignment includes 3 subtasks: (1) Choose two papers on the same subject from the course reading list. (2) Read the two papers and understand how the two pieces of work relate to one another and how their concrete problems or solution approaches differ. (3) Write a reading critique of 1 page.
The reading critique should have three sections (at least one paragraph per section):
(1) Summary: Summarize the general problems the authors attempt to solve in the two papers and discuss the differences, if any. (2) Strong Points: Describe the strong points of the two papers in terms of solution approaches, such as the best technical contributions of the two papers that you enjoyed reading and learning about. (3) Suggestions and/or Weak Points: What problems do the authors claim to solve but fail to deliver as stated? Note any technical flaws and any suggestions you want to make.
Submissions of Homework:
You are required to submit your homework to TSquare by the due date. Late submissions by email to the TA or instructor are not accepted. Here are the guidelines used for grading:
Although discussion of assignments is allowed, verbatim copies of programming code or critiques will be considered cheating.
If text or code is copied from another source, you should credit the source by referencing it directly and correctly. Verbatim copying from uncredited sources is considered plagiarism. If a case of plagiarism or any other form of academic dishonesty is found, the parties involved will receive a zero score for the assignment. Repeat offenders will be referred to the Dean's office.
  Back to Ling Liu's home page