You are on page 1of 15

SEEM3550:

Fundamentals of Information Systems


Instructor: Jeffrey Yu, Room ERB 511B
TAs: Kangfei Zhao, Room ERB 801
Yikai Zhang, Room ERB 801
Jiao Su, Room ERB 801
Wei Shi, Room ERB 801
Ke Bi, Room ERB 801
Hao Zhang, Room ERB 711
Chengzhi Piao, Room ERB 512

Class Schedules: Tuesday 9:30 – 10:15, LSK LT2


Thursday 10:30 – 12:15, LSK LT1
BlackBoard (CUHK Elearning System)
https://blackboard.cuhk.edu.hk/webapps/login
All emails about the course should be sent to:
seem3550@se.cuhk.edu.hk Outline 1-1
Why do we need to learn this
course? Big Data Analytics
 “Big data is a collection of data sets so large and
complex.”
 Volume, Various, Velocity, Veracity, Value
 “Businesses, governments and society are only
starting to tap its vast potential.”
Big Data
 There are many different types of data: text
data, image data, audio data, video data, etc.
 The amount of data grows very fast.
 Zettabyte 1,000,000,000,000,000,000,000 byte
 Exabyte 1,000,000,000,000,000,000 byte
 Petabyte 1,000,000,000,000,000 byte
 Terrabyte 1,000,000,000,000 byte
 Gigabyte 1,000,000,000 byte
 Megabyte 1,000,000 byte
 Kilobyte 1,000 byte
 Byte 1 byte
large data processing 1-3
How to deal with massive data?
 We need to manage data.
 We need to process data.
 We need to get information
from data.

By cloudtimes.org, blog.kenexa.com, brightredfingermails.blogspot.com large data processing 1-4


Databases and its Management

Outline 1-5
Data Platforms Landscape Map 2014

Outline 1-6
Top Database Management Systems
http://db-engines.com/en/ranking
 Oracle (Relational DBMS)
 MySQL (Relational DBMS)
 Microsoft SQL Server (Relational DBMS)
 PostgresSQL (Relational DBMS)
 MongoDB (Document store)
 DB2 (Relational DBMS)
 Microsoft Access (Relational DBMS)
 Cassandra (Wide Column Store)
 Radis (Key-value Store)
 Elasticsearch (Search Engine)
Google Data Center

 Google Cloud SQL


allows users to use
relational database in
Google’s Cloud.
Taken from a talk given by Dr. Wen-Syan Li (VP of SAP)
Nov. 11 Singles Day

 It has become the largest online shopping


day in the world, with sales in Alibaba's
sites Tmall and Taobao.
 US$5.8 billion in 2013,
 US$9.3 billion in 2014,
 US$14.3 billion in 2015,
 US$17.8 billion in 2016,
 US$25.4 billion in 2017.
– Wikipedia.org
What Data Science Skills
Employers Want Now (2016)

 https://www.datanami.com/2016/01/07/what
-data-science-skills-employers-want-now
11
 Textbook:
Database System Concepts (sixth edition), by Avi
Silberschatz, Henry F. Korth, and S. Sudarshan,
McGraw-Hill, 2010.
 The textbook website is http://www.db-book.com
 The website includes slides, solutions to practice exercises.
 Try practice exercises by yourself!

 Assessment:
 Three Assignments (individual) 50%
• Assignment 1: SQL (18%)
• Assignment 2: Indexing/Hashing (16%)
• Assignment 3: Query Processing (16%) with possible
programming
 Final examination 50%
Outline 1-12
Tentative Schedule
Wk# Topic Chapter Assignment Due
1 Introduction/Relational Model 1/2
2 SQL 3
3 SQL 3/4
4 Relational Algebra 6
5 Storage and File Structure 10
6 Index and Hashing 11 Ass-1 (SQL)
7 Index and Hashing 11
8 Query Processing 12
9 Query Processing 12 Ass-2 (Index/Hash)
10 Query Optimization 13
11 Query Optimization 13
12 Entity-Relationship Model 7 Ass-3 (Querying)
13 Relational Database Design 8

March 29 to April 4 is reading week. No classes! Outline 1-13


SEEM Program Objectives
1) The ability to apply knowledge of mathematics, science, and engineering appropriate to
the degree discipline
2) The ability to design and conduct experiments, as well as to analyze and interpret data
3) The ability to design a system, component, or process to meet desired needs within
realistic constraints, such as economic, environmental, social, political, ethical, health
and safety, manufacturability and sustainability
4) The ability to function in multi-disciplinary teams
5) The ability to identify, formulate, and solve engineering problems
6) The understanding of professional and ethical responsibility
7) The ability to communicate effectively
8) The ability to understand the impact of engineering solutions in a global and societal
context, especially the importance of health, safety and environmental considerations
to both workers and the general public
9) The ability to stay abreast of contemporary issues
10) The ability to recognize the need for, and to engage in life-long learning
11) The ability to use the techniques, skills, and modern engineering tools necessary for
engineering practice appropriate to the degree discipline
12) The ability to use the computer/IT tools relevant to the discipline along with an
understanding of their processes and limitations
13) The ability to apply the skills relevant to the discipline of operations research and
information technology and their applications in engineering and managerial decision
making, especially in financial services, logistics and supply chain management, business
information systems, and service engineering and management
Important Things
 Student/Faculty Expectations on Teaching and
Learning (http://star.erg.cuhk.edu.hk/upload/StaffStudentExpectations.pdf)
 Departmental Guideline for Plagiarism
 If a student is found plagiarizing, his/her case will be reported to the
Department Examination Panel. If the case is proven after deliberation, the
student will automatically fail the course in which he/she committed
plagiarism.
 The definition of plagiarism includes copying of the whole or parts of written
assignments, programming exercises, reports, quiz papers, mid-term
examinations and final examinations.
 The penalty will apply to both the one who copies the work and the one whose
work is being copied, unless the latter can prove his/her work has been copied
unwittingly.
 Furthermore, inclusion of others' works or results without citation in
assignments and reports is also regarded as plagiarism with similar penalty to
the offender.
 A student caught plagiarizing during tests or examinations will be reported to
the Faculty office and appropriate disciplinary authorities for further action,
in addition to failing the course.
Outline 1-15

You might also like