CITS3401 Data Warehousing
- 6 points
Availability Location Mode Semester 1 UWA (Perth) Face to face
- Details for undergraduate courses
- Level 3 core unit in the Data Science major sequence
- The area of knowledge for this unit is Mathematical and Physical Sciences
- Category B broadening unit for students
- Level 3 elective
- Relational databases are the backbones of modern businesses in processing transactions and storing customer data. Most organisations usually deploy several relational databases for operational convenience. It is quite often necessary to integrate the information existing in different relational databases for planning and decision making. Data warehouses are built to facilitate planning and decision making in businesses integrating data from different relational databases. Online analytical processing (OLAP) is a technology that uses a data warehouse for answering aggregation queries often used in planning. While relational databases hold important transactional information of a business, the success of a business quite often depends on advanced planning and development of strategies based on customer behaviour. Data mining technologies are used for discovering such patterns and trends in data stored in relational databases. This unit introduces the key mechanisms in data warehousing and OLAP. It discusses logical and physical design of data warehouses including star schema, snowflake schema, data marts, partitioning and materialised views. Students study the use of data warehouses through a study of the OLAP technology including the multidimensional OLAP (MOLAP) and relational OLAP (ROLAP) architectures, OLAP operations and structured query language (SQL) support for OLAP.
- Students are able to (1) understand that discovering and extracting knowledge from a massive amount of data is a key problem in many scientific and business disciplines; (2) demonstrate a thorough understanding of key data warehousing and online analytical processing technologies; and (3) apply key data warehousing concepts in designing solutions for business data analytics.
- Typically this unit is assessed in the following ways: (1) project assignment; (2) mid-semester test; and (3) final examination. Further information is available in the unit outline.
Supplementary assessment is not available in this unit except in the case of a bachelor's pass degree student who has obtained a mark of 45 to 49 overall and is currently enrolled in this unit, and it is the only remaining unit that the student must pass in order to complete their course.
- Unit Coordinator(s)
- Dr Jianxin Li
- Unit rules
- CITS1402 Relational Database Management Systems (formerly CITS1402 Introduction to Databases)
CITS2232 Databases; for pre-2012 courses: CITS1402 Relational Database Management Systems (formerly CITS1402 Introduction to Databases)
- CITS4243 Advanced Databases
- Contact hours
- lectures: 2 hours per week; practical classes: 2 hours per week; labs: 2 hours per week
- Unit Outline
Han, J. and Kamber, M. Data Mining: Concepts and Techniques, 2nd edn: Elsevier/Morgan Kaufmann 2006
- The availability of units in Semester 1, 2, etc. was correct at the time of publication but may be subject to change.
- All students are responsible for identifying when they need assistance to improve their academic learning, research, English language and numeracy skills; seeking out the services and resources available to help them; and applying what they learn. Students are encouraged to register for free online support through GETSmart; to help themselves to the extensive range of resources on UWA's STUDYSmarter website; and to participate in WRITESmart and (ma+hs)Smart drop-ins and workshops.
- Books and other material wherever listed may be subject to change. Book lists relating to 'Preliminary reading', 'Recommended reading' and 'Textbooks' are, in most cases, available at the University Co-operative Bookshop (from early January) and appropriate administrative offices for students to consult. Where texts are listed in the unit description above, an asterisk (*) indicates that the book is available in paperback.