Studying online

There are now 3 possible online modes for units:

Units with modes Online timetabled and Online flexible are available for any student to self-enrol and study online.

Units available in Online Restricted mode have been adapted for online study only for those students who require the unit to complete their studies and who are unable to attend campus due to COVID border closures. To be enrolled in a unit in Online Restricted mode, students should contact their Student Advising Office through askUWA and include which of the below criteria applies:

  • You are a student who is currently offshore and unable to enter Australia.
  • You are a student in Australia who is impacted by state or regional border closures.

Click on an offering mode for more details.

CITS4012 Natural Language Processing

Credit
6 points
Offering
(see Timetable)
AvailabilityLocationMode
Semester 1UWA (Perth)Face to face
Semester 2OnlineOnline timetabled
Semester 2UWA (Perth)Face to face
Content
Natural Language has been and will remain as the most preferred way to store and transfer knowledge. More than 80% of electronic data in modern societies are generated and stored in textual format. How to process unstructured text to extract useful insights and support actionable decision making and discover the hidden treasure of collective intelligence is of enormous value. In this unit, we start with traditional text processing techniques using Regular Expressions and discuss the needs of text processing and normalisation. We then introduce fundamental pipelines of natural language processing (NLP), including part-of-speech tagging and various ways of sentence parsing, with the aim of introducing traditional text feature collection techniques for higher-level tasks such as sentiment or document classification. Building on the understanding of the pros and cons of feature-based NLP pipeline approaches, the unit moves onto the modern approach of deep learning for NLP, focusing on word vector representation, neural language models, and recurrent neural networks for NLP. The unit situates the techniques around major NLP tasks, including information extraction, sentiment detection, dialogue systems and machine translation.
Outcomes
Students are able to (1) apply pre-processing techniques for textual data preparation; (2) build pipelines for core NLP tasks; (3) critically analyse different language models; (4) explain how vector representations of words can be obtained; (5) evaluate performance of NLP solutions, both traditional and neural; and (6) undertake core components of major NLP tasks.
Assessment
Indicative assessments in this unit are as follows: (1) programming assignment and (2) final examination. Further information is available in the unit outline.

Supplementary assessment is not available in this unit.
Unit Coordinator(s)
Associate Professor Wei Liu
Unit rules
Prerequisites:
enrolment in Bachelor of Advanced Computer Science [Honours] or Master of Data Science and 12 points of programming based-units.
Co-requisites:
Nil
Incompatibility:
Nil
Contact hours
lectures: 2 hours per week; laboratories: 2 hours per week.
Note
CITS4012 will only be offered in Semester 1 from 2022.
  • The availability of units in Semester 1, 2, etc. was correct at the time of publication but may be subject to change.
  • All students are responsible for identifying when they need assistance to improve their academic learning, research, English language and numeracy skills; seeking out the services and resources available to help them; and applying what they learn. Students are encouraged to register for free online support through GETSmart; to help themselves to the extensive range of resources on UWA's STUDYSmarter website; and to participate in WRITESmart and (ma+hs)Smart drop-ins and workshops.
  • Unit readings, including any essential textbooks, are listed in the unit outline for each unit, one week prior the commencement of study. The unit outline will be available via the LMS and the UWA Handbook one week prior the commencement of study. Reading lists and essential textbooks are subject to change each semester. Information on essential textbooks will also be made available on the Essential Textbooks. This website is updated regularly in the lead up to semester so content may change. It is recommended that students purchase essential textbooks for convenience due to the frequency with which they will be required during the unit. A limited number of textbooks will be made available from the Library in print and will also be made available online wherever possible. Essential textbooks can be purchased from the commercial vendors to secure the best deal. The Student Guild can provide assistance on where to purchase books if required. Books can be purchased second hand at the Guild Secondhand bookshop (second floor, Guild Village), which is located on campus.