263-5354-00: Large Language Models - ETH Zürich

Computer Science

Official Description from ETH Zürich:

Large language models have become one of the most commonly deployed NLP inventions. In the past half-decade, their integration into core natural language processing tools has dramatically increased the performance of such tools, and they have entered the public discourse surrounding artificial intelligence.


Archived Document(s):

263-5354-00 Section 01 - Probabilistic Foundations (open in new window)

263-5354-00 Section 02 - Modeling Foundations (open in new window)

263-5354-00 Section 03 - Classical Language Models (open in new window)

263-5354-00 Section 04 - Neural Network Language Models (open in new window)

263-5354-00 Section 05 - Training, Fine Tuning and Inference (open in new window)

263-5354-00 Section 06 - Applications and the Benefits of Scale (open in new window)

263-5354-00 Section 07 - Security (open in new window)