263-5354-00: Large Language Models - ETH Zürich
Computer Science
Official Description from ETH Zürich:
Large language models have become one of the most commonly deployed NLP inventions. In the past half-decade, their integration into core natural language processing tools has dramatically increased the performance of such tools, and they have entered the public discourse surrounding artificial intelligence.
Archived Document(s):
263-5354-00 Section 01 - Probabilistic Foundations (open in new window)
263-5354-00 Section 02 - Modeling Foundations (open in new window)
263-5354-00 Section 03 - Classical Language Models (open in new window)
263-5354-00 Section 04 - Neural Network Language Models (open in new window)
263-5354-00 Section 05 - Training, Fine Tuning and Inference (open in new window)
263-5354-00 Section 06 - Applications and the Benefits of Scale (open in new window)