Getting started with Odin
What are LLMs?
3 min
odin's chat is powered by a large language model (llm) in chat, you can use the llm to explore topics and learn new things please read how to use chat in how to use chat? docid 4evqr7o6sa kbvhtdti8m in this article, we will explain what llms are, and how they can best be used what are large language models? what are large language models? large language models large language models (llms) are advanced artificial intelligence systems designed to understand and generate human language they are built using deep learning techniques, particularly neural networks with many layers (hence "large") these models are trained on vast amounts of data to learn the patterns, structures, and nuances of language, enabling them to perform a variety of natural language processing tasks training of large language models (llms) training of large language models (llms) data collection data collection llms are trained on massive datasets comprising text from books, articles, patents, websites, and various other sources the aim is to expose the model to a diverse range of language patterns, topics, and styles training process training process the model learns to predict the next word in a sequence given the previous words, a process known as autoregressive training this involves masking parts of the training texts, and asking the llm to predict which word should have been in the masked space this process helps the llm build a 'model of the world' through training, the llm learns about every topic it is trained on after training, it can be used as a type of tireless librarian that can help you uncover information use cases for large language models (llms) use cases for large language models (llms) llms are useful for various tasks, including question answering providing answers to user queries based on given context natural language understanding text classification, sentiment analysis, entity recognition text generation writing assistance, automated content creation, chatbots translation converting text from one language to another summarization condensing long documents into concise summaries pitfalls of llms pitfalls of llms hallucinations and misinformation hallucinations and misinformation llms can generate plausible but incorrect or misleading information they lack the ability to verify facts and may produce false statements with confidence to understand why, we need to understand what llms are fundamentally, llms predict the most likely next word given a prior sequence of words the reason it can predict incorrect words as the most probable next word in a sequence is because llms are a compressed model of the world, similar to how an mp3 file is a compressed version of raw audio therefore, it remembers key parts of the information it has been trained on but does not remember every single detail in other words, it finds the red thread through all of its training data, remembers that, and uses it to predict the next words therefore, the more specific the question, the more likely it is to hallucinate for example, asking it to cite which specific journal it got a particular piece of information from will likely result in hallucinations whereas asking it which academic topics have most actively been explored in neuroscience in the past 10 years will likely result in an accurate answer if you do use an llm to answer very specific questions, make sure to check whether its answer is correct or first limit it to a specific piece of text to answer your question with for example, in our chat with patent feature, the llm is limited to the exact text that is present in the patent in this scenario, it is very unlikely that the llm hallucinates because it is forced to answer your question based on a given piece of text and does not have to reason about it itself