Take a step back from Generative AI, by Jerry Trites
Interested in adopting ChatGPT or generative AI for your business? Take a step back and look at their Large Language Model (LLM) origins. An LLM is a neural network using an algorithm that “processes natural language inputs and predicts the next word based on what it’s already seen. It goes on to predict the next word, and the next word, and so on until its answer is complete.”
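The word-by-word prediction loop described above can be illustrated with a toy sketch. This is not how ChatGPT or any real LLM is built: it swaps the neural network for a simple table of word-pair counts, and the function names (`train_bigram`, `generate`), the sample text, and the stop token are all assumptions made up for illustration. It only shows the idea of predicting one word, appending it, and repeating until the answer is complete.

```python
from collections import Counter, defaultdict


def train_bigram(text):
    """Count, for each word, which words follow it in the training text."""
    counts = defaultdict(Counter)
    words = text.lower().split()
    for current, following in zip(words, words[1:]):
        counts[current][following] += 1
    return counts


def generate(counts, prompt, max_words=10, stop="."):
    """Append the most frequent next word until a stop token or the word limit."""
    output = prompt.lower().split()
    for _ in range(max_words):
        candidates = counts.get(output[-1])
        if not candidates:
            break  # the last word was never seen with a follower
        next_word = candidates.most_common(1)[0][0]
        output.append(next_word)
        if next_word == stop:
            break  # the "answer" is complete
    return " ".join(output)


# Hypothetical training text; a real LLM learns from vastly more data.
corpus = (
    "the model predicts the next word . "
    "then the next word . "
    "then the answer is complete ."
)
model = train_bigram(corpus)
result = generate(model, "the model")
print(result)
```

A real LLM replaces the count table with a neural network that weighs the whole preceding text rather than just the last word, but the outer loop of predicting, appending, and repeating is the same in spirit.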
ChatGPT is based on the GPT LLM, but there are others. “Along with OpenAI’s GPT-3 and 4 LLM, popular LLMs include open models such as Google’s LaMDA and PaLM LLM (the basis for Bard), Hugging Face’s BLOOM and XLM-RoBERTa, Nvidia’s NeMO LLM, XLNet, Co:here, and GLM-130B.”
A distinguishing feature among these models is the data on which they are trained. ChatGPT is trained on a large volume of data scraped from the internet. The others use a variety of sources, depending on their intended use.
There is growing interest in open-source LLMs, which could potentially be used more effectively than other LLMs for specific corporate purposes, such as fighting cybersecurity threats. An LLM trained on data relevant to security breaches would act more directly on security issues rather than combing through a lot of irrelevant data. It would also reduce the volume of data being used, thus reducing cost.
For more on this important idea, check out this article: https://www.computerworld.com/article/3697649/what-are-large-language-models-and-how-are-they-used-in-generative-ai.html .