What are open-source LLMs?

Open-source large language models (LLMs) are AI models that use publicly available natural language text and software program code data to learn, understand and replicate human language.

Get demo

Things to know about open-source LLMs

Open-source LLMs vs proprietary

Open-source software vs AI

What are top open-source LLMs?

Open-source LLM use cases?

What industries use open-source LLMs? 

What are the benefits?

What are some of the challenges?

What should an organisation?

ServiceNow LLMs

Open Source LLMs, a form of open-source AI, can be freely modified and shared, enabling collaboration and customisation without licensing fees. This openness promotes transparency, safety, competition and diverse applications across industries. A form of open-source AI, these LLMs can be modified and shared freely, allowing collaboration and customisation for any purpose without permission or licensing fees — promoting transparency and safety, competition and polyculture and diverse applications. 

Large language models are the foundation of modern user interfaces powered by artificial intelligence. These models, trained with massive datasets and advanced neural network architectures, make it possible for humans to interact with applications on a natural level. The value of LLMs lies in their ability to perform nuanced language tasks, such as summarisation of large passages of text or, conversely, generating large bodies of text in response to their human user's instructions. In fields as diverse as customer service, research, content creation and education, LLMs allow users to make complex queries and instructions, and for the AI to respond in a way that even inexperienced users can comprehend at super-user equivalency in some cases.  

But while these models may allow for user-friendly interactions, they are extremely complex, often built on proprietary data that reduces visibility into their internal structure and capabilities. While open-source LLMs generally speaking offer a more accessible alternative, the degrees of openness vary. While some large language models throw back the curtain on the underlying training code and data, allowing anyone to use, modify or distribute it, others may only release the model and limited supporting artefacts, making auditing the upstream model supply chain near impossible. By democratising access to powerful AI tools, open-source LLMs give a broader range of developers the power to innovate and customise AI solutions to their specific needs.  

Expand All

Collapse All

What are open-source LLMs vs proprietary LLMs? 

Any discussion of open-source LLMs begs the question: How are open-source LLMs different from proprietary LLMs? Open-source LLMs and proprietary LLMs differ significantly in transparency, accessibility, adaptability and community engagement.

Open-source LLMs

Open-source LLMs are freely available to be used and modified as any user sees fit and may be shared (modifications and all) with others. This encourages a collaborative approach to downstream innovation, allowing developers to customise these models to meet specific needs while also contributing to a dynamic ecosystem where improvements and new applications are continuously emerging. Essentially, every developer that improves on the base model and open sources their fine-tuned model becomes a member of the AI ecosystem for that community.

For this to be possible, open-source LLMs rely heavily on transparency in the model's architecture, training and intended use which it was designed to support. To maximise community and commercial adoption, the data used for pre-training and evaluation, the resources involved and the underlying code itself must be fully accessible for review. This is a major differentiating factor in open-source LLMs, as proprietary LLMs typically lack visibility into their inner workings.

Open-source LLMs offer enhanced freedom in the form of flexibility; organisations can tailor the LLM to their unique specifications. On the other hand, changes introduced in the derivative LLM could result in weakened security, so it is important to select open-source LLMs that have robust data and model governance practices in place to ensure new models meet safety and performance expectations for downstream users. Working with an open-source solution can involve significant costs, including hiring and training expertise, upfront legal fees, feature upgrades, security compliance, support, talent retention and ongoing software life cycle management.

Proprietary LLM 

Proprietary LLMs are controlled and owned by individual entities, with access typically restricted through licences and fees. Companies like OpenAI and Google offer powerful LLMs, but their use is often limited to predefined application programming interfaces (APIs) or specific applications dictated by the providers. This closed approach can limit customisation and adaptation, potentially increasing costs and limiting access to inference cloud computer infrastructure that may not be in the same region as the end user.  Additionally, proprietary LLMs may build on fine-tuned versions of open-source models, with companies adding unique enhancements or 'secret sauce'—such as improved performance or specialised functionalities—that make these versions proprietary. An example of this approach includes offerings like Now LLMs, where tailored improvements set them apart from open-source alternatives. 

That said, there are some advantages to working with proprietary LLM solutions. Specifically, owned and licensed LLMs tend to offer stronger security and are more user-friendly, with company-backed support providing assistance when needed.

Business Applications Made Easy and Quick

The ServiceNow AI Platform allows no/low code development so that business operations analysts can build or prototype their own apps without writing a line of code.

Get eBook

Open-source software vs open-source AI models? 

Open-source software is nothing new; in fact, some of the earliest software was shared freely between those who worked on computers. But with the relatively recent advent of artificial intelligence, the concepts behind open source are beginning to take root in AI. Still, although they share core philosophies of transparency, collaboration and accessibility, there are also some significant differences:

Open-source software 

Open-source software refers to programs or systems whose source code is made freely available for anyone to use, modify and share. Open-source software is governed by licences like the GNU General Public License (GPL) or Apache License, which outline the terms of use and redistribution. Popular examples include Linux, Apache HTTP Server and Mozilla Firefox, which continue to thrive through contributions from global developer communities.

Open-source AI 

With the rise of artificial intelligence, the principles behind open-source software have been adapted to AI. Open-source AI is the result. The Open Source Initiative defines open-source AI as:   "An AI system made available under terms and in a way that grant the freedoms to:  

Use the system for any purpose and without having to ask for permission.  
Study how the system works and inspect its components.  
Modify the system for any purpose, including to change its output.  
Share the system for others to use with or without modifications, for any purpose.

These freedoms apply both to a fully functional system and to discrete elements of a system. A precondition to exercising these freedoms is to have access to the preferred form to make modifications to the system." 

Like open-source software, open-source AI promotes transparency and collaboration by allowing developers direct access to the AI's code. Open-source licences play a central role in this ecosystem. Licences (such as the Blue Oak Model License) outline the terms and conditions associated with using the AI. These legal frameworks help keep open-source AI accessible while protecting contributors from liability.  

Open-source LLMs employ open-source AI supported by licences to provide developers with the freedom to customise and adapt large language models to their specific needs without the constraints of proprietary systems.    

What are top open-source LLM options? 

There are many options for organisations that are interested in working with open-source LLMs. The following are some of the most transparently developed open-source language models currently available: 

StarCoder 

StarCoder, developed by the BigCode project open-scientific collaboration led by Hugging Face and ServiceNow, is an open-source LLM designed for code generation. Trained on over 80 programming languages, it excels in code generation, workflow generation and even text summarisation tasks. StarCoder has a large context window and unique features like infilling capabilities, making it a strong choice of foundation model. It is licensed under OpenRAIL-M, which allows for free commercial use with ethical and responsible AI use case restrictions. 

Luminous 

Created by the German AI startup Aleph Alpha, Luminous focuses on delivering advanced natural language understanding and generation capabilities. It is designed to compete with advanced LLMs (such as recent versions of ChatGPT) while offering transparency and ethical AI development. Luminous consists of 13 billion parameters and is available for tasks ranging from small to large-scale language applications. 

Granite 

IBM's Granite models are open-source code LLMs designed for enterprise-grade applications. Granite models are trained on 116 programming languages and can be used for code generation and bug fixing along with more traditional summarisation and explanation. They are released under the Apache 2.0 license, making them suitable for both research and commercial use. 

Phi-2

Phi-2, developed by Microsoft Research, is a lightweight, 2.7 billion-parameter model that nonetheless is capable of competing with many larger models in terms of performance. Phi-2 can perform complex tasks, including commonsense reasoning, coding, maths and language understanding. Its compact size makes it ideal for efficient experimentation and it is available in the Azure AI Studio. 

Stable Video Diffusion 

Building on the success of the Stable Diffusion framework, Stable Video Diffusion is an open-source model focused on video generation and editing. It leverages AI to create dynamic visual content, offering powerful tools for industries like entertainment and advertising. 

Llama 3

Meta's Llama 3 (with 3.1 being the most recent release) is the latest in the Llama family of models. Available in model sizes ranging from 8 billion to 405 billion-parameter versions, Llama 3 builds on the architecture of its predecessors. Llama 3 performs well in reasoning, coding and multilingual tasks, and has expanded safety tools or detecting insecure code

BERT 

Developed by Google, BERT (Bidirectional Encoder Representations from Transformers) is an encoder-only transformer model designed to understand and generate human language. Since its release in 2018, BERT has been widely adopted for various natural language processing (NLP) tasks such as text classification, question answering and sentiment analysis. Despite its relative age, its influence continues to shape modern NLP. 

What are open-source LLM use cases? 

Versatile and accessible, open-source LLMs have a wide array of applications across a range of industries. Here are some prominent ways in which they can be applied: 

Podcast generation 

Open-source LLMs can be used to effortlessly transform PDFs into engaging podcast-style audio. By implementing a compound LLM workflow, you can convert text from PDFs into a conversational script and then narrate it using advanced text-to-speech technology. This is ideal for creating accessible content, educational materials or just bringing your documents to life in a new format. 

Sentiment analysis 

It can be hard to gauge the emotion behind customer feedback. Open-source LLMs can be trained to analyse text to determine the overall sentiment being expressed: positive, negative or neutral. This allows businesses to better understand customer responses to improve their products and services. Sentiment analysis is a major tool in monitoring social media for customer insights.  

Code generation 

Many open-source LLM models can assist developers by providing code suggestions, writing complex algorithms, fixing bugs in code and even documenting code. They can generate code snippets from natural language descriptions, allowing users to tell the code what they want it to do in plain English. 

Text generation 

LLMs are widely used for generating coherent and contextually relevant text. This includes creating articles, stories and dialogue for virtual assistants. This is a major part of Generative AI (GenAI), and has applications in essentially every arena.  

Content creation and summarisation 

Open-source LLMs can automate content creation and generate summaries of lengthy documents, helping users to quickly grasp the main points of extensive texts. This is particularly valuable for professionals who need to process large volumes of information efficiently. 

Language translation

LLMs are helping break down language barriers. With their multilingual capabilities, many open-source LLMs can translate text between multiple languages, making clear communication with a global audience possible. These models are trained on diverse language datasets, ensuring accurate and context-aware translations that go well beyond simple word-by-word translations. 

AI chatbots/customer support 

Open-source LLMs power today's AI chatbots and virtual assistants, enhancing customer support by providing quick and accurate responses to queries. When designed for conversational tasks, LLMs can handle customer interactions effectively and in a way that feels natural for the user. And should there be anything the LLM is not prepared to address, it can be automated to escalate the issue to a human agent. 

Personalised learning support 

LLMs can be integrated into educational platforms to provide personalised learning experiences. They can adapt content to individual learning styles, offer explanations and generate practice problems specifically tailored to the needs and capabilities of individual users.  

What industries use open-source LLMs? 

The applications outlined above illustrate just how adaptable large language models are. Many businesses are adopting the open-source version of this technology, constantly discovering new ways to leverage LLM solutions to better serve their customers and meet their goals. The following are ways in which key industries are already applying open-source LLMs:

Healthcare

LLM-powered AI telemedicine solutions provide always-available virtual caregivers capable of diagnosing, providing information and organising patient information. Much like AI chatbots and virtual agents, these programs are designed to evaluate patient issues based on insights from extensive data sets, so they know when to provide assistance on their own and when to alert human medical teams.

Finance

In the financial sector, open-source LLMs enhance fraud detection, automate customer support and perform sentiment analysis to identify emerging trends. These models analyse financial documents and other data to provide real-time market insights.

Journalism and news

Journalists and news organisations utilise open-source LLMs for summarisation and translation. LLMs can be used internally to analyse information without sharing proprietary data outside the newsroom. And, for when circumstances demand an immediate turn-around, news agencies can direct open-source LLMs to generate relevant and informative content that speaks to their intended audience.

Science-based industries

LLMs support scientific research by automating literature reviews, data analysis and hypothesis generation, but they can also do so much more. The extreme flexibility of these models means scientists can adapt them to any kind of research — from fighting climate change to analysing DNA sequences and modelling astrophysical phenomena.

What are the benefits of open-source LLMs? 

Customisable autonomous systems that can follow human-language directions and respond in kind — it is not hard to imagine what kinds of advantages these models can mean for business. Companies that leverage open-source LLMs can expect to see:

Enhanced cost-effectiveness 

Open-source LLMs are free to use, which eliminates licensing fees associated with proprietary models. Businesses of all sizes (and budgets) access advanced AI tools, making AI innovation accessible across the board. Just be aware of the increased infrastructure costs associated with open-source LLM use, which may demand more upfront investment. 

Unmatched flexibility 

Open-source LLMs offer unmatched flexibility. Organisations can tailor these models to their specific needs without being stuck locked into a single vendor. The freedom to modify and enhance the models as needed ensures that business demands never need to go unaddressed. 

Optimal code transparency 

Access to the LLM's source code, architecture and training data gives users the visibility they need to understand the inner workings of the model. Transparency builds trust, aids in audits and ensures ethical and legal compliance. 

Increased community contribution   

The open-source nature of these LLMs fosters a collaborative environment where developers worldwide can work together to contribute to the models. Community contributions lead to continuous improvements, rapid issue resolution and near-constant feature introduction and refinement.  

Improved opportunities for LLM optimisation 

Not every LLM is a perfect fit for every organisation. With open-source LLMs, this is not a problem, as developers can make small tweaks and coarse corrections to the model, fine-tuning its performance to deliver the best possible results for the tasks it's applied to.  

What are some of the challenges and risks of open-source LLMs? 

While open-source LLMs offer many benefits, they can also present certain risks. Understanding the following challenges and how to counter them is an important part of ensuring responsible and effective use:

Ethical use 

Open-source LLMs, due to their unrestricted access, can be used for harmful purposes as easily as beneficial ones. Spreading misinformation, violating privacy, accessing restricted or proprietary information — these are all ways in which an AI can be exploited. Ensuring ethical use requires proactive community governance and clear guidelines to balance innovation with safety and responsibility.  

Reliability and accuracy 

Community contributions to open-source LLMs can vary in quality, leading to inconsistent outputs. Without standardised oversight, these models might yield unreliable results, particularly in applications where accuracy is critical. Maintaining high standards means keeping a close eye on all code contributions and refining them when necessary. Unfortunately, this can be a major job when working with open-source codes. 

Bias 

Any AI is only as good as the information it's built on, and when that information includes prejudiced opinions or unfair assumptions, the AI may learn to perpetuate them. LLMs can become influenced by the biases present in their training data, resulting in skewed and unfair outputs. Vigilant oversight and diverse data curation are essential to mitigate bias and ensure fairness and inclusivity in AI-generated content and decisions. 

What should an organisation look for in open-source LLMs?

When selecting an open-source large language model (LLM), organisations need to consider several factors to ensure they choose the best fit for their needs. Here are the most important considerations: 

Goals 
Identify the main purpose of the LLM. What is it going to achieve for the business? What will its focus be? Some models are tailored for research, while others are suited for commercial use. Make sure the LLM aligns with specific goals.

Accuracy 
Evaluate the model's accuracy for the tasks it will face. Even with access to customise the code, some LLMs are better suited for different uses.

Cost 
While open-source LLMs are free, consider the costs of hosting, training and maintaining the model. Larger models require more resources, which can increase infrastructure and operational expenses.

Performance 
Assess the LLM's language fluency, coherence and context comprehension. High performance in these areas improves user experience and task effectiveness.

Data Security 
Ensure the LLM can securely handle sensitive data, especially when dealing with personal or proprietary information.

Quality of training data 
Check the quality of the original training data used by the LLM. High-quality data leads to better outputs. If necessary, use custom data to fine-tune the model for improved results.

Available skillsets 
Complex LLMs require advanced skills in data science, machine learning operations (MLOps) and NLP. Ensure the team that will be working most closely with the LLM has the necessary experience. If they do not, make plans to train or hire to fill that skill gap.

ServiceNow Pricing

ServiceNow offers competitive product packages that scale with you as your enterprise business grows and your needs change.

Get Pricing

ServiceNow LLMs: Built on a foundation of open-source LLMs

ServiceNow delivers top-tier proprietary language models through the Now LLM Service, enabling advanced AI-driven features like chat summarisation, agent record summarisation, AI-enhanced search, dynamic translation and seamless conversational flows. These capabilities are built on open-source innovation; ServiceNow openly trains foundation models, such as StarCoder, as part of its initiatives. These models are then refined into proprietary Now LLMs, specifically tailored to optimise productivity across the enterprise and deliver exceptional generative AI experiences. 

This open-source foundation, enhanced through collaboration with partners like Hugging Face and NVIDIA, allows ServiceNow to advance AI responsibly while sharing valuable innovations with the community. Through this approach, organisations benefit from powerful, specialised AI skills — from automated workflow recommendations to precise text-to-code generation. And, because Now LLMs are part of the fully integrated ServiceNow AI Platform®, they seamlessly enhance existing workflows and applications to drive efficiency across every department. 

See ServiceNow in action and discover how AI and LLM can enhance your operations; schedule your demo today! 

Dive deeper into generative AI

Accelerate productivity with Now Assist – generative AI built right into the ServiceNow AI Platform.

Learn more

Contact Us

Resources

Articles

 What is ServiceNow?

 What is platform as a service (PaaS)?

What is machine learning?

Analyst Reports

IDC InfoBrief: Maximise AI Value with a Digital Platform

Data Sheets

ServiceNow AI Platform

Predictive Intelligence

Performance Analytics

eBooks

 Empowering CIOs to Lead

 The Shifting Role of the CIO

 4 steps to automate and connect your organisation

White Papers

 TM Forum Report: How to lead in the open API economy

Automotive

Banking

Consumer Packaged Goods

Healthcare

Insurance

Life Sciences

Manufacturing

Nonprofit

National Government

Retail

Technology Providers

Telecom

Find a partner

Become a partner

Partner awards

Partner portal

Partner applications

Careers

Investors

ServiceNow AI Research

Leadership

Locations

Newsroom

Analyst Reports

Global impact

Trust and compliance

AI Agents

IT Service Management

ServiceNow AI Control Tower

IT Operations Management

Customer Service Management

Strategic Portfolio Management

IT Asset Management

Governance, Risk, and Compliance

Security Operations

Field Service Management

HR Service Delivery

Employee Center

AI

Data

Workflows

AI Experience

Infrastructure

RaptorDB

AI Agents

ServiceNow AI Control Tower

Security

App Engine

ServiceNow Store

Responsible AI

Provide better experiences

Resolve issues faster

Create and automate workflows

Enterprise Architecture

Service Operations Workspace

Cloud Governance Suite

Operational Technology Management

IT Asset Management

IT Operations Management

IT Service Management

ServiceNow Cloud Observability

Strategic Portfolio Management

Digital End-user Experience

Customer Service Management

Field Service Management

Sales and Order Management

Configure, Price, Quote

Financial Services Operations

Healthcare and Life Sciences Service Management

Sales and Order Management for Technology Providers

Sales and Order Management for Telecommunications

Public Sector Digital Services

Telecommunications Service Management

Technology Provider Service Management

Security Operations

Security Incident Response

Vulnerability Response

Threat Intelligence Security Center

Integrated Risk Management

Third-party Risk Management

Security Posture Control

Privacy Management

HR Service Delivery

Talent Development

Legal Service Delivery

Workplace Service Delivery

App Engine

Integration Hub

Accounts Payable Operations

Sourcing and Procurement Operations

Supplier Lifecycle Operations