๐Ÿฆฉ Apex Vanguard Learn

Software & AI Licenses Explained

A practical, plain-English reference to the licenses that govern open-source code, source-available products, datasets, documentation, and AI models. Know what each one actually requires before you ship โ€” what you must share, what you must credit, and what patent rights come with it.

25+licenses compared
6license categories
5obligations decoded
Start here

The five questions that matter

A software license is a grant of permission with strings attached. Almost every dispute comes down to five practical questions. Read any license โ€” open source or not โ€” by answering these:

Copyleft in one sentence: a copyleft license uses copyright to require that derivative works be released under the same (or a compatible) license โ€” the opposite of public domain. "Strong" copyleft reaches into code you link against; "weak" copyleft is limited to the licensed files; "network" copyleft adds the trigger of merely offering the software as a hosted service.
The big reference

License comparison table

The major software, source-available, data, and AI licenses at a glance. "Must share source?" describes the copyleft trigger; "Patent grant?" means an express grant in the license text. This is a summary for orientation โ€” always read the actual license and confirm version numbers.

โœ… = yes   โŒ = no   โš ๏ธ = conditional / see notes. Summary only; not a substitute for the license text or legal advice.
LicenseTypeCommercial useMust share source?Attribution?Patent grant?Notes
Permissive
MITPermissiveโœ… YesโŒ Noโœ… Keep noticeโŒ None expressShortest, most popular. Do anything; keep the copyright + license text.
BSD-2-ClausePermissiveโœ… YesโŒ Noโœ… Keep noticeโŒ None express"Simplified BSD." Like MIT with a no-warranty clause.
BSD-3-ClausePermissiveโœ… YesโŒ Noโœ… Keep noticeโŒ None expressAdds a no-endorsement clause: don't use the author's name to promote your product.
ISCPermissiveโœ… YesโŒ Noโœ… Keep noticeโŒ None expressFunctionally equivalent to MIT, simpler wording. Default in the npm ecosystem.
Apache-2.0Permissiveโœ… YesโŒ Noโœ… Notice + NOTICE file; state changesโœ… Express + retaliationThe "business-safe" permissive license. Explicit patent grant that terminates if you sue over patents. Preferred for company projects.
UnlicensePublic-domainโœ… YesโŒ NoโŒ NoneโŒ None expressDedicates work to the public domain. No attribution required. (Public-domain dedication isn't valid in every jurisdiction.)
CC0-1.0Public-domain (data/docs)โœ… YesโŒ NoโŒ NoneโŒ NoneCreative Commons public-domain tool. Common for datasets. Not recommended for source code (no warranty/patent terms).
Weak copyleft (file- or library-scoped)
MPL-2.0Weak copyleftโœ… Yesโš ๏ธ Modified MPL files onlyโœ… Keep noticesโœ… ExpressFile-level copyleft: share changes to MPL-licensed files; your other files stay yours. Mixes cleanly into proprietary products.
LGPL-2.1 / 3.0Weak copyleftโœ… Yesโš ๏ธ Library changes; allow relinkingโœ… Keep noticesโš ๏ธ Via GPL terms (3.0)For libraries. You may link from proprietary code if users can replace the LGPL library (dynamic linking / object files).
EPL-2.0Weak copyleftโœ… Yesโš ๏ธ Modified EPL files onlyโœ… Keep noticesโœ… ExpressEclipse Public License. File-level copyleft, business-friendly. Common in the Java/Eclipse world.
Strong copyleft
GPL-2.0Strong copyleftโœ… Yesโš ๏ธ On distribution (whole work)โœ… Keep noticesโŒ None express (implied)Distribute a binary โ†’ must offer complete corresponding source under GPL. No express patent grant. Linking GPL code generally makes the combined work GPL.
GPL-3.0Strong copyleftโœ… Yesโš ๏ธ On distribution (whole work)โœ… Keep noticesโœ… Express + anti-tivoizationGPL-2.0 plus an express patent grant, hardware install-info ("anti-tivoization"), and clearer termination. Not one-way compatible with GPL-2.0-only.
Network copyleft
AGPL-3.0Network copyleftโœ… Yesโš ๏ธ Also when offered over a networkโœ… Keep noticesโœ… ExpressCloses the "SaaS loophole": if users interact with a modified version over a network, you must offer them the source. Often avoided by companies for internal/hosted use.
Source-available (NOT OSI open source)
BSL 1.1 (Business Source)Source-availableโš ๏ธ Limited until change dateโœ… Source visibleโœ… Keep noticesโŒ None expressSource is public but production use may be restricted (e.g., no competing service) until a "change date," when it converts to an open license (often Apache/GPL). Used by MariaDB, HashiCorp.
Elastic License 2.0Source-availableโš ๏ธ Not as a managed serviceโœ… Source visibleโœ… Keep noticesโŒ None expressFree to use/modify, but you may not provide it as a hosted/managed service to third parties or circumvent license keys.
SSPL-1.0Source-availableโš ๏ธ Service-provider burdenโš ๏ธ Must open the whole service stackโœ… Keep noticesโŒ None expressAGPL-derived. If you offer it as a service, you must release the source of the entire management/hosting stack. Not OSI-approved. Used by MongoDB.
Data & documentation (Creative Commons)
CC-BY-4.0Data/docs ยท attributionโœ… YesโŒ No (share-alike not required)โœ… Credit requiredโŒ N/AUse anything if you credit the author. Standard for open datasets and docs. (CC is for content, not code.)
CC-BY-SA-4.0Data/docs ยท copyleftโœ… Yesโš ๏ธ Derivatives under same licenseโœ… Credit requiredโŒ N/A"Copyleft for content." Adaptations must be CC-BY-SA. Powers Wikipedia / OpenStreetMap.
CC-BY-NC-4.0Data/docs ยท non-commercialโŒ Non-commercial onlyโŒ Noโœ… Credit requiredโŒ N/ANo commercial use without a separate license. "NC" is a frequent trap for businesses building on "free" data.
AI model & dataset licenses
OpenRAIL / OpenRAIL-MAI ยท responsible-useโœ… YesโŒ Noโœ… Pass terms downstreamโš ๏ธ Varies by instancePermissive except for enumerated use-based restrictions (e.g., no harm, no unlawful surveillance). Restrictions must flow to all downstream users. Used by Stable Diffusion, BLOOM.
Llama Community LicenseAI ยท custom (Meta)โš ๏ธ Yes, with conditionsโŒ Noโœ… "Built with Llama" + licenseโš ๏ธ ConditionalCommercial use allowed, but a >700M monthly-active-user threshold requires a separate Meta license; naming/attribution rules apply. Not OSI open source.
Gemma Terms of UseAI ยท custom (Google)โœ… YesโŒ Noโœ… Pass terms + noticesโš ๏ธ ConditionalBroad commercial rights with a Prohibited Use Policy that binds you and all downstream recipients. Not OSI open source.
Commercial / ProprietaryProprietaryโš ๏ธ Per contractโŒ No (closed)โš ๏ธ Per contractโš ๏ธ Per contractA negotiated EULA/SaaS agreement. Everything โ€” use, redistribution, patents, support, indemnity โ€” is whatever the contract says. No default rights.
Reading the "Type" column: only permissive, weak/strong/network copyleft, and the public-domain rows are OSI-approved open source. Source-available licenses (BSL, Elastic, SSPL) and the custom AI licenses (Llama, Gemma) let you see the code or weights but are not open source โ€” they carry field-of-use limits. Don't assume "the source is on GitHub" means "I can do anything."
Categories

What each category means

Plain-English explainers for the six families above. When in doubt, classify a license into one of these before you read the fine print.

Permissive

Do almost anything

MIT, BSD, ISC, Apache-2.0. Use, modify, embed in closed products โ€” you only have to preserve the copyright and license notice. Apache-2.0 adds an express patent grant, which is why companies prefer it for anything patent-sensitive.

Weak copyleft

Share changes to these files

MPL-2.0, LGPL, EPL-2.0. The copyleft is scoped to the licensed files or library โ€” your surrounding proprietary code stays proprietary. The classic way to consume copyleft inside a commercial product.

Strong copyleft

The whole work goes with it

GPL-2.0, GPL-3.0. Distribute software that includes GPL code and the combined work generally must be GPL โ€” and you must offer complete source. Powerful for ensuring freedom; needs care in a commercial stack.

Network copyleft

SaaS counts as distribution

AGPL-3.0. Adds a trigger that ordinary GPL lacks: if users reach a modified version over a network, you must offer them the source. This is what makes AGPL controversial for hosted services.

Source-available

Visible, but restricted

BSL, Elastic-2.0, SSPL. You can read and modify the code, but commercial or "compete-with-us" use is curtailed โ€” sometimes until a future change date. Useful, but not open source.

AI & data

Weights, data, and outputs

OpenRAIL, Llama, Gemma, Creative Commons. Here "the work" fragments into code, model weights, training data, and generated output โ€” each potentially under different terms and use-based restrictions.

Special focus

AI model & dataset licenses

AI licensing breaks the old assumption that "the license covers the code." A modern model release is really a bundle of distinct assets โ€” and the rights to each can differ. This is where most teams get exposed.

Weights โ‰  code. A repository can ship its training/inference code under Apache-2.0 while the model weights ship under a custom license (Llama, Gemma) or an OpenRAIL responsible-use license. Permission to run the code is not permission to use the weights commercially.
Data rights โ‰  model rights. A dataset under CC-BY-NC (non-commercial) does not automatically taint a model trained on it โ€” and conversely, a permissively licensed model may have been trained on data you have no rights to redistribute. Track the dataset license and the weights license separately; they answer different questions.
Use-based restrictions travel downstream. OpenRAIL, Llama, and Gemma all attach acceptable-use rules (no enumerated harms; Meta's MAU threshold; Google's Prohibited Use Policy) that you must pass on to everyone you distribute to. Stripping those terms breaks the license.
Output ownership is a fourth question. Who owns what the model generates, and whether you may use those outputs to train a competing model, is governed by the model's terms โ€” not by the weights' copyright status. Read the "outputs" clause explicitly.

Practical checklist for adopting any open-weight model: (1) identify the license on the weights, separately from the code; (2) confirm commercial use is allowed at your scale; (3) record the acceptable-use restrictions you must propagate; (4) check the dataset provenance and its license; (5) verify the terms on model outputs. A model that's "free to download" can still be unusable for your product.

Not sure what your stack is actually licensed to do?

Apex Vanguard runs license-readiness reviews โ€” we inventory every dependency, model, and dataset in your product, flag the copyleft, source-available, and AI use-restriction traps, and hand you a clear obligations map before you ship or raise. Pair it with Vanguard IP-Researcher to keep that inventory current as your dependencies change.

Book a license-readiness review โ†’ IP & license consulting ยท $400/hr ยท Vanguard IP-Researcher from $200/mo with a 30-day free trial