Definition

model card in machine learning

Stephen J. Bigelow

By

Stephen J. Bigelow, Senior Technology Editor

Published: Mar 25, 2024

What is a model card in machine learning?

A model card is a type of documentation that is created for, and provided with, machine learning models. A model card functions as a type of data sheet, similar in principle to the consumer safety labels, food nutritional labels, a material safety data sheet or product spec sheets.

There has been a dramatic rise in the development and adoption of machine learning (ML) and artificial intelligence (AI) during recent years. Further advances in generative AI employ large language models (LLMs) as a core component. However, the many models used in those platforms are increasingly complex and difficult to understand. Even model developers sometimes struggle to fully understand and describe the ways a given model behaves. This complexity has created serious questions about core business values such as transparency, ethics and accountability. Common questions include the following:

What is a particular ML model used for?
How does a particular ML model work or perform?
What training data was used to train the model (and what were the results)?

First proposed by Google in 2018, the model card is a means of documenting vital elements of a ML model so users -- including AI designers, business leaders and ML end users -- can readily understand the intended use cases, characteristics, behaviors, ethical considerations, and the biases and limitations of a particular ML model.

As late as 2024, there are no current legislative or regulatory requirements to produce or provide model card documentation with ML models. Similarly, there are no currently established standards in model card format or content. However, major ML developers have spearheaded the adoption of model card documentation as a way of demonstrating responsible AI development, and adopters can find model cards for major platforms such as Meta Llama, Google face detection and OpenAI GPT-3.

Benefits of using model cards for AI, ML and LLM projects

The rise of ML and AI is driving the need for transparency and responsible governance. Businesses must understand what ML models are for, how they work, how they compare to other competitive models, how they're trained and their suitability for intended tasks.

Model cards are a tool that can address such concerns which readily impact governance and regulatory issues for the business. Model cards can provide a range of important benefits to ML and AI projects including the following:

Model selection. By adopting a consistent standard for format and content, model cards can help ML and AI developers evaluate and select the most appropriate ML model for the project. Model cards can help whittle down an initial list of ML candidates, focus testing and evaluation on the most appropriate models, and accelerate final selection of the best ML model for the task at hand.
Model behavior and performance. ML models are not perfect, and limitations in the training data sets or biases in the training methodologies can impact the model's behavior when dealing with real-world data. Model cards typically include known limitations in data and training, allowing ML model users to see and address such limitations through additional training and data. This can help the business reduce errors and biases in the ML model and improve AI project performance and outcomes.
Continued AI improvements. The details and information provided by ML model cards help businesses select the most appropriate models and allow ML model developers to compare their models to others to make platforms more capable and competitive. This leads to more innovation and improvements to ML models and the resulting ML and AI platforms that use those models.
Business governance. The old axiom "you can't manage what you can't see" is entirely appropriate for modern ML models. A business that employs opaque or incomprehensible ML models cannot understand or explain the behavior of their developing ML or AI project-- risking serious business continuance, business governance and regulatory compliance violations. ML model cards are a means for the business to demonstrate an understanding of the underlying ML models that they select and use.
Business transparency and ethics. ML and AI technologies have an impact on society -- both in the collected and used data and in the ways that data is processed to make decisions. The documentation and details provided in ML model cards can help a business manage responsible AI development and address social concerns regarding the organization's use of AI and underlying data.

7 key sections of a model card

Labels and other informational summaries are generally most effective when they allow comparing similar products side by side using comparable content and formats. However, the information presented on an ML model card can vary. Unlike highly regulated informational displays -- such as food nutritional labeling -- there are no current standards to govern the information or formatting included on ML model cards.

ML models can vary dramatically in their scope, purpose and capabilities, so this makes it hard to regulate. For example, an ML model developed to aid in medical diagnosis can be distinctly different than an ML model created to run analytics on retail sales operations, or a complex LLM used in an AI construct. Consequently, ML model developers largely use their own discretion to determine what information to include, and how that information should be presented. Yet, as leading technology firms develop ML/AI platforms and document those offerings through model cards, some de facto documentation standards are taking shape. Model cards should include the following:

1. Basic details

This first section of a model card is typically the introduction to the model which can outline the model's essential details including the model's name, version, revision list, a brief general description of the model, business or developer details and contact information, and licensing details or limits.

2. Use case details

This section describes the intended uses, use cases and users for the model. For example, a section on use cases may describe uses in object detection, facial detection or medical diagnoses. This section may also include caveats, use limitations or uses deemed out of scope. For example, a model intended for object detection may detail input from photos or video; output including detection of a specified number of object classes; and other output data such as object bounding box coordinates, knowledge graph ID, object description and confidence score.

3. Architectural details

This section describes the overall design of the model and any underlying hardware back end that runs the model and hosts related data. Readers can refer to the model card to understand the design elements or underlying technologies that make the model work. For the object detection model example, the model card may describe an architecture including a single image detector model with a Resnet 101 backbone and a feature pyramid network feature map.

4. Training details

This section outlines, describes or summarizes the data used in model training; where and when the data was obtained; and any statistical distribution of key factors in the data which may allow for inadvertent bias. Since training data may be proprietary to the model's developers, training details may be deliberately limited or protected by a separate confidentiality agreement. Training details may also describe training methodologies employed with the model.

5. Performance details

This section outlines details related to the model's performance measured against a test data set, not a training data set, as well as details about the test data set itself. For the object detection model example, performance metrics included on the model card may note the use of both Google's internal image data set as well as an open source image set as test data and the number of object classes the model can detect in each data set. Additionally, performance details may outline reported metrics including the precision and accuracy of the object detection. More sophisticated models may utilize other detailed metrics to measure performance.

6. Limitation details

A key segment of any model card is the section describing limitations, possible biases or variable factors that might affect the model's performance or output. For the object detection model example, known limitations may include factors such as object size, clutter, lighting, blur, resolution and object type since the model can't recognize everything.

7. Business details

This final segment of a model card is often dedicated to business-related details including information about the model's developers, detailed contact, support and licensing information, fairness/privacy and usage information, suggestions for model monitoring, any relevant assessment of impacts to individuals or society, and other ethical or potential legal concerns related to the model's usage.

Examples of model cards

As leading technology organizations build ML and AI platforms, their work on model cards and other documentation has provided a standard for other ML firms to follow. Today there are many examples of ML model cards to review including the following major examples:

Google's face detection model card.
Meta's Llama model card.
OpenAI's GPT-3 model card.

There are also more standardized tools for model card creation, as well as model card repositories, such as these examples:

GitHub hosts a template for creating ML model cards.
Google offers a model card toolkit available through GitHub.
Hugging Face offers a tool to generate model cards.

Both GitHub and Hugging Face provide a repository of model cards which are available for review and study, offering model card examples across many different model types, purposes and industry segments.

Continue Reading About model card in machine learning

The best large language models

AI regulation: What businesses need to know

How do LLMs like ChatGPT work?

Generative AI ethics: The biggest concerns and risks

Compare large language models vs. generative AI

Search Networking

What is multi-access edge computing? Benefits and use cases
Multi-access edge computing (MEC) is a network architecture concept that brings cloud computing capabilities and IT services ...
What is 5G?
Fifth-generation wireless or 5G is a global standard and technology for wireless and telecommunications networks.
What is a small cell in wireless networks?
A small cell is a type of low-power cellular radio access point or base station that provides wireless service within a limited ...

Search Security

What is identity and access management? Guide to IAM
No longer just a good idea, IAM is a crucial piece of the cybersecurity puzzle. It's how an organization regulates access to ...
What is data masking?
Data masking is a security technique that modifies sensitive data in a data set so it can be used safely in a non-production ...
What is antivirus software?
Antivirus software (antivirus program) is a security program designed to prevent, detect, search and remove viruses and other ...

Search CIO

What is a chief data officer (CDO)?
A chief data officer (CDO) in many organizations is a C-level executive whose position has evolved into a range of strategic data...
What is user-generated content?
User-generated content (UGC) is published information that an unpaid contributor provides to a website.
What is business process outsourcing (BPO)?
Business process outsourcing (BPO) is a business practice in which an organization contracts with an external service provider to...

Search HRSoftware

What is compensation management?
Compensation management is the discipline and process for determining employees' appropriate pay, incentives, rewards, bonuses ...
What is HR technology (human resources tech)?
HR technology (human resources tech) refers to the hardware and software that support an organization's human resource management...
What is core HR (core human resources)?
Core HR (core human resources) is an umbrella term that refers to the essential, mandatory and fundamental tasks and functions of...

Search Customer Experience

What are virtual agents and how are they being used?
A virtual agent is an AI-powered software application or service that interacts with humans or other digital systems in a ...
Customer acquisition cost (CAC): How to calculate and reduce it
Customer acquisition cost (CAC) is the cost associated with convincing a consumer to buy your product or service, including ...
What is direct marketing?
Direct marketing is a type of advertising campaign that seeks to elicit an action (such as an order, a visit to a store or ...

Close