This standard defines and provides criteria to measure the capabilities of foundation models. The standard focuses on measurable and objective…
IEEE The Institute of Electrical and Electronics Engineers Inc.
This standard addresses the evaluation of safety, explainability, and stability of algorithms implementing autonomous driving levels, as defined by SAE…
This standard defines a comprehensive framework for federated machine learning of semantic information agents. It targets two primary layers: …
This standard provides: 1. Agent Components: The building blocks and architectural elements that constitute an educational Large Language Model (LLM)…
This standard addresses Artificial Intelligence (AI) risks such as malicious use, AI race, organizational risks, and rogue AIs. The standard…
This recommended practice provides a data processing framework for training large language models, refining the relevant terms and definitions. The…
This recommended practice provides a comprehensive framework for understanding, defining, and evaluating AI risks, AI safety, AI trustworthiness, and AI…
For the software engineering life cycle empowered by Generative Pre-trained Transformer (GPT), this recommended practice specifies: a description and…
IEEE P3419
Standard for Large Language Model Evaluation
This standard establishes a comprehensive set of criteria for the evaluation of Large Language Models (LLMs) and extends to multimodal…
This standard specifies the basic functions, performance requirements, software ecosystem, and application scenarios of the deep learning chip for the…
This recommended practice specifies a framework and evaluation methods for audio-driven portraits based on artificial intelligence. In this recommended practice, the…
The standard specifies a type of compilation interface and its intermediate representation used for deep learning model computation tasks. This…