IEEE 3128-2025

Recommended Practice for The Evaluation of Artificial Intelligence (AI) Dialogue System Capabilities

Last updated: 18 Jul 2024

Development Stage

Pre-draft

Draft

Published

9 Nov 2021

31 Jul 2024

12 Feb 2025

published

Scope

This recommended practice establishes an evaluation framework for the capabilities of artificial intelligence dialogue systems such as chatbots, consulting terminals, or operation interfaces. The recommended practice defines and classifies the types and levels of the intelligence capabilities according to a checklist of criteria. The checklist tables describe the criteria used to determine the level that a dialogue system achieves based on the analysis of behavior and performance. ©IEEE 2022. All rights reserved.

Purpose

This recommended practice provides a framework for the evaluation of AI applications in dialogue systems, where both industry and academia can achieve consensus on professional terms and definitions, and share a common understanding on test methods as well. This recommended practice can also be used as the test guide for AI dialogue systems in smart manufacturing and smart city construction. ©IEEE 2022. All rights reserved.

External Links

More information

BSI webpage

Let the community know

Categorisation

Domain: Horizontal

Scope: AI-specific

Topic: Accuracy and performance, System quality

Application: NLP - Human-machine conversation

Type: Measurement and test methods

Key Information

Organisation: IEEE

Share on X Share on LinkedIn

Discussion Forum

Author

Posts
31 January 2023 at 3:46 pm
Up
0
::
- Report
AI Standards Hub

Share your thoughts on this standard with the AI Standards Hub community here.
Author

Posts

You must be logged in to contribute to the discussion

Login

Content Type

Recommended Practice for The Evaluation of Artificial Intelligence (AI) Dialogue System Capabilities

Development Stage

Pre-draft

Draft

Published

Scope

Purpose

External Links

Let the community know

Categorisation

Key Information

Discussion Forum

You must be logged in to contribute to the discussion

Report abuse

Report submitted

Provide feedback on the site

Feedback submitted

Submit a missing item

Feedback submitted