Explainable Artificial Intelligence (XAI) aims to provide insight into the inner workings and the outputs of AI systems. Recently, there has been growing recognition that explainability is inherently human-centric, tied to how people perceive explanations. Despite this, there is no consensus in the research community on whether user evaluation is crucial in XAI and, if so, what exactly needs to be evaluated and how. This systematic literature review addresses this gap by providing a detailed overview of the current state of human-centered XAI evaluation. We reviewed 73 papers across various domains in which XAI was evaluated with users. These studies assessed what makes an explanation “good” from a user’s perspective, i.e., what makes an explanation meaningful to a user of an AI system. We identified 30 components of meaningful explanations that were evaluated in the reviewed papers and categorized them into a taxonomy of human-centered XAI evaluation, based on: (a) the contextualized quality of the explanation, (b) the contribution of the explanation to human-AI interaction, and (c) the contribution of the explanation to human-AI performance. Our analysis also revealed a lack of standardization in the methodologies applied in XAI user studies: only 19 of the 73 papers applied an evaluation framework used by at least one other study in the sample. These inconsistencies hinder cross-study comparison and broader insight. Our findings contribute to understanding what makes explanations meaningful to users and how to measure this, guiding the XAI community toward a more unified approach to human-centered explainability.
From the article: The ethics guidelines put forward by the AI High Level Expert Group (AI-HLEG) present a list of seven key requirements that human-centered, trustworthy AI systems should meet. These guidelines are useful for the evaluation of AI systems but can be complemented by applied methods and tools for developing trustworthy AI systems in practice. In this position paper we propose a framework for translating the AI-HLEG ethics guidelines into the specific context within which an AI system operates. This approach aligns well with a set of Agile principles commonly employed in software engineering. http://ceur-ws.org/Vol-2659/
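The paper itself does not prescribe an implementation, but a rough illustration of the idea is sketched below: the seven AI-HLEG requirements are recorded as context-specific, testable backlog items that can be revisited each Agile iteration. The Python data structure, field names, and example entry are hypothetical assumptions for illustration only, not the framework proposed in the paper.

```python
# Hypothetical sketch: AI-HLEG requirements as contextualized, testable backlog items.
from dataclasses import dataclass

# The seven key requirements from the AI-HLEG Ethics Guidelines for Trustworthy AI.
AI_HLEG_REQUIREMENTS = [
    "Human agency and oversight",
    "Technical robustness and safety",
    "Privacy and data governance",
    "Transparency",
    "Diversity, non-discrimination and fairness",
    "Societal and environmental well-being",
    "Accountability",
]

@dataclass
class ContextualizedRequirement:
    requirement: str           # one of the seven AI-HLEG requirements
    system_context: str        # the concrete setting in which the AI system operates
    acceptance_criterion: str  # a verifiable, sprint-level check
    satisfied: bool = False

# Illustrative backlog entry for a hypothetical system.
backlog = [
    ContextualizedRequirement(
        requirement="Transparency",
        system_context="triage model used by medical staff",
        acceptance_criterion="every recommendation ships with an explanation",
    ),
]

# Items left unsatisfied can be reviewed at the end of each iteration.
open_items = [item for item in backlog if not item.satisfied]
print(f"{len(open_items)} trustworthy-AI requirement(s) still open")
```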
Because of both the shortcomings of existing risk assessment methodologies and the newly available tools to predict hazard and risk with machine learning approaches, there has been an emerging emphasis on probabilistic risk assessment. Increasingly sophisticated AI models can be applied to a plethora of exposure and hazard data to obtain not only predictions for particular endpoints but also estimates of the uncertainty of the risk assessment outcome. This provides the basis for a shift from deterministic to more probabilistic approaches, but it comes at the cost of an increased complexity of the process, as it requires more resources and human expertise. There are still challenges to overcome before a probabilistic paradigm is fully embraced by regulators. Based on an earlier white paper (Maertens et al., 2022), a workshop discussed the prospects, challenges, and path forward for implementing such AI-based probabilistic hazard assessment. Moving forward, we will see the transition from categorical to probabilistic and dose-dependent hazard outcomes, the application of internal thresholds of toxicological concern for data-poor substances, the acknowledgement of user-friendly open-source software, a rise in the expertise of toxicologists required to understand and interpret artificial intelligence models, and the honest communication of uncertainty in risk assessment to the public.
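As a rough illustration of the kind of AI-based probabilistic output discussed above, the sketch below trains an ensemble model on synthetic descriptor data and reports, per substance, a hazard probability together with a crude uncertainty proxy (disagreement among the ensemble's members). The random-forest choice, the synthetic data, and the uncertainty proxy are all illustrative assumptions, not the method prescribed by the workshop.

```python
# Minimal sketch: probabilistic hazard prediction with an uncertainty proxy.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic stand-in for chemical exposure/hazard descriptors and a binary endpoint.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 8))
y = (X[:, 0] + 0.5 * X[:, 1] + rng.normal(scale=0.5, size=500) > 0).astype(int)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
model = RandomForestClassifier(n_estimators=200, random_state=0)
model.fit(X_train, y_train)

# Probabilistic output: a hazard probability rather than a hard category.
p_hazard = model.predict_proba(X_test)[:, 1]

# Crude uncertainty proxy: disagreement among the ensemble's individual trees.
per_tree = np.stack([tree.predict(X_test) for tree in model.estimators_])
uncertainty = per_tree.std(axis=0)

for p, u in list(zip(p_hazard, uncertainty))[:5]:
    print(f"P(hazard) = {p:.2f}, tree disagreement = {u:.2f}")
```

Communicating the probability and its uncertainty together, rather than a single categorized outcome, is the shift from deterministic to probabilistic assessment that the abstract describes.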