Psychologists, psycholinguists, and other researchers using language stimuli have struggled for more than 30 years with the problem of how to analyze experimental data that contain two crossed random effects (items and participants). The classical analysis of variance does not apply; alternatives have been proposed but have failed to catch on, and a statistically unsatisfactory procedure using two approximations (known as F1 and F2) has become the standard. A simple and elegant solution using mixed model analysis has been available for 15 years, and recent improvements in statistical software have made it widely accessible. The aim of this article is to increase the use of mixed models by giving a concise practical introduction and clear directions for undertaking the analysis in the most popular statistical packages. The article also introduces the djmixed add-on package for SPSS, which makes entering the models and reporting their results as straightforward as possible.
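The article itself targets SPSS (via djmixed) and other mainstream packages. Purely as a language-neutral illustration of the core idea, the sketch below fits a model with crossed random effects for participants and items in Python's statsmodels, on simulated data; the variable names and simulation settings are assumptions for illustration, not taken from the article. statsmodels expresses crossed effects by treating the whole dataset as a single group and declaring each random factor as a variance component.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Simulate a hypothetical fully crossed design: every participant
# responds to every item, with participant and item random effects.
rng = np.random.default_rng(0)
n_subj, n_item = 20, 15
subj = np.repeat(np.arange(n_subj), n_item)
item = np.tile(np.arange(n_item), n_subj)
cond = rng.integers(0, 2, size=n_subj * n_item)  # two conditions
rt = (600 + 40 * cond
      + rng.normal(0, 30, n_subj)[subj]      # participant effect
      + rng.normal(0, 20, n_item)[item]      # item effect
      + rng.normal(0, 50, n_subj * n_item))  # residual noise
df = pd.DataFrame({"rt": rt, "subject": subj, "item": item, "cond": cond})

# Crossed random effects: one single group spanning the data, with
# participants and items entered as variance components.
model = smf.mixedlm(
    "rt ~ cond",
    data=df,
    groups=np.ones(len(df)),  # single group -> fully crossed design
    vc_formula={"subject": "0 + C(subject)", "item": "0 + C(item)"},
    re_formula="0",           # no ordinary random intercept per group
)
print(model.fit().summary())
```

The fixed effect of `cond` is then tested against both sources of random variation at once, which is exactly what the separate F1 (by-participants) and F2 (by-items) analyses only approximate.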
With the proliferation of misinformation on the web, automatic misinformation detection methods are becoming an increasingly important subject of study. Large language models have produced the best results among content-based methods, which rely on the text of the article rather than on metadata or network features. However, fine-tuning such a model requires significant training data, which has led to the automatic creation of large-scale misinformation detection datasets. In these datasets, articles are not labelled directly. Rather, each news site is labelled for reliability by an established fact-checking organisation, and every article is then assigned the label corresponding to the reliability score of its source. A recent paper explored the biases present in one such dataset, NELA-GT-2018, and showed that models trained on it at least partly learn the stylistic and other features of individual news sources rather than the features of unreliable news. We confirm part of their findings. Beyond studying the characteristics and potential biases of the datasets, we also find it important to examine how model architecture influences the results. We therefore explore which text features, or combinations of features, are learned by models based on contextual word embeddings as opposed to basic bag-of-words models. To elucidate this, we perform extensive error analysis, aided by the SHAP post-hoc explanation technique, on a debiased portion of the dataset. We validate the explanation technique on our inherently interpretable baseline model.
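The abstract does not spell out its pipeline, but the validation step it describes, checking SHAP against an interpretable bag-of-words baseline, can be sketched as follows. The two-document corpus and labels are toy stand-ins for NELA-style (article text, source-level reliability label) pairs; this is a minimal sketch, not the paper's actual code.

```python
# Sketch: TF-IDF bag-of-words baseline whose SHAP explanations can be
# cross-checked against the model's own coefficients. Toy data only.
import shap
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression

texts = [
    "Officials confirmed the figures in a press briefing on Tuesday.",
    "SHOCKING cure THEY do not want you to know about!!!",
]
labels = [0, 1]  # 0 = reliable source, 1 = unreliable source

vec = TfidfVectorizer()
X = vec.fit_transform(texts).toarray()
clf = LogisticRegression().fit(X, labels)

# For a linear model, SHAP values reduce to coef * (x - E[x]), so the
# explanation is exact: a natural check before trusting SHAP on the
# contextual-embedding models.
explainer = shap.LinearExplainer(clf, X)
values = explainer.shap_values(X)

# Rank tokens by their contribution to the "unreliable" prediction
# for the second (unreliable-style) document.
tokens = vec.get_feature_names_out()
top = sorted(zip(tokens, values[1]), key=lambda t: -abs(t[1]))[:5]
print(top)
```

If the top-ranked tokens match the largest logistic-regression coefficients, the explanation technique behaves as expected on a model whose reasoning is fully known.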
Developing a framework that integrates advanced language models into the qualitative research process.

Qualitative research, vital for understanding complex phenomena, is often limited by labour-intensive data collection, transcription, and analysis. This hinders scalability, accessibility, and efficiency in both academic and industry contexts. As a result, insights are often delayed or incomplete, affecting decision-making, policy development, and innovation. The lack of tools to improve accuracy and reduce human error exacerbates these challenges, particularly for projects requiring large datasets or quick iterations. Addressing these inefficiencies through AI-driven solutions like AIDA can empower researchers, enhance outcomes, and make qualitative research more inclusive, impactful, and efficient.

The AIDA project enhances qualitative research by integrating AI technologies to streamline transcription, coding, and analysis. This enables researchers to analyse larger datasets with greater efficiency and accuracy, providing faster and more comprehensive insights. By reducing manual effort and human error, AIDA empowers organisations to make informed decisions and implement evidence-based policies more effectively. Its scalability supports diverse societal and industry applications, from healthcare to market research, fostering innovation and addressing complex challenges. Ultimately, AIDA contributes to improving research quality, accessibility, and societal relevance across multiple sectors.
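The abstract does not describe AIDA's implementation. Purely as a hypothetical illustration of what LLM-assisted qualitative coding can look like, the sketch below asks a chat model to suggest labels from a fixed codebook for an interview excerpt; the model name, prompt, and codebook are assumptions, not AIDA's design.

```python
# Hypothetical sketch of LLM-assisted qualitative coding; NOT the AIDA
# implementation. Assumes the OpenAI Python SDK; model and codebook are
# placeholder choices.
from openai import OpenAI

CODEBOOK = ["barrier", "facilitator", "emotion", "workaround"]

client = OpenAI()

def suggest_codes(excerpt: str) -> str:
    """Ask the model which codebook labels apply to an interview excerpt."""
    prompt = (
        "You support a qualitative researcher. From this codebook: "
        f"{', '.join(CODEBOOK)}, list the codes that apply to the excerpt "
        "below, with a one-line justification for each.\n\nExcerpt:\n"
        + excerpt
    )
    response = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content

print(suggest_codes("I stopped using the portal because logging in took ages."))
```

In any such setup the model's suggestions are a first pass: the researcher reviews and corrects them, which is where the claimed gains in speed without loss of rigour would have to be demonstrated.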
The accessibility and availability of ambulance care are under pressure. An important presenting complaint among people who call 112 is a brief loss of consciousness. When this loss of consciousness results from reduced blood flow to the brain, it is called syncope. Syncope can be benign or serious. Risk assessment and decision-making for patients with syncope in ambulance care are complex. Within a short time frame and under high pressure, ambulance professionals must weigh risks on the basis of extensive underlying information and uncertainty, and decide whether a patient should be taken to the emergency department. In two-thirds of the syncope patients taken to hospital, the condition turns out not to be serious. Two HAN research groups developed practical, evidence-based tools for practice (RAAK.PUB05.017 and RAAK.IMP.01.036), which have been part of the national protocol since July 2022. Following on from this, practitioners have asked the research groups to investigate whether digital and information technology, specifically generative artificial intelligence (AI) based on Large Language Models (LLMs), can further support them in assessing risks and making decisions for patients with syncope in ambulance care. This KIEM application is a proof-of-concept study. We investigate to what extent LLM-based generative AI can reliably analyse text records for important medical and environmental factors in patients with syncope. We opt for a pilot concurrent validation study using qualitative text analysis, combined with additional focus-group interviews to interpret the outcomes. For the pilot concurrent validation study we use text records from the Safe End study; the earlier analysis of these records serves as the gold standard, against which the validity of the LLM-based generative AI analysis is established. In the focus-group interviews we discuss the impact and ethical aspects of the findings for practice, science, education, and the further development of decision-support instruments.
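The proof-of-concept step and the concurrent validation against the Safe End gold standard could take roughly the following shape. This is a hypothetical sketch only: the model name, prompt, factor list, and example report are illustrative assumptions, and any real use would require anonymised data and ethical approval.

```python
# Hypothetical sketch: ask an LLM to flag predefined risk factors in an
# (anonymised) ambulance record, then score agreement with the earlier
# manual Safe End annotation as gold standard. All specifics are
# illustrative assumptions, not the study's actual protocol.
from openai import OpenAI
from sklearn.metrics import cohen_kappa_score

FACTORS = ["chest pain", "exertional onset", "family history of sudden death"]

client = OpenAI()

def extract_factors(report_text: str) -> list[int]:
    """Return a 0/1 vector: does the report mention each risk factor?"""
    prompt = (
        "For each factor below, answer 1 if the ambulance report mentions "
        "it and 0 otherwise. Reply with comma-separated 0/1 values only.\n"
        f"Factors: {', '.join(FACTORS)}\nReport:\n{report_text}"
    )
    reply = client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": prompt}],
    ).choices[0].message.content
    return [int(v.strip()) for v in reply.split(",")]

# Concurrent validity: agreement between LLM output and manual coding.
llm = extract_factors("Patient collapsed during exercise; no chest pain.")
gold = [0, 1, 0]  # hypothetical manual annotation for the same report
print("Cohen's kappa:", cohen_kappa_score(gold, llm))
```

Aggregating such agreement scores over all Safe End records would quantify how closely the automated extraction tracks the manual analysis, which is the validity question the pilot is designed to answer.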