The platform for open and practice-oriented research

product

Heuristic Coordination in Cooperative Multi-Agent Reinforcement Learning

Key to reinforcement learning in multi-agent systems is the ability to exploit the fact that agents directly influence only a small subset of the other agents. Such loose couplings are often modelled using a graphical model: a coordination graph. Finding an (approximately) optimal joint action for a given coordination graph is therefore a central subroutine in cooperative multi-agent reinforcement learning (MARL). Much research in MARL focuses on how to gradually update the parameters of the coordination graph, whilst leaving the solving of the coordination graph up to a known, typically exact and generic, subroutine. However, exact methods (e.g., Variable Elimination) do not scale well, and generic methods do not exploit the MARL setting of gradually updating a coordination graph and recomputing the joint action to select. In this paper, we examine what happens if we use a heuristic method, i.e., local search, to select joint actions in MARL, and whether we can use the outcome of this local search from a previous time-step to speed up and improve local search. We show empirically that by using local search, we can scale up to many agents and complex coordination graphs, and that by reusing joint actions from the previous time-step to initialise local search, we can improve both the quality of the joint actions found and the speed with which these joint actions are found.
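To make the idea of warm-started local search concrete, the following is a minimal sketch and not the method from the paper. It assumes a coordination graph given as pairwise payoff tables and uses plain greedy coordinate ascent; the names `joint_value`, `local_search`, `edges`, and `init` are hypothetical.

```python
import random

def joint_value(edges, action):
    """Sum the pairwise payoffs; edges maps (i, j) -> table indexed [a_i][a_j]."""
    return sum(table[action[i]][action[j]] for (i, j), table in edges.items())

def local_search(edges, n_agents, n_actions, init=None, rng=None):
    """Greedy coordinate ascent (an assumed, simple form of local search).

    Changes one agent's action at a time and keeps strict improvements.
    `init` lets the caller reuse the joint action from a previous time-step.
    Returns (joint action, value, number of sweeps over the agents).
    """
    rng = rng or random.Random(0)
    if init is not None:
        action = list(init)
    else:
        action = [rng.randrange(n_actions) for _ in range(n_agents)]
    best = joint_value(edges, action)
    sweeps = 0
    improved = True
    while improved:
        improved = False
        sweeps += 1
        for agent in range(n_agents):
            for candidate in range(n_actions):
                if candidate == action[agent]:
                    continue
                trial = action[:agent] + [candidate] + action[agent + 1:]
                value = joint_value(edges, trial)
                if value > best:  # strict improvement guarantees termination
                    action, best = trial, value
                    improved = True
    return action, best, sweeps

# Hypothetical 3-agent chain in which neighbours are rewarded for matching actions.
match = [[1, 0], [0, 1]]
edges = {(0, 1): match, (1, 2): match}
cold_action, cold_value, cold_sweeps = local_search(edges, 3, 2, init=[0, 1, 0])
warm_action, warm_value, warm_sweeps = local_search(edges, 3, 2, init=cold_action)
```

In this toy setting the warm-started call begins at a local optimum and needs only one confirming sweep, which illustrates why reusing the previous joint action can save work when the graph changes only gradually.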

LINK

product

Deep Reinforcement Learning Applied to a Robotic Pick-and-Place Application

Industrial robot manipulators are widely used for repetitive applications that require high precision, like pick-and-place. In many cases, the movements of industrial robot manipulators are hard-coded or manually defined, and need to be adjusted if the objects being manipulated change position. To increase flexibility, an industrial robot should be able to adjust its configuration in order to grasp objects in variable/unknown positions. This can be achieved by off-the-shelf vision-based solutions, but most require prior knowledge about each object to be manipulated. To address this issue, this work presents a ROS-based deep reinforcement learning solution to robotic grasping for a Collaborative Robot (Cobot) using a depth camera. The solution uses deep Q-learning to process the color and depth images and generate a greedy policy used to define the robot action. The Q-values are estimated using a Convolutional Neural Network (CNN) based on pre-trained models for feature extraction. Experiments were carried out in a simulated environment to compare the performance of four different pre-trained CNN models (ResNext, MobileNet, MNASNet and DenseNet). Results show that the best performance in our application was reached by MobileNet, with an average of 84% accuracy after training in a simulated environment.

PDF


product

Reinforcement Learning for Collaborative Robots Pick-and-Place Applications:

The number of applications in which industrial robots share their working environment with people is increasing. Robots appropriate for such applications are equipped with safety systems according to ISO/TS 15066:2016 and are often referred to as collaborative robots (cobots). Due to the nature of human-robot collaboration, the working environment of cobots is subject to unforeseeable modifications caused by people. Vision systems are often used to increase the adaptability of cobots, but they usually require knowledge of the objects to be manipulated. The application of machine learning techniques can increase the flexibility by enabling the control system of a cobot to continuously learn and adapt to unexpected changes in the working environment. In this paper we address this issue by investigating the use of Reinforcement Learning (RL) to control a cobot to perform pick-and-place tasks. We present the implementation of a control system that can adapt to changes in position and enables a cobot to grasp objects which were not part of the training. Our proposed system uses deep Q-learning to process color and depth images and generates an ε-greedy policy to define robot actions. The Q-values are estimated using Convolutional Neural Networks (CNNs) based on pre-trained models for feature extraction. To reduce training time, we implement a simulation environment to first train the RL agent, then we apply the resulting system on a real cobot. System performance is compared when using the pre-trained CNN models ResNext, DenseNet, MobileNet, and MNASNet. Simulation and experimental results validate the proposed approach and show that our system reaches a grasping success rate of 89.9% when manipulating a never-seen object operating with the pre-trained CNN model MobileNet.
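For readers unfamiliar with how a policy is derived from Q-values, here is a minimal sketch of ε-greedy action selection, the standard exploration rule paired with Q-learning. It is illustrative only: the function name `epsilon_greedy` and the flat list of Q-values are assumptions, and the CNN that would produce those Q-values from color and depth images is not shown.

```python
import random

def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the highest-valued one.

    q_values: one estimated Q-value per discrete action (hypothetical input,
    standing in for the output of a Q-network).
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))  # explore
    return max(range(len(q_values)), key=lambda a: q_values[a])  # exploit

# Hypothetical Q-values for four candidate grasp actions.
q = [0.1, 0.7, 0.3, 0.2]
greedy_action = epsilon_greedy(q, epsilon=0.0)
explore_action = epsilon_greedy(q, epsilon=1.0, rng=random.Random(0))
```

With `epsilon=0.0` the rule reduces to a purely greedy policy; during training, ε is commonly started high and decayed so that the agent explores early and exploits later.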

PDF

product

CURIOSITY-DRIVEN REINFORCEMENT LEARNING AGENT FOR MAPPING UNKNOWN INDOOR ENVIRONMENTS

Autonomously exploring and mapping is one of the open challenges of robotics and artificial intelligence. Especially when the environments are unknown, choosing the optimal navigation directive is not straightforward. In this paper, we propose a reinforcement learning framework for navigating, exploring, and mapping unknown environments. The reinforcement learning agent is in charge of selecting the commands for steering the mobile robot, while a SLAM algorithm estimates the robot pose and maps the environments. To select optimal actions, the agent is trained to be curious about the world. This concept translates into the introduction of a curiosity-driven reward function that encourages the agent to steer the mobile robot towards unknown and unseen areas of the world and the map. We test our approach in exploration challenges in different indoor environments. The agent trained with the proposed reward function outperforms the agents trained with reward functions commonly used in the literature for solving such tasks.
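The abstract does not give the form of the curiosity-driven reward, so the following is only a hypothetical sketch of one way a reward could favour unseen areas: it counts the occupancy-grid cells that change from unknown to known between two consecutive maps. The grid encoding, the `UNKNOWN` marker, and the function `curiosity_reward` are all assumptions for illustration.

```python
UNKNOWN = -1  # assumed marker for cells the mapper has not observed yet

def curiosity_reward(prev_map, new_map, scale=1.0):
    """Reward proportional to the number of cells newly observed this step.

    prev_map, new_map: 2D lists of equal shape holding UNKNOWN or an
    occupancy value (hypothetical encoding: 0 = free, 1 = occupied).
    """
    newly_seen = sum(
        1
        for prev_row, new_row in zip(prev_map, new_map)
        for before, after in zip(prev_row, new_row)
        if before == UNKNOWN and after != UNKNOWN
    )
    return scale * newly_seen

# Hypothetical 2x2 map before and after one robot action.
before = [[UNKNOWN, UNKNOWN], [0, UNKNOWN]]
after = [[0, UNKNOWN], [0, 1]]
reward = curiosity_reward(before, after)
```

A reward of this kind is zero whenever the robot revisits already-mapped space, which is the behaviour the abstract describes wanting to discourage.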

MULTIFILE

product

Human-Centered AI for Dementia Care: Using Reinforcement Learning for Personalized Interventions Support in Eating and Drinking Scenarios

For people with early dementia (PwD), it can be challenging to remember to eat and drink regularly and to maintain healthy, independent living. Existing intelligent home technologies primarily focus on activity recognition but lack adaptive support. This research addresses this gap by developing an AI system inspired by the Just-in-Time Adaptive Intervention (JITAI) concept. It adapts to individual behaviors and provides personalized interventions within the home environment, reminding and encouraging PwD to manage their eating and drinking routines. Considering the cognitive impairment of PwD, we design a human-centered AI system based on healthcare theories and caregivers’ insights. It employs reinforcement learning (RL) techniques to deliver personalized interventions. To avoid overwhelming interaction with PwD, we develop an RL-based simulation protocol. This allows us to evaluate different RL algorithms in various simulation scenarios, not only finding the most effective and efficient approach but also validating the robustness of our system before implementation in real-world human experiments. The simulation results demonstrate the promising potential of adaptive RL for building a human-centered AI system with perceived expressions of empathy to improve dementia care. To further evaluate the system, we plan to conduct real-world user studies.

PDF

product

Collective learning in schools described: building collective learning capacity

Processes of collective learning are expected to increase the professionalism of teachers and school leaders. Little is known about the processes of collective learning which take place in schools and about the way in which those processes may be improved. This paper describes research into processes of collective learning at three primary schools. It describes the processes of collective learning that took place in small teams in these schools, and points out what attempts can be made to reinforce these processes in those schools.

PDF


product

Preparing undergraduate students for lifelong learning

This thesis presents an exploration of ‘how entrepreneurship education pedagogy can enhance undergraduate business students’ autonomous motivation for self-directed learning’. It has twin, equally valuable, purposes: to make an original theoretical contribution and to improve professional practice in this area. The work addresses the lack of pedagogical research in entrepreneurship education (EE) that focuses on learner development, with a specific aim at the development of self-directed learning skills for lifelong learning. The research is approached with a concurrent, mixed methods design, comparing pre- and post-EE self-assessment survey results from 245 students, enrolled in a Young Enterprise venture creation programme, and a control group at a Dutch university. Using open-question surveys among the same population, during and after the EE modules, as well as focus group discussions with a selection of participating students and teachers, explanation was sought for the observations drawn from the quantitative study. Significant relationships were found between students’ self-reported maturity of autonomy, self-efficacy, and motivation for learning, and in how these relate to self-directed learning readiness. Entrepreneurship education was found to significantly moderate the relationship between the learning characteristics and self-directed learning, and to strengthen the students’ perceived readiness for self-directed learning. Explanations for the impact of EE were found to be related to the stage-wise, mixed pedagogy approach to learning, which combines authentic learning with a hierarchical approach to competence development, and supportive team dynamics. The research contributes to practice with a proposed conceptual framework for understanding how to prepare for self-directed learning readiness and a teaching-learning framework for its development in formal educational settings. It contributes to knowledge with its deeper understanding of how students experience learning in EE and how that affects their willingness to pursue learning opportunities.

MULTIFILE


product

ons of life's most precious gifts

Professional development of teacher educators is an important topic, because teacher educators need to maintain and enhance their expertise in order to educate our future teachers (Kools & Koster, n.d.; Dengerink, Lunenberg & Kools, 2015). How do teacher educators fulfil this task, especially within the hectic timeframe of everyday work? I asked four colleagues to participate in a group to share their experiences, actions or behaviour in the organisation about their development in their profession of being a teacher educator. My purpose is to bring awareness and movement into that group. My research focusses on teacher educators in a large teacher education department in the Netherlands and the opportunities for action available to them. During this study we are currently creating a learning environment in which mutual cooperation increases the learning potential of all participants. In this group participants take or make time to learn, giving words to their scopes. Researcher and participants discuss and explore on the basis of equality, reciprocity and mutual understanding. By deploying methods borrowed from ‘Appreciative Inquiry’ (Massenlink et al., 2008) the enthusiasm of a study group is raised and the intrinsic motivation of the participants stimulated. Our study group will convene three times. Its goal is to stimulate cooperation among teacher educators through optimisation of existing qualities, a method that could be described as empowerment, or a process of collective reinforcement. ‘To learn’ involves experiencing that what one does really matters, as well as developing one’s own persona in the local community. Intervention, action, reflection and study group meetings alternate in the course of our research. In addition to audio and video recordings, data consists of reports drawn up on the basis of member checks. Data is analysed qualitatively by coding the interview texts and reports. After applying the codes, the researcher discusses the coding in a research group and with the participants of the study group (member check). Working collaboratively can offer learning challenges that catalyse growth as a professional; teacher educators become acquainted and approach each other from the perspective of their respective professional and functional responsibilities. This study offers perspectives for other teacher educators to recognize these possibilities in their own situation. Moreover, the study offers a description of a way to organise collegial exchange. The research is related to the RDC professional development of teacher educators.

PDF

product

Low Dimensional State Representation Learning with Robotics Priors in Continuous Action Spaces

Autonomous robots require high degrees of cognitive and motoric intelligence to come into our everyday life. In non-structured environments and in the presence of uncertainties, such degrees of intelligence are not easy to obtain. Reinforcement learning algorithms have proven to be capable of solving complicated robotics tasks in an end-to-end fashion without any need for hand-crafted features or policies. Especially in the context of robotics, in which the cost of real-world data is usually extremely high, reinforcement learning solutions achieving high sample efficiency are needed. In this paper, we propose a framework combining the learning of a low-dimensional state representation, from high-dimensional observations coming from the robot's raw sensory readings, with the learning of the optimal policy, given the learned state representation. We evaluate our framework in the context of mobile robot navigation in the case of continuous state and action spaces. Moreover, we study the problem of transferring what is learned in the simulated virtual environment to the real robot, without further retraining using real-world data, in the presence of visual and depth distractors, such as lighting changes and moving obstacles.

MULTIFILE

Search results

Products 1.114


People 2

Corné Dijkmans

Henry Maathuis
