Huntington’s disease (HD) and various spinocerebellar ataxias (SCA) are autosomal dominantly inherited neurodegenerative disorders caused by a CAG repeat expansion in the disease-related gene1. The impact of HD and SCA on families and individuals is enormous and far reaching, as patients typically display first symptoms during midlife. HD is characterized by unwanted choreatic movements, behavioral and psychiatric disturbances and dementia. SCAs are mainly characterized by ataxia but also other symptoms including cognitive deficits, similarly affecting quality of life and leading to disability. These problems worsen as the disease progresses and affected individuals are no longer able to work, drive, or care for themselves. It places an enormous burden on their family and caregivers, and patients will require intensive nursing home care when disease progresses, and lifespan is reduced. Although the clinical and pathological phenotypes are distinct for each CAG repeat expansion disorder, it is thought that similar molecular mechanisms underlie the effect of expanded CAG repeats in different genes.
The predicted Age of Onset (AO) for both HD, SCA1 and SCA3 (and 5 other CAG-repeat diseases) is based on the polyQ expansion, but the CAG/polyQ determines the AO only for 50% (see figure below). A large variety on AO is observed, especially for the most common range between 40 and 50 repeats11,12. Large differences in onset, especially in the range 40-50 CAGs not only imply that current individual predictions for AO are imprecise (affecting important life decisions that patients need to make and also hampering assessment of potential onset-delaying intervention) but also do offer optimism that (patient-related) factors exist that can delay the onset of disease.
To address both items, we need to generate a better model, based on patient-derived cells that generates parameters that not only mirror the CAG-repeat length dependency of these diseases, but that also better predicts inter-patient variations in disease susceptibility and effectiveness of interventions. Hereto, we will use a staggered project design as explained in 5.1, in which we first will determine which cellular and molecular determinants (referred to as landscapes) in isogenic iPSC models are associated with increased CAG repeat lengths using deep-learning algorithms (DLA) (WP1). Hereto, we will use a well characterized control cell line in which we modify the CAG repeat length in the endogenous ataxin-1, Ataxin-3 and Huntingtin gene from wildtype Q repeats to intermediate to adult onset and juvenile polyQ repeats. We will next expand the model with cells from the 3 (SCA1, SCA3, and HD) existing and new cohorts of early-onset, adult-onset and late-onset/intermediate repeat patients for which, besides accurate AO information, also clinical parameters (MRI scans, liquor markers etc) will be (made) available. This will be used for validation and to fine-tune the molecular landscapes (again using DLA) towards the best prediction of individual patient related clinical markers and AO (WP3). The same models and (most relevant) landscapes will also be used for evaluations of novel mutant protein lowering strategies as will emerge from WP4.
This overall development process of landscape prediction is an iterative process that involves (a) data processing (WP5) (b) unsupervised data exploration and dimensionality reduction to find patterns in data and create “labels” for similarity and (c) development of data supervised Deep Learning (DL) models for landscape prediction based on the labels from previous step. Each iteration starts with data that is generated and deployed according to FAIR principles, and the developed deep learning system will be instrumental to connect these WPs. Insights in algorithm sensitivity from the predictive models will form the basis for discussion with field experts on the distinction and phenotypic consequences. While full development of accurate diagnostics might go beyond the timespan of the 5 year project, ideally our final landscapes can be used for new genetic counselling: when somebody is positive for the gene, can we use his/her cells, feed it into the generated cell-based model and better predict the AO and severity? While this will answer questions from clinicians and patient communities, it will also generate new ones, which is why we will study the ethical implications of such improved diagnostics in advance (WP6).
The search for existing non-animal alternative methods for use in experiments is currently challenging because of the lack of both comprehensive structured databases and balanced keyword-based search strategies to mine unstructured textual databases. In this paper we describe 3Ranker, which is a fast, keyword-independent algorithm for finding non-animal alternative methods for use in biomedical research. The 3Ranker algorithm was created by using a machine learning approach, consisting of a Random Forest model built on a dataset of 35 million abstracts and constructed with weak supervision, followed by iterative model improvement with expert curated data. We found a satisfactory trade-off between sensitivity and specificity, with Area Under the Curve (AUC) values ranging from 0.85-0.95. Trials showed that the AI-based classifier was able to identify articles that describe potential alternatives to animal use, among the thousands of articles returned by generic PubMed queries on dermatitis and Parkinson's disease. Application of the classification models on time series data showed the earlier implementation and acceptance of Three Rs principles in the area of cosmetics and skin research, as compared to the area of neurodegenerative disease research. The 3Ranker algorithm is freely available at www.open3r.org; the future goal is to expand this framework to cover multiple research domains and to enable its broad use by researchers, policymakers, funders and ethical review boards, in order to promote the replacement of animal use in research wherever possible.
Objectives: Pulmonary hypertension is one of the leading causes of death in systemic sclerosis. Early detection and treatment of pulmonary hypertension in systemic sclerosis is crucial. Nailfold capillaroscopy microscopy, vascular autoantibodies AT1R and ETAR, and several candidate-biomarkers have the potential to serve as noninvasive tools to identify systemic sclerosis patients at risk for developing pulmonary hypertension. Here, we explore the classifying potential of nailfold capillaroscopy microscopy characteristics and serum levels of selected candidate-biomarkers in a sample of systemic sclerosis patients with and without different forms of pulmonary hypertension.Methods: A total of 81 consecutive systemic sclerosis patients were included, 40 with systemic sclerosis pulmonary hypertension and 41 with no pulmonary hypertension. In each group, quantitative and qualitative nailfold capillaroscopy microscopy characteristics, vascular autoantibodies AT1R and ETAR, and serum levels of 24 soluble serum factors were determined. For evaluation of the nailfold capillaroscopy microscopy characteristics, linear regression analysis accounting for age, sex, and diffusing capacity of the lungs for carbon monoxide percentage predicted was used. Autoantibodies and soluble serum factor levels were compared using two-sample t test with equal variances.Results: No statistically significant differences were observed in quantitative or qualitative nailfold capillaroscopy microscopy characteristics, or vascular autoantibody ETAR and AT1R titer between systemic sclerosis-pulmonary hypertension and systemic sclerosis-no pulmonary hypertension. In contrast, several serum levels of soluble factors differed between groups: Endostatin, sVCAM, and VEGFD were increased, and CXCL4, sVEGFR2, and PDGF-AB/BB were decreased in systemic sclerosis-pulmonary hypertension. Random forest classification identified Endostatin and CXCL4 as the most predictive classifiers to distinguish systemic sclerosispulmonary hypertension from systemic sclerosis-no pulmonary hypertension.Conclusion: This study shows the potential for several soluble serum factors to distinguish systemic sclerosis-pulmonary hypertension from systemic sclerosis-no pulmonary hypertension. We found no classifying potential for qualitative or quantitative nailfold capillaroscopy microscopy characteristics, or vascular autoantibodies.
Ongoing
Not known