Diverticular disease: picking pockets and population biobanks

Rinse K Weersma; Miles Parkes

doi:10.1136/gutjnl-2019-318231

Article Text

PDF

XML

Commentary

Diverticular disease: picking pockets and population biobanks

Rinse K Weersma1,
Miles Parkes2

¹ Department of Gastroenterology and Hepatology, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands
² Gastroenterology Unit, Addenbrookes Hospital, Cambridge, UK

Correspondence to Dr Rinse K Weersma, Department of Gastroenterology and Hepatology, University of Groningen and University Medical Center Groningen, Groningen 9700RB, The Netherlands; r.k.weersma{at}umcg.nl

https://doi.org/10.1136/gutjnl-2019-318231

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

The last decade has seen an explosion in genome-wide association studies (GWAS) in many diseases. With these—and its worth reminding ourselves that this is the primary goal of GWAS—have come valuable insights into pathogenic pathways. Some have been predictable (eg, variants affecting antigen presentation, co-stimulation and T-cell responses in immune-mediated diseases) and others, unexpected (eg, autophagy in Crohn’s disease). Until recently, such studies were performed in disease cohorts specifically ascertained for the purpose, with clear evidence that the larger the cohort the more loci will be found.1

Such disease-specific cohorts with well-defined phenotypes can be assembled where there are engaged clinical communities, but may be tricky to ascertain for more general traits. Into this category falls diverticular disease. Diverticulosis affects up to 50% of the population but was poorly studied at a genetic level until recently, despite clear evidence for its relatively high heritability.2 3 In Gut, Schafmayer et al report on a large-scale analysis using clinical and genetic data from the UK Biobank. This population cohort totals 500 000 individuals, among whom ~32 000 have a recorded diagnosis of diverticular disease according to the international classification of disease (ICD) 9/10 coding. The authors went on to replicate their findings in a hospital-based case–control cohort.4 Interestingly, a similar approach was recently deployed by Maguire et al, using the same biobank ‘discovery’ dataset and a separate independent hospital-based registry as replication cohort.5 The earlier study identified 39 susceptibility loci for diverticular disease and showed that candidate genes reside in plausible biological pathways involved in cell–cell adhesion, membrane transport signalling and intestinal motility.

In this paper by Schafmayer et al, 48 susceptibility loci were identified for diverticular disease at a genome-wide significant statistical threshold, with consistent directionality in the discovery and replication cohorts. Out of these, 12 regions were novel compared with the previous publications. GWAS loci usually harbour multiple genes and it is frequently unclear as to which gene is actually causal. The authors, therefore, performed a series of downstream in silico analyses to prioritise candidate genes within each locus. To the extent that these approaches have refined the associations seen, the authors highlighted one gene per locus, identified the fact that many of their signals map to introns (non-coding inserts in genes), and analysed layer-specific mRNA expression and fluorescence immunohistochemical staining in colonic biopsies. Of note, it appears that some degree of manual curation was used in the fine mapping routines, and with this the concern that some bias regarding likely causal genes might have bled into the final results. Nevertheless, novel insights into the pathophysiology of diverticular disease are derived, with the suggestion that it is a disorder of intestinal neuromuscular function, mesenteric vascular smooth muscle function and connective fibre support. These, therefore, overlap at least to a significant extent with the conclusions of Maguire et al. At this stage, these mechanisms must be viewed as hypotheses to be tested. Confirmation requires more detailed genetic mapping, ascertainment of correlation between the associated genetic variants and gene expression in relevant cell types, and the interrogation of their functional impact.

Now, what to think of the methodology deployed in the current study? Inevitably, there are some trade-offs when using a population cohort as opposed to a clinical cohort, not least in the definition of the disease under study. Schafmayer et al used ICD10 coding (K57) to define the case group, which includes both diverticulitis and diverticulosis. Anyone familiar with coding in the clinical setting will be aware of its potential inaccuracies, not least for a condition such as diverticulosis, where many affected individuals are asymptomatic and others may be diagnosed with the condition in the absence of objective evidence and where the true diagnosis is, for example, irritable bowel syndrome. Further, given the high prevalence of diverticular disease in the population, and its increase with age, one might suppose that mis-specification for ‘case’ and ‘control’ status (the latter including many ‘not yet affected’) would be an issue. It seems that Schafmayer et al used a rather more inclusive definition than the study of Maguire et al, and with the larger sample identified more loci with a genome-wide significance. As has been well recognised previously, in genetic studies of common diseases, cohort size and statistical power really count, even if they come with some loss of phenotyping accuracy.

While the potential diagnostic imprecision inherent in a population database such as the UK Biobank might be viewed as a weakness, the very fact that the cohort is broadly representative of the whole population rather than specifically collected disease cohorts itself provides an important ‘real world’ context for genetic findings. There is a large danger based on the published literature that clinicians view genetic results as more powerful and of greater deterministic value than they actually are. A recent publication highlights this by showing that the penetrance of causal mutations for many monogenic conditions has probably been substantially over-estimated due to its derivation from studies based on tertiary referral cohorts.6

The UK Biobank is a prospective cohort study on ~500 000 population-based individuals.7 For each participant, a large set of phenotypes (including ICD codes as used in the current study), health-related measurements, diet and lifestyle data, and biomarkers are available. There are ambitious plans to link them to primary healthcare records and prescribing data, as well as possibly collecting stool samples to complement the DNA and data already collected. GWAS has been performed on nearly the whole cohort using an array with more than 750 000 genetic markers. This unprecedented open access database has been made available to the research community, and—given the size of the cohort—represents a rich resource for genetic studies of common diseases. For many gastro-intestinal (GI) disorders, the number of affected individuals within the UK Biobank is substantial. For example, more than 18 000 are recorded to have cholelithiasis, which is more than double the number of individuals included in the largest GWAS meta-analysis on gallstone disease to date.8 Similar number of individuals have, for example, gastro-oesophageal reflux disease, colorectal polyps or irritable bowel syndrome. For each of these conditions, there is adequate statistical power to detect genetic associations with high confidence, as nicely shown by the two studies on diverticular disease. Furthermore, other biobanks are also coming on-stream. In the northern part of the Netherlands, ‘Lifelines’, a three generations longitudinal cohort study of 165 000 participants, will have genetic data available soon, allowing for similar analyses or joint analyses with the UK Biobank results.9

These resources provide a fantastic chance for researchers to identify genetic risk factors, pathogenetic mechanisms and potential drug targets across the range of GI disorders, as was done for diverticular disease. To date, out of the more than 1000 genetic studies approved by UK Biobank, only a limited number relate to GI disease (see https://www.ukbiobank.ac.uk/approved-research). This is not good enough. Our research community needs to wake up. This is a fantastic opportunity. We should seize it!

References

↵
2. Parkes M ,
3. Cortes A ,
4. van Heel DA , et al
. Genetic insights into common pathways and complex relationships among immune-mediated diseases. Nat Rev Genet 2013;14:661–73.doi:10.1038/nrg3502
OpenUrl CrossRef PubMed
↵
2. Sigurdsson S ,
3. Alexandersson KF ,
4. Sulem P , et al
. Sequence variants in ARHGAP15, COLQ and FAM155A associate with diverticular disease and diverticulitis. Nat Commun 2017;8:15789.doi:10.1038/ncomms15789
OpenUrl
↵
2. Granlund J ,
3. Svensson T ,
4. Olén O , et al
. The genetic influence on diverticular disease--a twin study. Aliment Pharmacol Ther 2012;35:1103–7.doi:10.1111/j.1365-2036.2012.05069.x
OpenUrl CrossRef PubMed
↵
2. Schafmayer C ,
3. Harrison JW ,
4. Buch S , et al
. Genome-wide association analysis of diverticular disease points towards neuromuscular, connective tissue and epithelial pathomechanisms. Gut 2019;68:854–65.doi:10.1136/gutjnl-2018-317619
OpenUrl Abstract/FREE Full Text
↵
2. Maguire LH ,
3. Handelman SK ,
4. Du X , et al
. Genome-wide association analyses identify 39 new susceptibility loci for diverticular disease. Nat Genet 2018;50:1359–65.doi:10.1038/s41588-018-0203-z
OpenUrl
↵
2. Wright CF ,
3. West B ,
4. Tuke M , et al
. Assessing the pathogenicity, penetrance, and expressivity of putative disease-causing variants in a population setting. Am J Hum Genet 2019;104:275–86.doi:10.1016/j.ajhg.2018.12.015
OpenUrl
↵
2. Bycroft C ,
3. Freeman C ,
4. Petkova D , et al
. The UK Biobank resource with deep phenotyping and genomic data. Nature 2018;562:203–9.doi:10.1038/s41586-018-0579-z
OpenUrl CrossRef
↵
2. Joshi AD ,
3. Andersson C ,
4. Buch S , et al
. Four susceptibility loci for gallstone disease identified in a meta-analysis of genome-wide association studies. Gastroenterology 2016;151:351–63.doi:10.1053/j.gastro.2016.04.007
OpenUrl
↵
2. Scholtens S ,
3. Smidt N ,
4. Swertz MA , et al
. Cohort Profile: lifelines, a three-generation cohort study and biobank. Int J Epidemiol 2015;44:1172–80.doi:10.1093/ije/dyu229
OpenUrl CrossRef PubMed

Footnotes

Contributors RKW and MP contributed equally to the writing of the manuscript.
Competing interests RKW received unrestricted research grants from and has acted as consultant for Takeda Pharmaceutical Company. MP has received speaker honoraria from Takeda.
Provenance and peer review Commissioned; internally peer reviewed.
Correction notice This article has been corrected since it first published online. The open access licence type has been amended.
Patient consent for publication Not required.

Linked Articles

Colon
Genome-wide association analysis of diverticular disease points towards neuromuscular, connective tissue and epithelial pathomechanisms

Clemens Schafmayer James William Harrison Stephan Buch Christina Lange Matthias C Reichert Philipp Hofer François Cossais Juozas Kupcinskas Witigo von Schönfels Bodo Schniewind Wolfgang Kruis Jürgen Tepel Myrko Zobel Jonas Rosendahl Thorsten Jacobi Andreas Walther-Berends Michael Schroeder Ilka Vogel Petr Sergeev Hans Boedeker Holger Hinrichsen Andreas Volk Jens-Uwe Erk Greta Burmeister Alexander Hendricks Sebastian Hinz Sebastian Wolff Martina Böttner Andrew R Wood Jessica Tyrrell Robin N Beaumont Melanie Langheinrich Torsten Kucharzik Stefanie Brezina Ursula Huber-Schönauer Leonora Pietsch Laura Sophie Noack Mario Brosch Alexander Herrmann Raghavan Veera Thangapandi Hans Wolfgang Schimming Sebastian Zeissig Stefan Palm Gerd Focke Anna Andreasson Peter T Schmidt Juergen Weitz Michael Krawczak Henry Völzke Gernot Leeb Patrick Michl Wolfgang Lieb Robert Grützmann Andre Franke Frank Lammert Thomas Becker Limas Kupcinskas Mauro D’Amato Thilo Wedel Christian Datz Andrea Gsur Michael N Weedon Jochen Hampe
Gut 2019; 68 854-865 Published Online First: 19 Jan 2019. doi: 10.1136/gutjnl-2018-317619

[1] ↵

Parkes M ,
Cortes A ,
van Heel DA , et al
. Genetic insights into common pathways and complex relationships among immune-mediated diseases. Nat Rev Genet 2013;14:661–73.doi:10.1038/nrg3502
OpenUrl CrossRef PubMed

[3] Parkes M ,

[4] Cortes A ,

[5] van Heel DA , et al

[6] ↵

Sigurdsson S ,
Alexandersson KF ,
Sulem P , et al
. Sequence variants in ARHGAP15, COLQ and FAM155A associate with diverticular disease and diverticulitis. Nat Commun 2017;8:15789.doi:10.1038/ncomms15789
OpenUrl

[8] Sigurdsson S ,

[9] Alexandersson KF ,

[10] Sulem P , et al

[11] ↵

Granlund J ,
Svensson T ,
Olén O , et al
. The genetic influence on diverticular disease--a twin study. Aliment Pharmacol Ther 2012;35:1103–7.doi:10.1111/j.1365-2036.2012.05069.x
OpenUrl CrossRef PubMed

[13] Granlund J ,

[14] Svensson T ,

[15] Olén O , et al

[16] ↵

Schafmayer C ,
Harrison JW ,
Buch S , et al
. Genome-wide association analysis of diverticular disease points towards neuromuscular, connective tissue and epithelial pathomechanisms. Gut 2019;68:854–65.doi:10.1136/gutjnl-2018-317619
OpenUrl Abstract/FREE Full Text

[18] Schafmayer C ,

[19] Harrison JW ,

[20] Buch S , et al

[21] ↵

Maguire LH ,
Handelman SK ,
Du X , et al
. Genome-wide association analyses identify 39 new susceptibility loci for diverticular disease. Nat Genet 2018;50:1359–65.doi:10.1038/s41588-018-0203-z
OpenUrl

[23] Maguire LH ,

[24] Handelman SK ,

[25] Du X , et al

[26] ↵

Wright CF ,
West B ,
Tuke M , et al
. Assessing the pathogenicity, penetrance, and expressivity of putative disease-causing variants in a population setting. Am J Hum Genet 2019;104:275–86.doi:10.1016/j.ajhg.2018.12.015
OpenUrl

[28] Wright CF ,

[29] West B ,

[30] Tuke M , et al

[31] ↵

Bycroft C ,
Freeman C ,
Petkova D , et al
. The UK Biobank resource with deep phenotyping and genomic data. Nature 2018;562:203–9.doi:10.1038/s41586-018-0579-z
OpenUrl CrossRef

[33] Bycroft C ,

[34] Freeman C ,

[35] Petkova D , et al

[36] ↵

Joshi AD ,
Andersson C ,
Buch S , et al
. Four susceptibility loci for gallstone disease identified in a meta-analysis of genome-wide association studies. Gastroenterology 2016;151:351–63.doi:10.1053/j.gastro.2016.04.007
OpenUrl

[38] Joshi AD ,

[39] Andersson C ,

[40] Buch S , et al

[41] ↵

Scholtens S ,
Smidt N ,
Swertz MA , et al
. Cohort Profile: lifelines, a three-generation cohort study and biobank. Int J Epidemiol 2015;44:1172–80.doi:10.1093/ije/dyu229
OpenUrl CrossRef PubMed

[43] Scholtens S ,

[44] Smidt N ,

[45] Swertz MA , et al

Log in using your username and password

Main menu

Log in using your username and password

You are here

Statistics from Altmetric.com

Request Permissions

References

Footnotes

Linked Articles

Read the full text or download the PDF:

Log in using your username and password