Development and validation of a MEDLINE search filter/hedge for degenerative cervical myelopathy
Degenerative cervical myelopathy (DCM) is a common condition with many unmet clinical needs. Pooled analysis of studies is an important tool for advancing medical understanding. This process starts with a systematic search of the literature. Identification of studies in DCM is challenged by a number of factors, including non-specific terminology and index terms. Search filters or HEDGEs, are search strings developed and validated to optimise medical literature searches. We aimed to develop a search filter for DCM for the MEDLINE database.
The diagnostic test assessment framework of a “development dataset” and seperate “validation dataset” was used. The development dataset was formed by hand searching four leading spinal journals (Spine, Journal of Neurosurgery Spine, Spinal Cord and Journal of Spinal Disorders and Techniques) in 2005 and 2010. The search filter was initially developed focusing on sensitivity and subsequently refined using NOT functions to improve specificity. One validation dataset was formed from DCM narrative and systematic review articles and the second, articles published in April of 1989, 1993, 1997, 2001, 2005, 2009, 2013 and 2017 retrieved via the search MeSH term ‘Spine’. Metrics of sensitivity, specificity, precision and accuracy were used to test performance.
Hand searching identified 77/1094 relevant articles for 2005 and 55/1199 for 2010. We developed a search hedge with 100% sensitivity and a precision of 30 and 29% for the 2005 and 2010 development datasets respectively. For the selected time periods, EXP Spine returned 2113 publications and 30 were considered relevant. The search filter identified all 30 relevant articles, with a specificity of 94% and precision of 20%. Of the 255 references listed in the narrative index reviews, 225 were indexed in MEDLINE and 165 (73%) were relevant articles. All relevant articles were identified and accuracy ranged from 67 to 97% over the three reviews. Of the 42 articles returned from 3 recent systematic reviews, all were identified by the filter.
We have developed a highly sensitive hedge for the research of DCM. Whilst precision is similarly low as other hedges, this search filter can be used as an adjunct for DCM search strategies.
KeywordsCervical Myelopathy Spondylosis Spondylotic Stenosis Disc herniation Ossification posterior longitudinal ligament Degeneration Hedge Search filter MEDLINE Systematic review
Chronic obstructive pulmonary disease
Degenerative cervical myelopathy
Japanese Orthopaedic Association
Medical Subject Headings
Ossification of the ligamentum flavum
Ossification of the posterior longitudinal ligament
Degenerative cervical myelopathy [DCM] is a new umbrella term for a common clinical phenotype: cervical spinal cord compression causing myelopathy (spinal cord damage) from degenerative changes of the surrounding spinal structures . Causative degenerative pathology include disc prolapses, osteophyte formation or ligament hypertrophy. DCM is estimated to be the most common cause of spinal cord dysfunction  and despite surgery to alleviate compression, most patients retain lifelong disabilities. A recent study identified that quality of life amongst DCM patients was worse than patients with heart failure, COPD and Cancer . Clearly therefore, there remain major unmet clinical needs in DCM.
DCM was often, formerly referred to as Cervical Spondylotic Myelopathy. However, there was inconsistency as to whether this included related conditions such as ossification of the posterior longitudinal ligament [OPLL] or ossification of the ligamentum flavum [OLF]. This has caused ambiguity in critical appraisal during research synthesis.  The various terms were also a mouthful for patients . These factors have contributed to the proposal of DCM as a new term.
From a clinical point of view, the development of an umbrella term is logical; patients suffer the same clinical symptoms, from a presumed common spinal cord injury mechanism, undergo a similar clinical work up and are treated via surgical decompression. Therefore their pooled analysis can further understanding of diagnosis, prognosis, treatment efficacy and appropriate study design [5, 6, 7, 8].
Myelopathy is a medical term for disease of the spinal cord, which causes a set of common symptoms. It does not specify the aetiology or the level affected.
The type of surgical treatments for the cervical spine are common to many pathologies and not specific to DCM.
Cervical has anatomical relevance outside of the spine, including ‘of the cervix’, a well-researched area of women’s health.
Search filters, also known as search hedges, are validated search schemes which can be incorporated into any strategy to focus their results to a certain target. They have been developed to help clinicians efficiently and accurately filter the ever burgeoning medical literature to answer important clinical questions and advance care . For example, hedges have been developed to select for specific study designs , specialty [15, 16, 17], themes  or disease . Development requires testing against a manual hand search. Typically, a ‘gold standard’ database is created manually, with a proportion used for development of the hedge, which is then validated in the remainder.
Systematic reviews help prevent research wastage. In 2010 it was estimated $240 billion was spent on health research, yet as much as 85% failed to deliver meaningful clinical benefit . In the report, purported reasons include duplication of existing knowledge, which could be prevented by prior systematic review.
A number of medical literature databases exist. Although Cochrane recommends the use of MEDLINE, EMBASE and CENTRAL for evaluation of interventions, MEDLINE is the most popular database and identifies the majority of included studies. It is noted, however, that the use of MEDLINE alone is often insufficient .
Our objective was to develop a search hedge for degenerative cervical myelopathy in the MEDLINE database. Our priority was sensitivity, with a view that from this foundation, systematic reviewers could focus their strategies.
The objective was to develop a filter with > 98% sensitivity for studies considering any aspect of DCM in humans, for the MEDLINE database. The authors chose to focus on sensitivity, as it was intended that this filter would often form the basis of a DCM systematic review and reviewers could add additional syntax to focus their search with respect to their question.
Considered DCM in humans
Reported solely on animal data
Reported on heterogeneous populations (not exclusively DCM).
Studies with heterogeneous populations are commonly identified in DCM search strategies, for as aforementioned, studies evaluating a surgical technique may use a population with mixed pathologies. The extraction of DCM specific data is therefore difficult and rarely sought. These studies were therefore excluded. However, if the study solely evaluated a surgical technique on patients with DCM, it was included.
All types of article (for example reviews, primary clinical trials or commentary), written in any language were included. As English-speaking authors, foreign language texts underwent a translated title and abstract screen only.
A development dataset was created, comprising articles published in four leading spinal journals  in the years 2005 and 2010; Spinal Cord, Spine, Journal of Neurosurgery Spine and Journal of Spinal Disorders and Techniques. When choosing these spinal journals, consideration was given to ensure both surgical (Spine and Journal of Neurosurgery Spine) and non-surgical (Spinal Cord and Journal of Spinal Disorders and Techniques) focused journals were selected. The DCM literature is heavily weighted towards surgery, as the only evidence-based treatment, and it was felt this balance would ensure greater generalisation of the developed search strategy given relevant material is published outside of these fields also [5, 6]. In our previous systematic reviews, 48 (44%) of the included articles were published within these four journals with the remaining 60 from 36 different journals, indicating their relevance as developmental journals to the field of DCM. Articles were hand-searched by authors (BMD, SG, KY) for inclusion using title and abstracts, and where necessary full text articles. Searches were randomly allocated to an author, such that overall, the entire database was screened at least twice by two different reviewers.
The initial search strategy was designed based on the results of our previous systematic reviews (108 relevant articles) [5, 6]. Titles and abstracts were scrutinized for relevant keywords and listed MeSH (Medical Subject Headings). The MEDLINE MeSH taxonomy was reviewed to identify appropriate grouping terms. Based on these articles, and in keeping with systematic reviews conducted by others within [8, 24, 25, 26, 27] or related to the field , the filter was developed using two components: 1) ‘Pertaining to the cervical spine’ AND 2) ‘Pertaining to spinal cord compression (i.e. myelopathy)’. This was felt to be logical, as both these components have to be satisfied for a diagnosis of DCM. We sought to optimise the strategy by comparing iteration A (x number of hits) and with the subsequent iteration B (y number of hits). Where x > y, we combined A NOT B, to identify any missed papers and judge their relevance/importance. Where y > x, we combined B NOT A, to identify any missed papers and judge their relevance. Expert judgement was used to identify search terms and further optimise the search strategy.
Once a 100% sensitive search strategy had been developed, the incorporation of NOT functions was trialed to increase specificity but retain 100% sensitivity. This was undertaken using the same iterative process. In addition to the already formed developmental dataset, an additional developmental dataset was formed by screening articles identified by the 100% sensitive search strategy within the publications of the four developmental journals during 2015. If the addition of a NOT term removed a valid article, this version of the search strategy was discarded, and an alternative option trialed. In order to achieve a validated hedge with > 98% sensitive hedge (once extrapolated across less focused journals) it was believed necessary to have 100% sensitivity during development. Therefore, the loss of any relevant article was deemed unacceptable at this stage.
The filter was developed using the OVID platform for MEDLINE, in concert with a medical librarian (IK).
Due to the relative low incidence of DCM publications, when forming our development datasets, in order to avoid hand searching extremely large datasets we had focused on leading spinal journals only. Previous hedge developments have used a single dataset, fractioned for development and validation. However to ensure that the developed hedge was generalisable to other journals, at different time points, we developed two further and distinct datasets for validation. One dataset was made up of the references from recent DCM reviews, retrievable using the OVID platform. The second dataset comprised articles published in April of 1989, 1993, 1997, 2001, 2005, 2009, 2013 and 2017 retrieved via the search EXP MeSH term ‘Spine’. The term was chosen as it is a general and broad category, but one for which DCM studies may match (Fig. 1).
Development and validation data sets, and the returns from filter iterations were exported from OVID, to Excel (Microsoft, California) to analyse based on their Unique Identifiers. Metrics of sensitivity, specificity, accuracy and precision were used to compare performance.
Hand searching of leading spinal journals identified 77 (out of 1094) relevant articles for 2005 and 55 (out of 1199) for 2010.
Example of search development and evaluation. Expanding on the terms ‘cervical’ and ‘myelopathy’ improved sensitivity. However, the use of ‘neck’ or terms relating to ‘spinal cord injury’ were of no added benefit. Results given for 2005 development database, for which 77 articles were identified using the hand search. Changes to the search strategy from one iteration to the next are shown in bold
(cervical and myelopathy).mp.
(exp Cervical Vertebrae/ or exp Cervical Cord/ or cervical.mp. or (phrenic nucleus or accessory nucleus).mp.) and (myelopath*.mp. or exp Spinal Cord Diseases/ or (spinal cord adj3 (diseas* or disorder*)).mp.)
(exp Cervical Vertebrae/ or exp. Cervical Cord/ or cervical.mp. or (phrenic nucleus or accessory nucleus).mp.) and (myelopath*.mp. or exp. Spinal Cord Diseases/ or (spinal cord adj3 (diseas* or disorder*)).mp. or myeloradiculopath*.mp. or (Spinal Cord adj3 Compress*).mp. or exp Spinal Cord Compression/)
(exp Cervical Vertebrae/ or exp. Cervical Cord/ or cervical.mp. or (phrenic nucleus or accessory nucleus).mp. or exp Neck/ or neck*.mp.) and (myelopath*.mp. or exp. Spinal Cord Diseases/ or (spinal cord adj3 (diseas* or disorder*)).mp. or myeloradiculopath*.mp. or (Spinal Cord adj3 Compress*).mp. or exp. Spinal Cord Compression/)
(exp Cervical Vertebrae/ or exp. Cervical Cord/ or cervical.mp. or (phrenic nucleus or accessory nucleus).mp. or exp. Neck/ or neck*.mp.) and (myelopath*.mp. or exp. Spinal Cord Diseases/ or (spinal cord adj3 (diseas* or disorder*)).mp. or myeloradiculopath*.mp. or (Spinal Cord adj3 Compress*).mp. or exp. Spinal Cord Compression/ or (spinal cord adj3 (injur* or trauma* or contusion* or lacerat*)).mp or exp Spinal Cord Injuries/)
The terms related to ‘cervical’ and ‘myelopathy’ had a ceiling affect, with the missing studies largely focussed on OPLL exclusively. On this basis, the ‘cervical’ component was initially expanded to include OPLL as a MeSH term and keyword, but this still missed relevant articles. As such the search strategy was expanded to include all articles with OPLL as a MeSH term, independent of the ‘Myelopathy’ search component. Another important adaptation was the inclusion JOA (Japanese Orthopaedic Association) within the ‘myelopathy’ search component, as a number of relevant articles did not mention myelopathy in their title or abstract. The JOA, in its various forms, is the most commonly used grading assessment for human function in DCM . It was specifically developed for assessment of function in DCM and therefore felt to be an appropriate synonym.
With a 100% sensitive search for our development databases, we explored the use of ‘NOT’ functions to improve precision. The use of keywords was ineffective, as a number of relevant articles specified exclusion criteria in their abstract – therefore these double negatives led to inappropriate removal of relevant articles from our search results. As such, only MeSH terms could be used in the NOT search component. This reduced the number of retrieved articles by 30%, from 15, 827 to 11, 033 (searched performed 14th July 2017). The final filter for validation had a precision of 30 and 29% for the 2005 and 2010 development datasets.
Validation in selected narrative review articles. All relevant references were identified with the search filter
Number of References (Relevant Articles)
Number of Journals (Publication Years, Range)
Number of Development Journals (%)
Kato et al. (2016) 
Wilson et al. (2017) 
Abiola et al. (2016) 
In addition, the included articles from the 3 most recent systematic reviews on DCM were used [33, 34, 35]. As per the objectives of the reviews, these were reviews for which the DCM search filter is an intended user. The reviews were not published in journals used as part of the development dataset. Of the 42 separate articles included (43 were included, but one study was included in two systematic reviews) all studies were identified by our search filter. 23 (53%) of included articles were published in journals used to build the development dataset.
Amongst the narrative review references, common irrelevant articles related to a surgical technique not unique to DCM (25) or OPLL in other areas of the spine (8). Surgical technique articles were appropriately not retrieved in 8 cases.
We have developed and validated a highly sensitive search filter for DCM (Additional file 1). Whilst the precision is relatively low, this foundation can be developed by researchers to focus their literature searches as required and is comparable to other filters [19, 36].
How does the precision compare to other filters?
Imprecision is a recognised challenge for search filters and to be expected, when there is a relatively low proportion of relevant articles within diverse databases .
The use of NOT functions has been shown to improve precision. There are recognised risks of using NOT functions including a tendency to introduce errors within long search syntax and the risk of removing relevant articles . Specifically, we found that the NOT functions when used with keywords removed relevant articles by creating double negatives. Specifically, some abstracts reported their exclusion criteria and therefore by also placing our matching exclusion criteria in our search syntax, the article was not returned. We do not think this has been specifically described before. Whilst this could be circumvented by using MeSH terms as opposed to keywords, this limited the terminology that could be incorporated in the NOT functions.
The precision of existing filters varies greatly, depending on the target. When evaluating filters to identify systematic reviews, Lee et al. found precision of < 5% in all tested filters [14, 39], whereas filters to identify Chronic Kidney Disease studies or content relevant for Geriatrics had precisions > 90% [16, 19, 36]. Our precision ranged from 20 to 30% depending on the database, which appears middle of the road. Efforts to further improve this using NOT functions led to the exclusion of relevant articles, which we deemed unacceptable. Although a limited comparison (considering included articles, across many databases), the precision of recent DCM reviews has been less than 8% [8, 24, 25, 26, 27]. Given our search filter could identify the same articles, this would suggest that it would be able to increase efficiency for DCM systematic review.
What is generating imprecision in our DCM filter?
For our search, the majority of studies that were retrieved, but irrelevant, concerned Spinal Cord Injury, surgical techniques which are shared between DCM and other conditions, such as disc replacement surgery and anterior cervical discectomy, and OPLL outside the cervical spine. Spinal cord injury and surgery for degenerative spinal disease have a high research output and bear a close relation to DCM, in terms of the contributory pathology, the disability and the treatments. As such they share many MeSH terms, and equally as often stated in DCM studies’ exclusion criteria, the use of keywords led to double negatives. However, these topics were not completely represented by our search filter, as evidenced by their mixed identification during validation, and this must be recognised by researchers wishing to use the filter. For example, if a review was to evaluate a surgical technique used in the treatment of DCM, including data from its use in other indications or wishing to extract outcomes from mixed populations, this filter would not be appropriate. If a review wanted to evaluate a technique reported just in patients with DCM, then this filter would be appropriate.
Is imprecision a problem?
Efficiency is important for day-to-day uptake of filters, outside research fields. It is estimated that whilst a researcher may spend on average 1 h critically appraising the literature, clinically physicians may have less than 2 min to identify relevant evidence . A balance between precision and sensitivity therefore needs to be struck.
Automated data mining, including the use of machine learning, for literature searching is a growing field of research [41, 42, 43]. It offers the potential to very accurately select articles of relevance and improve efficiency. At present, the areas closest to use by non-specialists, are tools to improve the efficiency of title and abstract screening in systematic reviews. These include Abstrackr , Revis  and EPPI Reviewer tools . These are tools which could be used to optimise a sensitive search filter.
Our objective was to develop a search filter for DCM in MEDLINE that could form the basis of any DCM systematic review and therefore we prioritised sensitivity over all other performance metrics. Whilst this limits its usability for clinicians on a day to day basis, the intention is that no relevant studies are excluded, and that the adjuvant search strategy will improve the specificity / precision for the researchers’ needs.
What are the limitations of this filter?
The generalisation of a search filter will be dependent on the database it was developed from. Investigators balance creating a database which is representative of the literature as a whole, with including sufficient target articles and remaining of a size which is amenable to hand searching. Various strategies have been employed; most commonly author selected journals, but also citation chasing and systematic reviews. 
This initial author selection process is therefore a potential limitation to generalisation. In this study, we used a combination of previous strategies, including key journals, an exploded MESH term, narrative and systematic reviews, to minimize this risk. In addition, given the search filter’s performance was maintained across many other journals and years of medical literature during validation, we therefore feel confident it is generalisable for MEDLINE.
This filter has been developed based on the presently available constructs and MeSH terms. In the absence of text mining software, frequency text analysis could not be employed to help identify key search terms. This process instead took place by visual inspection by multiple authors. Additionally if new MeSH terms arose within the field, this could alter the performance and re-validation may be needed. By incorporating both MeSH terms and free text searches we hope to ensure some longevity to this function.
We have developed a highly sensitive search filter of relevance to clinical DCM research. When using this filter, it is important to consider its limitations with respect to a review’s desired objectives.
MRNK is supported by NIHR Clinician Scientist Award. This report is independent research arising from a Clinician Scientist Award, CS-2015-15-023, supported by the National Institute for Health Research. The views expressed in this publication are those of the authors and not necessarily those of the NHS, the National Institute for Health Research or the Department of Health.
Availability of data and materials
All data generated or analysed during this study are included in this published article. The source data is available on request.
BMD was involved in the conception and design of the study, forming the development dataset, developing the search filter, data analysis and writing the manuscript. SG was involved in the formation of the development dataset. KY was involved in the formation of the development dataset and drafting the manuscript. IK was involved in the development of the search filter. MRNK was involved in the conception and design of the study, overall supervision, drafting of the manuscript. All authors read and approved the final manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
- 3.Oh T, Lafage R, Lafage V, Protopsaltis T, Challier V, Shaffrey C, Kim HJ, Arnold P, Chapman J, Schwab F, et al. Comparing quality of life in cervical Spondylotic myelopathy with other chronic debilitating diseases using the short form survey 36-health survey. World Neurosurg. 2017;106:699–706.CrossRefPubMedGoogle Scholar
- 4.E R: Elemental Ideas: Cervical Spondylotic Myelopathy. In. Edited by TV C. Available from: http://archive.cambridge-tv.co.uk/elemental-ideas-cervical-spondylotic-myelopathy-part-one/.
- 25.Tetreault LA, Dettori JR, Wilson JR, Singh A, Nouri A, Fehlings MG, Brodt ED, Jacobs WB. Systematic review of magnetic resonance imaging characteristics that affect treatment decision making and predict clinical outcome in patients with cervical spondylotic myelopathy. Spine. 2013;38(22 Suppl 1):S89–110.CrossRefPubMedGoogle Scholar
- 26.Wilson JR, Barry S, Fischer DJ, Skelly AC, Arnold PM, Riew KD, Shaffrey CI, Traynelis VC, Fehlings MG. Frequency, timing, and predictors of neurological dysfunction in the nonmyelopathic patient with cervical spinal cord compression, canal stenosis, and/or ossification of the posterior longitudinal ligament. Spine. 2013;38(22 Suppl 1):S37–54.CrossRefPubMedGoogle Scholar
- 33.Fehlings MG, Tetreault LA, Kurpad S, Brodke DS, Wilson JR, Smith JS, Arnold PM, Brodt ED, Dettori JR. Change in functional impairment, disability, and quality of life following operative treatment for degenerative cervical myelopathy: a systematic review and meta-analysis. Glob Spine J. 2017;7(3 Suppl):53s–69s.CrossRefGoogle Scholar
- 35.Tetreault LA, Rhee J, Prather H, Kwon BK, Wilson JR, Martin AR, Andersson IB, Dembek AH, Pagarigan KT, Dettori JR, et al. Change in function, pain, and quality of life following structured nonoperative treatment in patients with degenerative cervical myelopathy: a systematic review. Glob Spine J. 2017;7(3 Suppl):42s–52s.CrossRefGoogle Scholar
- 36.Hildebrand AM, Iansavichus AV, Lee CW, Haynes RB, Wilczynski NL, McKibbon KA, Hladunewich MA, Clark WF, Cattran DC, Garg AX. Glomerular disease search filters for Pubmed, Ovid Medline, and Embase: a development and validation study. BMC Med Inform Decis Mak. 2012;12:49.CrossRefPubMedPubMedCentralGoogle Scholar
- 41.Felizardo KR SN, Martins RM, Mendes E, MacDonell SG, Maldonando JC: Using visual text mining to support the study selection activity in systematic literature reviews. In: 2011 5th International Symposium on Empirical Software Engineering and Measurement (ESEM) IEEE; 2011. pp. 77–86. https://dl.acm.org/citation.cfm?id=2082758&picked=prox.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.