Deep learning-assisted diagnosis of femoral trochlear dysplasia based on magnetic resonance imaging measurements

doi:10.12998/wjcc.v11.i7.1477

Advanced Search

BPG is committed to discovery and dissemination of knowledge

Home / Archive / Volume 11, Issue 7

This Article

Academic Content and Language Evaluation of This Article

CrossCheck and Google Search of This Article

Academic Rules and Norms of This Article

Citation of this article

Corresponding Author of This Article

Research Domain of This Article

Article-Type of This Article

Open-Access Policy of This Article

Times Cited Counts in Google of This Article

Number of Hits and Downloads for This Article

Total Article Views (4386)

All Articles published online

The chart showing PDF series, WORD series, HTML series, Figures (1-2) series, Tables (1-7) series.

Item

Count

PDF

213

WORD

HTML

2194

Figures (1-2)

595

Tables (1-7)

525

Sum=3571

Publishing Process of This Article

The chart showing Browse series, Download series.

Item

Count

Browse

170

Download

645

Sum=815

Mar 6, 2023 (publication date) through Jul 19, 2025

Times Cited of This Article

Times Cited (2)

Journal Information of This Article

Publication Name

World Journal of Clinical Cases

ISSN

2307-8960

Publisher of This Article

Baishideng Publishing Group Inc, 7041 Koll Center Parkway, Suite 160, Pleasanton, CA 94566, USA

Retrospective Study Open Access

World J Clin Cases. Mar 6, 2023; 11(7): 1477-1487
Published online Mar 6, 2023. doi: 10.12998/wjcc.v11.i7.1477

Deep learning-assisted diagnosis of femoral trochlear dysplasia based on magnetic resonance imaging measurements

Sheng-Ming Xu, Dong Dong, Wei Li, Tian Bai, Ming-Zhu Zhu, Gui-Shan Gu

Sheng-Ming Xu, Wei Li, Gui-Shan Gu, Department of Orthopedic Surgery, The First Hospital of Jilin University, Changchun 130000, Jilin Province, China

Dong Dong, Department of Radiology, The First Hospital of Jilin University, Changchun 130000, Jilin Province, China

Tian Bai, Ming-Zhu Zhu, College of Computer Science and Technology, Jilin University, Changchun 130000, Jilin Province, China

ORCID number: Sheng-Ming Xu (0000-0001-7924-4777); Wei Li (0000-0002-9915-7117); Gui-Shan Gu (0000-0002-1483-3746).

Author contributions: Xu SM, Dong D and Gu GS proposed the research topics; Gu GS, Xu SM, Bai T and Li W designed the research protocols; Xu SM, Dong D, Li W and Zhu MZ performed the data acquisition; Dong D and Zhu MZ participated in experimental data analysis; Xu SM was responsible for the primary manuscript generation; Gu GS and Bai T substantively revised it; all authors have read and agreed to the published version of the manuscript.

Institutional review board statement: The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by Medical Ethics Committee of the first hospital of of Jilin University (registration number 2020-607, approved on 15 Dec 2020).

Informed consent statement: Patients were not required to give informed consent to the study because the analysis used anonymous clinical data that were obtained after each patient agreed to treatment by written consent.

Conflict-of-interest statement: All authors declare no conflicts of interest for this article.

Data sharing statement: The data presented in this study are available on request from the corresponding author.

Open-Access: This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: https://creativecommons.org/Licenses/by-nc/4.0/

Corresponding author: Gui-Shan Gu, MD, Professor, Department of Orthopedic Surgery, The First Hospital of Jilin University, No. 71 Xinmin Street, Changchun 130000, Jilin Province, China. gugs@jlu.edu.cn

Received: November 27, 2022
Peer-review started: November 27, 2022
First decision: January 19, 2023
Revised: January 27, 2023
Accepted: February 13, 2023
Article in press: February 13, 2023
Published online: March 6, 2023
Processing time: 95 Days and 8.2 Hours

Abstract

BACKGROUND

Femoral trochlear dysplasia (FTD) is an important risk factor for patellar instability. Dejour classification is widely used at present and relies on standard lateral X-rays, which are not common in clinical work. Therefore, magnetic resonance imaging (MRI) has become the first choice for the diagnosis of FTD. However, manually measuring is tedious, time-consuming, and easily produces great variability.

AIM

To use artificial intelligence (AI) to assist diagnosing FTD on MRI images and to evaluate its reliability.

METHODS

We searched 464 knee MRI cases between January 2019 and December 2020, including FTD (n = 202) and normal trochlea (n = 252). This paper adopts the heatmap regression method to detect the key points network. For the final evaluation, several metrics (accuracy, sensitivity, specificity, etc.) were calculated.

RESULTS

The accuracy, sensitivity, specificity, positive predictive value and negative predictive value of the AI model ranged from 0.74-0.96. All values were superior to junior doctors and intermediate doctors, similar to senior doctors. However, diagnostic time was much lower than that of junior doctors and intermediate doctors.

CONCLUSION

The diagnosis of FTD on knee MRI can be aided by AI and can be achieved with a high level of accuracy.

Key Words: Femoral trochlear dysplasia; Deep learning; Artificial intelligence; Magnetic resonance imaging; Diagnosis

Core Tip: Femoral trochlear dysplasia is an important risk factor for patellar instability. Magnetic resonance imaging has become the preferred method for evaluating femoral trochlear dysplasia. However, manually measuring femoral trochlea parameters on magnetic resonance imaging is tedious, time-consuming, and easily produces great variability. In this work, we propose an assisted diagnosis algorithm framework based on deep learning technology, which can quickly and accurately distinguish whether there is trochlear dysplasia in the femur. All values (The accuracy, sensitivity, specificity, etc.) were superior to junior doctors and intermediate doctors, similar to senior doctors. Our model is beneficial to both orthopedic surgeons and radiologists, especially, the young front-line clinicians with less experience.

Citation: Xu SM, Dong D, Li W, Bai T, Zhu MZ, Gu GS. Deep learning-assisted diagnosis of femoral trochlear dysplasia based on magnetic resonance imaging measurements. World J Clin Cases 2023; 11(7): 1477-1487
URL: https://www.wjgnet.com/2307-8960/full/v11/i7/1477.htm
DOI: https://dx.doi.org/10.12998/wjcc.v11.i7.1477

INTRODUCTION

Femoral trochlear dysplasia (FTD) was first proposed by Swedish professor Brattstrom in 1964[1]. It is an important risk factor for patellar instability[2]. It is defined as a shallow, flat or raised trochlear groove, with an incidence of 96% in patients with recurrent patellar instability[3,4]. Different degrees of FTD are treated differently. Dejour classification is widely used at present and relies on the morphological features of standard lateral X-rays of the knee to describe the increasing disease severity (type A-D)[5]. However, it is subjectively dependent on doctors and has the disadvantages of poor sensitivity and specificity, which may lead to the neglect of FTD and the formation of an incorrect treatment plan for patients.

Paiva et al[6] reviewed all the literature related to FTD measurement, including X-ray computed tomography (CT) and magnetic resonance imaging (MRI) examination, and summarized 33 FTD evaluation methods. However, compared with X-ray and CT, MRI has the advantage of protruding articular cartilage, so MRI has become the preferred method for evaluating femoral trochlear dysplasia[7]. Specifically, the measurement of femoral trochlear groove depth, lateral trochlear inclination and trochlear facet asymmetry by MRI is particularly good in distinguishing normal and dysplastic femur trochlea[6-11]. However, tedious and repeated measurement is essential to using these qualitative and quantitative parameters to diagnose FTD, and it easily produces considerable differences in intragroup consistency and intergroup consistency. For surgeons and radiology doctors, especially resident physicians with less experience, it not only increases the working hours and burden but also increases the occurrence of misdiagnosis.

Artificial intelligence (AI) has become more and more popular in medical research, because it can not only quickly and accurately assist the diagnosis of diseases, such as cancer, but also participate in the robot surgery system to accurately treat diseases, which has made a great contribution to the development of health care system[12,13]. Recently, the use of deep learning methods in medical imaging has aroused much interest[14,15]. Chung et al[16] assessed the ability of artificial intelligence to detect and classify proximal humeral fractures using ordinary shoulder radiographs. Their results show that convolutional neural networks (CNNs) outperforms general physicians and orthopaedists. Pranata et al[17] used a deep learning algorithm to automatically classify and detect fracture locations in CT of the calcaneus. Urakawa et al[18] trained a convolutional neural network to compare its performance with orthopaedists in diagnosing intertrochanteric fractures. Cheng et al[19] localized and detected hip fractures on pelvic X-rays by using a deep convolutional neural network (DCNN). The feasibility and effectiveness of deep neural networks for hip fracture screening were confirmed. Lindsey et al[20] developed a deep neural network to assist emergency physicians in detecting and locating fractures in X-ray photographs. These findings show that senior medical experts can share their expertise with young doctors at the forefront of medicine through deep learning networks.

In addition to X-rays, deep learning and appropriate construction models can also efficiently identify MRI images, and their role in assisting in diagnosis is obvious. Zhou et al[21] used a combination of CNN and 3D deformation modelling to perform fast and accurate comprehensive knee joint tissue segmentation. Liu et al[22] developed a deep learning-based fully automatic cartilage damage detection system using segmentation and classification CNNs. Fritz et al[23] compared the accuracy of a fully automatic DCNN with radiologists in identifying medial and lateral meniscus tears in MRI. The results showed that meniscus tear detection based on DCNN can be performed fully automatically with similar specificity. Shin et al[24]'s study showed that the CNN model can be used to diagnose the presence of meniscal tears and distinguish the types of meniscus tears. Tang et al[25] also proposed a fully automatic CNN-based knee segmentation system for the rapid and accurate evaluation of knee images.

All these developments suggest that artificial intelligence seems to be a breakthrough in solving the problem of femoral trochlea diagnosis. We propose an artificial intelligence system to label and detect the key points of knee MRI to assist in the diagnosis of femoral trochlear dysplasia. Our assumption is that the deep learning method will provide diagnostic performance similar to that of clinical radiologists and achieve higher intraobserver consistency to detect FTD.

MATERIALS AND METHODS

Description of the dataset

We collected 202 cases of femoral trochlear dysplasia examined in the First Hospital of Jilin University from January 2019 to August 2022. The inclusion criteria were that there was at least one history of patellar dislocation and the MRI suggested the presence of trochlear dysplasia. The exclusion criteria were as follows: osteoarthritis, rheumatoid arthritis, ipsilateral knee surgery history and MRI with image quality problems. Additionally, according to the above inclusion and exclusion criteria, 252 patients with other knee joint problems, such as mild soft tissue injury or joint effusion, were selected for the same period of knee MRI. Sex and age were matched with the trochlear dysplasia group. The diagnosis of trochlear dysplasia was made by senior radiologists and senior surgeons with more than 10 years of working experience, and all differences were resolved before training and testing, as confirmed by MRI images. Finally, 464 cases of knee MRI were selected, including femoral trochlea dysplasia (n = 202) and normal trochlea (n = 252), 263 of whom were female and 201 of whom were male. The MRI images used in this study were 1.5T and were obtained from a Siemens digital radiography facility. Then, after preprocessing, the image dataset was randomly divided into a training set, validation set and test set. Ethical approval was provided by the Medical Ethics Committee.

Data preparation

The sample was marked by three doctors, including a senior surgeon, an intermediate surgeon and a junior surgeon. Trochlear depth and asymmetry of the facet length were measured 3 cm above the femorotibial joint space, and lateral trochlear inclination was measured on the first craniocaudal image that depicted the complete cartilaginous trochlea[10,11]. All the samples in the training set and the validation set were marked by senior surgeons, and two surgeons marked and judged the samples in the test set (Figure 1).

Open in New Tab Full Size Figure Download Figure

Figure 1 Sample labeling base on axial magnetic resonance imaging. Trochlear depth was calculated according to the formula [a + b] ÷ 2 - c; asymmetry of the facet length was expressed as [g ÷ f] × 100%; lateral trochlear inclination is the angle between f and e.

Training the deep CNN and framework

In the current diagnosis process, doctors need to manually mark the anatomical key points of the MRII to measure the anatomical parameters and complete femoral trochlear dysplasia diagnosis through measurement results. However, the manual annotation process is time-consuming, and the diagnosis consistency varies. Therefore, a key point detection method is proposed in this paper to realize the automatic diagnosis of diseases.

Since different key points have different feature information, to achieve accurate automatic positioning of key points, this paper adopts the heatmap regression method to detect the key points network. During training, the numerical coordinate labels are first converted into labels in the form of heatmaps according to a Gaussian distribution function. In the heatmap label, the pixel value closer to the key point is closer to 1. The model can fully learn the regional information between different key points by using the label in the form of a heatmap. The key point detection method using heatmap regression can be regarded as a pixel-level regression task. Therefore, a U-shaped network structure is designed in this paper to learn the mapping between the input MRII and the heatmap. The whole network is divided into two structures: encoder and decoder. The encoder is composed of five network layers through continuous convolution and downsampling operations.

The feature map channel size of each layer is continuously increased from 64 to 1024, and two cascaded 3 × 3 convolutions are included in each layer. Each convolutional model is composed of a 3 × 3 convolution, batch normalization and the LeakyReLU activation function. The down sampling operation between different layers uses the maximum pooling method to retain the features learned by the previous layer. The decoder is also composed of a five-layer network.

However, in the encoder, the feature map learned by the encoder is restored to the original input scale by using an up sampling operation. In the up sampling process, the learned feature maps of the encoder and the decoder are spliced in the direction along the channel by means of skip cascading. The reason for this approach is that the low-level features of the image, such as the line, texture, and shape of the femur, are usually learned in the first few layers of the encoder. As the network continues to deepen, the network learns more abstract high-level features, such as anatomical information in femoral images. A more accurate heatmap is predicted by fusing low-level features with high-level features using a skip cascade. In a convolutional neural network, convolution usually learns the local feature information. Therefore, the model needs to focus on the global context information in the image to learn the discriminative information around the key points. This paper introduces a nonlocal neural network designed based on the attention mechanism of the previous structure and captures the global contextual information in the image through this structure. Since a nonlocal network usually requires large computational complexity, it is placed between the encoder and decoder. Because the feature map learned in the last layer of the encoder has the smallest scale, learning high-level features is enhanced by the model. In the last layer of the entire network, the final heatmap is calculated using 1x1 convolutions. The number of channels in the heatmap is consistent with the number of key points. By obtaining the position of the extreme value of the heatmap on each channel, the final key point coordinates are obtained.

During training, the sigma size of the heatmap is set to 3. The loss function adopts the squared error loss function and uses Adam as the optimizer for training. The total number of iterations for training is 25000, while the initial learning rate is set to 0.001 and the batch size is set to 4. The model is trained in the training set, and the model with the smallest error in the validation set is selected as the final test model (Figure 2).

Open in New Tab Full Size Figure Download Figure

Figure 2 Network structure.

Image processing

Because of the brightness difference in the collected patient images, histogram equalization is used to process the image brightness. Additionally, to prevent overfitting in the training process, the image is enhanced by SimpleITK, which includes random flipping, random scaling and random rotation. All images are sampled to 256 × 256 pixels. The pixel space size of the image in the dataset is distributed between 0.546 mm/pixels and 0.625 mm/pixels.

Statistical analysis:

SPSS 26.0 was used for all statistical analyses. Frequency and percentage were used to describe categorical variables, mean and standard deviation were used to describe continuous variables conforming to a normal distribution, and median and quartile were used to describe continuous variables not conforming to a normal distribution. Differences between groups were compared using the t test, Mann-Whitney U test, or chi-square test.

Using the paired sample T test, one-way ANOVA or Kendall's W test to compare the measurement results, time spent and measurement errors between the AI model and clinicians, P < 0.05 was considered to be statistically significant.

The sensitivity, specificity, positive predictive value, negative predictive value and accuracy of the AI model, junior doctors and intermediate doctors’ diagnoses of abnormal trochlea were calculated. The area under the ROC curve (AUC) was calculated to compare the diagnostic efficacy of the AI model, junior doctors and intermediate doctors for abnormal trochlea. Additionally, the intraclass correlation coefficient (ICC) was calculated using the two-way mixed effects model to compare the absolute consistency of the diagnostic values of abnormal trochlea calculated by the AI model, junior doctors and intermediate doctors with those of senior doctors.

The kappa consistency test was used to compare the diagnostic consistency of the AI model, junior doctors, and intermediate doctors with that of senior doctors and the test-retest reliability of the AI model, junior doctors, and intermediate doctors after 2 wk.

RESULTS

General information

The age and sex of the participants were collected. Participants ranged in age from 16 to 68 years old. As shown in Table 1, there were 370 people in the training set, with a median age of 39 years, and 217 women (58.6%); there were 94 people in the test set, with a median age of 38.2 years, and 46 women (48.9%). There was no statistically significant difference in age or sex between the participants in the training set and the test set (P = 0.646, P = 0.090).

Table 1 General information of the participants.

Obvervational index	*Training set (n =* 370)**	*Test set (n =* 94)**	P value
Age (yr)	39 [28.8-50.0]	38.2 [27-48]	0.646¹
Female (n, %)	217 (58.6)	46 (48.9)	0.090²

¹Mann-Whitney U test.

²Chi-square test.

Diagnostic efficiency and accuracy

As shown in Table 2, the average time for the AI model to make a diagnosis through two levels of MRI images was only 0.14 s, which was significantly shorter than that of intermediate doctors and junior doctors. Compared with senior doctors, junior doctors, intermediate doctors and AI models had statistically significant differences in the measurement errors of the three parameters.

Table 2 Comparison of diagnosis time and error of the artificial intelligence model, junior doctors and intermediate doctors.

Obvervational index	Junior doctor	Intermediate doctor	Artificial intelligence model	P value
Diagnosis time (seconds)	102.97 ± 21.26	86.64 ± 12.14	0.14 ± 0.11	< 0.001¹
Error of Trochlear depth (mm)	0.54 [0.23-1.00]	0.54 [0.14-0.99]	0.32 [0.11-0.54]	< 0.001²
Error of asymmetry of the facet length	0.09 [0.54-0.17]	0.07 [0.03-0.16]	0.05 [0.03-0.13]	0.006²
Error of lateral trochlear inclination (°)	1.77 [0.85-2.79]	1.71 [0.90-3.02]	0.90 [0.42-1.84]	0.003²

¹One-way ANOVA.

²Kendall's W test.

AI model diagnosis effect

As shown in Table 3, taking the diagnosis results of senior doctors as a reference, the sensitivity, specificity, positive predictive value and negative predictive value of the AI model ranged from 0.74-0.96. The accuracy of the AI model was 0.88, and the AUC was 0.88. The ability of the AI model to distinguish abnormal trochlea at all MRI levels was higher than that of intermediate doctors and junior doctors. In the diagnosis of abnormal trochlea, the kappa value of the AI model was 0.76, which was highly consistent with that of senior doctors. It was better than that of intermediate doctors (0.60) and junior doctors (0.58), which had a moderate degree of consistency compared with senior doctors.

Table 3 Diagnostic effect of junior doctors, intermediate doctors and artificial intelligence models.

Group	Sensitivity	Specificity	Positive predictive value	Negative predictive value	Accuracy	AUC	Kappa
Junior doctor	0.65	0.92	0.86	0.76	0.80	0.79	0.58
Intermediate doctor	0.63	0.96	0.93	0.75	0.81	0.79	0.60
Artificial intelligence model	0.79	0.96	0.94	0.84	0.88	0.88	0.76

AUC: Area under curve.

As shown in Table 4, comparing the measurement results of the diagnostic parameters between the AI model and the senior doctor, the paired sample t test results showed that there was no significant difference in the lateral trochlear inclination and the depth of the trochlear groove (P = 0.424), but there was a significant difference in the facet ratio of medial to lateral (P = 0.024).

Table 4 Intergroup comparison of the artificial intelligence model and senior doctors.

Obvervational index	Artificial intelligence model	Senior doctor	P value
Trochlear depth (mm)	4.41 ± 1.85	4.34 ± 2.10	0.424¹
Asymmetry of the facet length	0.57 ± 0.17	0.60 ± 0.18	0.024¹
Lateral trochlear inclination (°)	16.31 ± 5.92	16.00 ± 6.02	0.235¹

¹Student's t test.

As shown in Table 5, the measurement results of the junior doctor, intermediate doctor and AI models on diagnostic parameters were compared. The results showed that the measurements of the trochlear groove depth and lateral trochlear inclination of the AI model were in good agreement with those of senior doctors (ICC = 0.91, 95%CI: 0.86-0.94, P < 0.001, ICC = 0.91 95%CI: 0.87-0.94, P < 0.001), while the measurement of the facet ratio of medial to lateral was generally consistent with that of senior doctors (ICC = 0.71, 95%CI: 0.59-0.80, P < 0.001). The measurement of the facet ratio of medial to lateral between junior doctors and intermediate doctors was less consistent with that of senior doctors (ICC = 0.57, 95%CI: 0.32-0.73, P < 0.001; ICC = 0.41 95%CI: 0.16-0.60, P < 0.001). The measurement results of the three parameters of the AI model were better than those of junior and intermediate doctors.

Table 5 Intragroup correlation coefficient and 95% confidence interval of the measures for the junior doctors, intermediate doctors and artificial intelligence models compared with senior doctors.

Group	Trochlear depth (mm)	Asymmetry of the facet length	Lateral trochlear inclination (°)
Junior doctor	0.87 (0.78-0.92)^a	0.57 (0.32-0.73)^a	0.82 (0.75-0.88)^a
Intermediate doctor	0.88 (0.82-0.90)^a	0.41 (0.16-0.60)^a	0.87 (0.79-0.92)^a
Artificial intelligence model	0.91 (0.86-0.94)^a	0.71 (0.59-0.80)^a	0.91 (0.87-0.94)^a

^aP < 0.001.

Retest reliability

As shown in Table 6, the diagnosis of abnormal trochlea by the AI model was consistent at all MRI levels, showing very high test-retest reliability. For intermediate doctors and junior doctors, the diagnosis had a high consistency, and the kappa value ranged from 0.76 to 0.78.

Table 6 The intragroup consistency of the measured parameters between junior doctors, intermediate doctors and the artificial intelligence model.

	Junior doctor	Intermediate doctor	Artificial intelligence model
Kappa	0.76	0.78	1.00

As shown in Table 7, the measurement results of the AI model for the three parameters were completely consistent (ICC = 1.00, 95%CI: 1.00 to 1.00, P < 0.001; ICC = 1.00, 95%CI: 1.00 to 1.00, P < 0.001; ICC = 1.00, 95%CI: 1.00 to 1.00, P < 0.001). The retest reliability of the measurement of the depth of the trochlear groove and lateral trochlear inclination were good between junior and intermediate doctors, with ICC values ranging from 0.89 to 0.93, but the retest reliability of the facet ratio of medial to lateral was moderate (ICC = 0.71, 95%CI: 0.60-0.80, P < 0.001; ICC = 0.62, 95%CI: 0.47-0.73, P < 0.001).

Table 7 Intragroup correlation coefficient and 95% confidence interval of the two measurements before and after for junior doctors, intermediate doctors and artificial intelligence model.

Group	Trochlear depth (mm)	Asymmetry of the facet length	Lateral trochlear inclination (°)
Junior doctor	0.93 (0.89-0.95)^a	0.71 (0.60-0.80)^a	0.90 (0.85-0.93)^a
Intermediate doctor	0.92 (0.88-0.95)^a	0.62 (0.47-0.73)^a	0.89 (0.84-0.93)^a
Artificial intelligence model	1.00 (1.00-1.00)^a	1.00 (1.00-1.00)^a	1.00 (1.00-1.00)^a

^aP < 0.001.

DISCUSSION

FTD is a progressive disease that leads to patellar instability and even patellofemoral arthritis. There are many methods to treat FTD, such as trochleoplasty[26,27]. However, trochleoplasty is not suitable for all patients with FTD. For a large number of patients without patellar instability or mild FTD (type A), conservative treatment or simple MPFL is the best choice[28,29]. Therefore, early FTD diagnosis and determination of its severity are critical for identifying patients who require observation or treatment. The diagnosis of early FTD is challenging. There are different measurement parameters to describe FTD in the related literature, including X-ray, CT and MRI. However, standard lateral radiographs of the knee joint are not common in clinical work. With the continuous development of medical imaging technology, MRI can clearly show the anatomy of the articular cartilage surface, ligaments and muscles can better show the morphology of the femoral trochlea, and has more advantages in evaluating FTD[30,31]. Therefore, our model is based on MRII, not X-ray[32-34]. Among various quantitative radiology methods to characterize the femoral trochlea, trochlear depth, asymmetry of the facet length and lateral trochlear inclination are considered the best prediction parameters, with extremely high sensitivity and specificity[35-37].

However, the manual measurement of femoral trochlea parameters is tedious, time-consuming, and easily produces great variability. The development of artificial intelligence in imaging-related fields provides a breakthrough for solving problems of the femoral trochlea. Artificial intelligence has been previously applied to the diagnosis and treatment of femoral trochlea diseases. Cerveri et al[38] combined a statistical shape model (SSM) with a stacked sparse autoencoder network to represent the individual morphology of the trochlea through a set of parameters and used these parameters to distinguish between different degrees of abnormalities and calculate the SSM of normal and dysplasia trochlea regions. Some scholars also proposed a reproducible measurement method of the 3D femoral model, quantified the knee parameters of the distal femur, and used an artificial neural network to predict the parameter values describing the geometry of the normal trochlear groove to simulate the surgical procedure of femoral trochleoplasty[39]. However, limited by the computer level at that time, the further application of artificial intelligence was not studied. However, there is no doubt that the research of Cerveri P provided evidence that artificial intelligence can be applied to the diagnosis and treatment of FTD[38].

In this study, we propose an aided diagnosis algorithm framework based on deep learning technology, which shows that deep learning achieves high performance in distinguishing normal femoral trochlea from trochlea dysplasia, which is superior to junior doctors and intermediate doctors and similar to that of senior doctors (ICC = 0.91, 95%CI: 0.87 to 0.94, P < 0.001). For the diagnosis of trochlea dysplasia, the accuracy of the AI model was 0.88, and the AUC value was 0.88. The range of sensitivity, specificity, positive predictive value and negative predictive value was 0.74-0.96, which was higher than that of junior doctors and intermediate doctors. In terms of intragroup consistency and intergroup consistency, the AI model was also superior to junior doctors and intermediate doctors. However, in terms of diagnosis time, the average time of the AI model was only 0.14 s, which was significantly shorter than that of junior doctors and intermediate doctors.

To locate the key points accurately and automatically, the thermal map regression method is used to detect the key points. An algorithm based on deep learning is developed and used to automatically measure the femoral trochlear parameters in axial MRI images. The system can accurately, efficiently and stably evaluate trochlear groove depth and lateral trochlear inclination. Its performance is superior to junior doctors and intermediate doctors and similar to that of senior doctors. The good performance of the system shows the clinical potential of artificial intelligence for assisting surgeons in the tedious process of accurately measuring femoral trochlear parameters. It is good to help young doctors with less experience detect femoral trochlear dysplasia at an early stage.

The feasibility of using deep learning to help doctors diagnose diseases on MRI of the knee joint has been well verified. The high efficiency, accuracy and repeatability of artificial intelligence can solve the problem of the large demand for diagnosis and measurement of the femoral trochlea. In addition to its widespread recognition, MRI does not involve cutting the selected sample image, thus avoiding the problem that the accuracy rate may be improved due to image cutting[23].

In addition, it was found that the measurement of lateral trochlear inclination and depth of the trochlear groove were better than the measurement of the facet ratio of medial to lateral, both in terms of intragroup consistency and in terms of the error of disease diagnosis. The reason for this is that although MRI can better observe the shape of the femoral trochlea compared with X-ray, it cannot describe the shape of the trochlear surface by only marking key points, which leads to large measurement errors and poor consistency within and between groups. Therefore, when using MRI to diagnose FTD, the selection of parameters is more inclined to measure the lateral trochlear inclination and the depth of the trochlear groove.

This study demonstrates that the deep learning algorithm can be used to assist in the diagnosis of FTD, but this does not mean that it is ready for immediate clinical practice. We evaluated the diagnostic performance of the CNN based on a single axial MRI image of the knee to maintain the project simplicity, which may not actually fully reflect the clinically relevant situation because the assessment of femoral trochlear development will involve at least two levels of MRI imaging. Finally, limited by the resolution of the 1.5T MRI axial image, the diagnostic performance of the CNN is reduced to a certain extent. Using higher resolution images can improve diagnostic accuracy.

However, there are still some deficiencies that are unavoidable in the initial stage of the research. In the future, we hope to conduct further research based on the existing data and research results, such as how to classify FTD to guide the treatment of different types.

CONCLUSION

In conclusion, this paper adopts the heatmap regression method base on deep learning to build an AI model. All values of AI model were superior to junior doctors and intermediate doctors and similar to senior doctors. Therefore, the diagnosis of FTD on knee MRI can be aided by AI and can be achieved with a high level of accuracy. AI have great potential to become a useful tool for the assisted diagnosis of orthopaedic diseases. Its greatest significance is to assist young front-line clinicians with less experience to complete the diagnosis of the disease faster and more accurately.

ARTICLE HIGHLIGHTS

Research background

Femoral trochlear dysplasia (FTD) is an important risk factor for patellar instability, with an incidence of 96% in patients with recurrent patellar instability. Magnetic resonance imaging (MRI) has become the preferred method for evaluating FTD. However, tedious and repeated measurement is essential to using these qualitative and quantitative parameters to diagnose FTD, and it easily produces considerable differences in intragroup consistency and intergroup consistency.

Research motivation

Whether artificial intelligence can be used to assist in the diagnosis of femoral trochlear dysplasia remains unclear.

Research objectives

To propose an artificial intelligence (AI) system to label and detect the key points of knee MRI to assist in diagnosing FTD quickly and accurately.

Research methods

We searched knee MRI cases, including femoral trochlear dysplasia and normal femoral trochlea, all the samples marked by doctors were divided into three sets, including the training set, the validation set and the test set. The performance of AI model to diagnose FTD was improved through continuous training and learning.

Research results

All values (The accuracy, sensitivity, specificity, etc.) were superior to junior doctors and intermediate doctors and similar to senior doctors. In terms of intragroup consistency and intergroup consistency, the AI model was also superior to junior doctors and intermediate doctors. However, diagnostic time was much lower than that of junior doctors and intermediate doctors.

Research conclusions

AI has great potential in the assisted diagnosis of orthopedic diseases. Its greatest significance is to assist young front-line clinicians with less experience to complete the diagnosis of the disease faster and more accurately.

Research perspectives

In the future, we hope to conduct further research based on the existing data and research results, such as how to classify FTD to guide the treatment of different types.

Footnotes

Provenance and peer review: Unsolicited article; Externally peer reviewed.

Peer-review model: Single blind

Specialty type: Medicine, research and experimental

Country/Territory of origin: China

Peer-review report’s scientific quality classification

Grade A (Excellent): 0

Grade B (Very good): B, B

Grade C (Good): C, C

Grade D (Fair): 0

Grade E (Poor): 0

P-Reviewer: Mijwil MM, Iraq; Mohey NM, Egypt S-Editor: Wang LL L-Editor: A P-Editor: Wang LL

References

DeVries CA, Bomar JD, Pennock AT. Prevalence of Trochlear Dysplasia and Associations with Patellofemoral Pain and Instability in a Skeletally Mature Population. J Bone Joint Surg Am. 2021;103:2126-2132. [RCA] [PubMed] [DOI] [Full Text] [Cited by in RCA: 17] [Reference Citation Analysis (0)]

2.	Parikh SN, Rajdev N, Sun Q. The Growth of Trochlear Dysplasia During Adolescence. J Pediatr Orthop. 2018;38:e318-e324. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 22] [Cited by in RCA: 23] [Article Influence: 3.3] [Reference Citation Analysis (0)]

3.	Bollier M, Fulkerson JP. The role of trochlear dysplasia in patellofemoral instability. J Am Acad Orthop Surg. 2011;19:8-16. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 120] [Cited by in RCA: 119] [Article Influence: 8.5] [Reference Citation Analysis (0)]

4.	Sales-Fernández R, Shah N. The pathologic double contour sign and the trochlea shape patterns can diagnose trochlea dysplasia. J ISAKOS. 2022;. [RCA] [PubMed] [DOI] [Full Text] [Reference Citation Analysis (0)]

Dejour DH. The patellofemoral joint and its historical roots: the Lyon School of Knee Surgery. Knee Surg Sports Traumatol Arthrosc. 2013;21:1482-1494. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 46] [Cited by in RCA: 50] [Article Influence: 4.2] [Reference Citation Analysis (0)]

Paiva M, Blønd L, Hölmich P, Steensen RN, Diederichs G, Feller JA, Barfod KW. Quality assessment of radiological measurements of trochlear dysplasia; a literature review. Knee Surg Sports Traumatol Arthrosc. 2018;26:746-755. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 51] [Cited by in RCA: 53] [Article Influence: 7.6] [Reference Citation Analysis (0)]

7.	Shen J, Qin L, Yao WW, Li M. The significance of magnetic resonance imaging in severe femoral trochlear dysplasia assessment. Exp Ther Med. 2017;14:5438-5444. [RCA] [PubMed] [DOI] [Full Text] [Full Text (PDF)] [Cited by in RCA: 5] [Reference Citation Analysis (0)]

Dong Z, Niu Y, Duan G, Song Y, Qi J, Wang F. Evaluation of Trochlear Dysplasia Severity Using Trochlear Angle: A Retrospective Study Based on Computed Tomography (CT) Scans. Med Sci Monit. 2018;24:5118-5122. [RCA] [PubMed] [DOI] [Full Text] [Full Text (PDF)] [Cited by in Crossref: 7] [Cited by in RCA: 8] [Article Influence: 1.1] [Reference Citation Analysis (0)]

Ye Q, Yu T, Wu Y, Ding X, Gong X. Patellar instability: the reliability of magnetic resonance imaging measurement parameters. BMC Musculoskelet Disord. 2019;20:317. [RCA] [PubMed] [DOI] [Full Text] [Full Text (PDF)] [Cited by in Crossref: 11] [Cited by in RCA: 22] [Article Influence: 3.7] [Reference Citation Analysis (0)]

10.

Carrillon Y, Abidi H, Dejour D, Fantino O, Moyen B, Tran-Minh VA. Patellar instability: assessment on MR images by measuring the lateral trochlear inclination-initial experience. Radiology. 2000;216:582-585. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 235] [Cited by in RCA: 238] [Article Influence: 9.5] [Reference Citation Analysis (0)]

11.	Pfirrmann CW, Zanetti M, Romero J, Hodler J. Femoral trochlear dysplasia: MR findings. Radiology. 2000;216:858-864. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 254] [Cited by in RCA: 260] [Article Influence: 10.4] [Reference Citation Analysis (0)]

12.

Tayarani N MH. Applications of artificial intelligence in battling against covid-19: A literature review. Chaos Solitons Fractals. 2021;142:110338. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 120] [Cited by in RCA: 66] [Article Influence: 16.5] [Reference Citation Analysis (0)]

13.	Mijwil MM, Aggarwal K. A diagnostic testing for people with appendicitis using machine learning techniques. Multimed Tools Appl. 2022;81:7011-7023. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 5] [Cited by in RCA: 25] [Article Influence: 8.3] [Reference Citation Analysis (0)]

14.

Litjens G, Kooi T, Bejnordi BE, Setio AAA, Ciompi F, Ghafoorian M, van der Laak JAWM, van Ginneken B, Sánchez CI. A survey on deep learning in medical image analysis. Med Image Anal. 2017;42:60-88. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 5573] [Cited by in RCA: 4893] [Article Influence: 611.6] [Reference Citation Analysis (0)]

15.	LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521:436-444. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 36149] [Cited by in RCA: 19896] [Article Influence: 1989.6] [Reference Citation Analysis (0)]

16.

Chung SW, Han SS, Lee JW, Oh KS, Kim NR, Yoon JP, Kim JY, Moon SH, Kwon J, Lee HJ, Noh YM, Kim Y. Automated detection and classification of the proximal humerus fracture by using deep learning algorithm. Acta Orthop. 2018;89:468-473. [RCA] [PubMed] [DOI] [Full Text] [Full Text (PDF)] [Cited by in Crossref: 204] [Cited by in RCA: 269] [Article Influence: 38.4] [Reference Citation Analysis (0)]

17.

Pranata YD, Wang KC, Wang JC, Idram I, Lai JY, Liu JW, Hsieh IH. Deep learning and SURF for automated classification and detection of calcaneus fractures in CT images. Comput Methods Programs Biomed. 2019;171:27-37. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 70] [Cited by in RCA: 69] [Article Influence: 11.5] [Reference Citation Analysis (0)]

18.

Urakawa T, Tanaka Y, Goto S, Matsuzawa H, Watanabe K, Endo N. Detecting intertrochanteric hip fractures with orthopedist-level accuracy using a deep convolutional neural network. Skeletal Radiol. 2019;48:239-244. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 118] [Cited by in RCA: 143] [Article Influence: 23.8] [Reference Citation Analysis (0)]

19.

Cheng CT, Ho TY, Lee TY, Chang CC, Chou CC, Chen CC, Chung IF, Liao CH. Application of a deep learning algorithm for detection and visualization of hip fractures on plain pelvic radiographs. Eur Radiol. 2019;29:5469-5477. [RCA] [PubMed] [DOI] [Full Text] [Full Text (PDF)] [Cited by in Crossref: 112] [Cited by in RCA: 158] [Article Influence: 26.3] [Reference Citation Analysis (0)]

20.

Lindsey R, Daluiski A, Chopra S, Lachapelle A, Mozer M, Sicular S, Hanel D, Gardner M, Gupta A, Hotchkiss R, Potter H. Deep neural network improves fracture detection by clinicians. Proc Natl Acad Sci U S A. 2018;115:11591-11596. [RCA] [PubMed] [DOI] [Full Text] [Full Text (PDF)] [Cited by in Crossref: 260] [Cited by in RCA: 329] [Article Influence: 47.0] [Reference Citation Analysis (0)]

21.	Zhou Z, Zhao G, Kijowski R, Liu F. Deep convolutional neural network for segmentation of knee joint anatomy. Magn Reson Med. 2018;80:2759-2770. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 129] [Cited by in RCA: 119] [Article Influence: 17.0] [Reference Citation Analysis (0)]

22.

Liu F, Zhou Z, Samsonov A, Blankenbaker D, Larison W, Kanarek A, Lian K, Kambhampati S, Kijowski R. Deep Learning Approach for Evaluating Knee MR Images: Achieving High Diagnostic Performance for Cartilage Lesion Detection. Radiology. 2018;289:160-169. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 159] [Cited by in RCA: 162] [Article Influence: 23.1] [Reference Citation Analysis (0)]

23.

Fritz B, Marbach G, Civardi F, Fucentese SF, Pfirrmann CWA. Deep convolutional neural network-based detection of meniscus tears: comparison with radiologists and surgery as standard of reference. Skeletal Radiol. 2020;49:1207-1217. [RCA] [PubMed] [DOI] [Full Text] [Full Text (PDF)] [Cited by in Crossref: 27] [Cited by in RCA: 49] [Article Influence: 9.8] [Reference Citation Analysis (0)]

24.

Shin H, Choi GS, Shon OJ, Kim GB, Chang MC. Development of convolutional neural network model for diagnosing meniscus tear using magnetic resonance image. BMC Musculoskelet Disord. 2022;23:510. [RCA] [PubMed] [DOI] [Full Text] [Full Text (PDF)] [Cited by in Crossref: 16] [Cited by in RCA: 11] [Article Influence: 3.7] [Reference Citation Analysis (0)]

25.

Tang X, Guo D, Liu A, Wu D, Liu J, Xu N, Qin Y. Fully Automatic Knee Joint Segmentation and Quantitative Analysis for Osteoarthritis from Magnetic Resonance (MR) Images Using a Deep Learning Model. Med Sci Monit. 2022;28:e936733. [RCA] [PubMed] [DOI] [Full Text] [Full Text (PDF)] [Cited by in Crossref: 16] [Cited by in RCA: 6] [Article Influence: 2.0] [Reference Citation Analysis (0)]

26.

Hiemstra LA, Peterson D, Youssef M, Soliman J, Banfield L, Ayeni OR. Trochleoplasty provides good clinical outcomes and an acceptable complication profile in both short and long-term follow-up. Knee Surg Sports Traumatol Arthrosc. 2019;27:2967-2983. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 29] [Cited by in RCA: 34] [Article Influence: 5.7] [Reference Citation Analysis (0)]

27.

Li M, Wang F, Ji G, Liu F, Fan C, Yang G, Lu J. Combined medial and lateral patellar retinaculum plasty for skeletally immature patients with patellar dislocation and low-grade trochlear dysplasia. Knee. 2020;27:9-17. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 1] [Cited by in RCA: 3] [Article Influence: 0.6] [Reference Citation Analysis (0)]

28.

Vermeulen D, van der Valk MR, Kaas L. Plaster, splint, brace, tape or functional mobilization after first-time patellar dislocation: what's the evidence? EFORT Open Rev. 2019;4:110-114. [RCA] [PubMed] [DOI] [Full Text] [Full Text (PDF)] [Cited by in Crossref: 6] [Cited by in RCA: 11] [Article Influence: 1.8] [Reference Citation Analysis (0)]

29.

Xing X, Shi H, Feng S. Does surgical treatment produce better outcomes than conservative treatment for acute primary patellar dislocations? J Orthop Surg Res. 2020;15:118. [RCA] [PubMed] [DOI] [Full Text] [Full Text (PDF)] [Cited by in Crossref: 7] [Cited by in RCA: 16] [Article Influence: 3.2] [Reference Citation Analysis (0)]

30.

Lippacher S, Dejour D, Elsharkawi M, Dornacher D, Ring C, Dreyhaupt J, Reichel H, Nelitz M. Observer agreement on the Dejour trochlear dysplasia classification: a comparison of true lateral radiographs and axial magnetic resonance images. Am J Sports Med. 2012;40:837-843. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 172] [Cited by in RCA: 180] [Article Influence: 13.8] [Reference Citation Analysis (0)]

31.

Fucentese SF, von Roll A, Koch PP, Epari DR, Fuchs B, Schottle PB. The patella morphology in trochlear dysplasia--a comparative MRI study. Knee. 2006;13:145-150. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 119] [Cited by in RCA: 110] [Article Influence: 5.8] [Reference Citation Analysis (0)]

32.

Lin DJ, Johnson PM, Knoll F, Lui YW. Artificial Intelligence for MR Image Reconstruction: An Overview for Clinicians. J Magn Reson Imaging. 2021;53:1015-1028. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 132] [Cited by in RCA: 133] [Article Influence: 33.3] [Reference Citation Analysis (0)]

33.

Chaudhari AS, Sandino CM, Cole EK, Larson DB, Gold GE, Vasanawala SS, Lungren MP, Hargreaves BA, Langlotz CP. Prospective Deployment of Deep Learning in MRI: A Framework for Important Considerations, Challenges, and Recommendations for Best Practices. J Magn Reson Imaging. 2021;54:357-371. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 33] [Cited by in RCA: 46] [Article Influence: 11.5] [Reference Citation Analysis (0)]

34.	Lundervold AS, Lundervold A. An overview of deep learning in medical imaging focusing on MRI. Z Med Phys. 2019;29:102-127. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 780] [Cited by in RCA: 800] [Article Influence: 114.3] [Reference Citation Analysis (0)]

35.	LaPrade RF, Cram TR, James EW, Rasmussen MT. Trochlear dysplasia and the role of trochleoplasty. Clin Sports Med. 2014;33:531-545. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 42] [Cited by in RCA: 35] [Article Influence: 3.2] [Reference Citation Analysis (0)]

36.

Pennock AT, Chang A, Doan J, Bomar JD, Edmonds EW. 3D Knee Trochlear Morphology Assessment by Magnetic Resonance Imaging in Patients With Normal and Dysplastic Trochleae. J Pediatr Orthop. 2020;40:114-119. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 15] [Cited by in RCA: 27] [Article Influence: 5.4] [Reference Citation Analysis (0)]

37.	Levy BJ, Tanaka MJ, Fulkerson JP. Current Concepts Regarding Patellofemoral Trochlear Dysplasia. Am J Sports Med. 2021;49:1642-1650. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 16] [Cited by in RCA: 32] [Article Influence: 8.0] [Reference Citation Analysis (0)]

38.

Cerveri P, Belfatto A, Baroni G, Manzotti A. Stacked sparse autoencoder networks and statistical shape models for automatic staging of distal femur trochlear dysplasia. Int J Med Robot. 2018;14:e1947. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 9] [Cited by in RCA: 9] [Article Influence: 1.3] [Reference Citation Analysis (0)]

39.	Nolan JE 3rd, Schottel PC, Endres NK. Trochleoplasty: Indications and Technique. Curr Rev Musculoskelet Med. 2018;11:231-240. [RCA] [PubMed] [DOI] [Full Text] [Cited by in Crossref: 29] [Cited by in RCA: 32] [Article Influence: 4.6] [Reference Citation Analysis (0)]