Logo of halHAL - Archives Ouvertes - Home page.
Conf Proc IEEE Eng Med Biol Soc. 2008; 1: 394–397.
doi: 10.1109/IEMBS.2008.4649173.

In medicine, the knowledge of experts is a mixture of textbook knowledge and experience through real life clinical cases. Consequently, there is a growing interest in case-based reasoning (CBR), introduced in the early 1980s, for the development of medical decision support systems [1]. The underlying idea of CBR is the assumption that analogous problems have similar solutions, an idea backed up by physicians’ experience. In CBR, the basic process of interpreting a new situation revolves around the retrieval of relevant cases in a case database. The retrieved cases are then used to help interpreting the new one.

We propose in this article a CBR system for the retrieval of medical cases made up of a series of images with contextual information: a class of CBR problems which has hardly been treated. The proposed system is applied to a Diabetic Retinopathy (DR) multimedia database built up in our laboratory; to diagnose DR, physicians analyze series of multimodal photographs together with contextual information such as the patient age, sex and medical history. To show that the method is generic, we also applied it to DDSM, a public access database for screening mammography; to screen mammography, physicians analyze two views of each breast, with associated contextual information.

When designing a CBR system to retrieve such cases, several problems arise. We have to aggregate heterogeneous sources of evidence (images, contextual information) and to manage missing information. These sources may be uncertain and conflicting. As a consequence, we applied the Dezert-Smarandache Theory (DSmT) of plausible and paradoxical reasoning, proposed in recent years [2], which is well suited to fuse uncertain, highly conflicting and imprecise sources of evidence.

Images have a varying definition, of about 2000 pixels/line for 5000 lines/image. An example of image sequence is given in figure 2. Each patient file has been graded by a physician. Patients are then classified in three groups: normal, benign and cancer. The distribution of grades among the patients is given in table I.
B. Including images in the retrieval system
To include images in the proposed retrieval system, we have to define a distance measure between images and to cluster images acquired at a given imaging modality into a finite number of groups. For this purpose, we follow the usual steps of Content-Based Image Retrieval (CBIR) [5]: 1) building a signature for each image (i.e. extracting a feature vector summarizing their numerical content), and 2) defining a distance measure between two signatures. Thus, measuring the distance between two images comes down to measuring the distance between two signatures. We can then cluster similar image signatures according to the defined distance measure.

In previous studies, we proposed to compute a signature for images from their wavelet transform (WT) [6]. These signatures model the distribution of the WT coefficients in each subband of the decomposition. The associated distance measure d [6] computes the divergence between these distributions. We used these signature and distance measure to cluster similar images.

Any clustering algorithm can be used, provided that the distance measure between feature vectors can be specified. We used FCM (Fuzzy C-Means) [7], one of the most common algorithms, and replaced the Euclidian distance by d.

C. Dezert-Smarandache Theory
The Dezert-Smarandache Theory (DSmT) allows to combine any type of independent sources of information represented in term of belief functions. It is more general than probabilistic fusion or Dempster-Shafer theory (DST). It is particularly well suited to fuse uncertain, highly conflicting and imprecise sources of evidence [2].
  • The fusion model: let θ = {θ1, θ2, …} be a set of hypotheses under consideration for a fusion problem; θ is called the frame of discernment. In DST, these hypotheses are assumed incompatibles (constrained model ℳ0(θ)), while in DSmT they are not (free model ℳf (θ)): if θ = {“blue”, “red”}, we may model objects that are both blue and red (magenta). Thus, in DSmT, a belief mass m(A) is assigned to each element A of the hyper-power set D(θ), i.e. the set of all composite propositions built from elements of θ with ∩ and ∪ operators, such that m(∅) = 0 and ΣAD(θ)m(A) = 1; m is called a generalized basic belief assignment (gbba). Nevertheless, it is possible to introduce constraints in the model [2] (hybrid model ℳ(θ)): we can specify pairs of incompatible hypotheses (θa, θb), i.e. each subset A of θaθb must have a null mass, noted AC(θ).
  • The fusion operators and the decision functions: to fuse information in the DSmT framework, the user specifies a gbba mj for each source of evidence Sj, j = 1..N. Then, these gbba are fused into a global gbba mf, according to a given rule of combination. Several rules have been proposed to combine mass functions, including the hybrid rule of combination or the PCR (Proportional Conflict Redistribution) rules [2].
  • The decision functions: once the fused gbba mf has been computed, a decision function (such as the credibility, the plausibility or the pignistic probability) is used to evaluate the probability of each hypothesis. The pignistic probability BetP, a compromise between the other two functions, is used since it provides the best system performance:
D. Dezert-Smarandache Theory Based Retrieval
  • Outline of the method: let cq be a case placed as a query by the user. We want to rank the cases in the database by decreasing order of relevance for cq. In that purpose, for each case ci in the database, i = 1..M, we estimate the degree of relevance of ci for cq, noted DR(ci, cq); then we rank the cases ci by decreasing order of DR(ci, cq), and the top five results are returned to the user. To compute DR(ci, cq), we see each case descriptor as a source of evidence: the degrees of relevance are first estimated for each case feature Fj, j = 1..N, noted DRj(ci, cq) (as described in section II-D.3), they are then translated into gbba (see section II-D.4) which are fused in the DSmT framework. Finally, DR(ci, cq) is estimated from the fused gbba by the pignistic probability (see equation 1). The procedure is summed up in figure 5.
  • The model: the following frame of discernment is used for the fusion problem: θ = {C1,C2,,CM}, where Ci is the hypothesis “ci is relevant for cq”, i = 1..M. The cardinal of D(θ) is hyper-exponential in M (it is majored by 22M). As a consequence, from computational considerations, it is necessary to include constraints in the model. These constraints are also justified by logical considerations. Indeed, it is not possible for the case cq to be similar to both cases ca and cb if
    • ca and cb are dissimilar, or
    • ca and cb have different severity levels
    In those cases, we set AC(θ) ∀ ACaCb. To build the model ℳ(θ), we first define a non-oriented graph Gc = (V,E), that we call compatibility graph; each vertex vV represents an hypothesis and each edge eE a couple of compatible hypothesis. To build Gc, we link the hypothesis associated with each case ci to the hypothesis associated with its l nearest neighbors at the same severity level. The distance measure used to find the nearest neighbors is simply a linear combination of heterogeneous distance functions (one for each case feature Fj - see section II-B for images), managing missing values [8]. We noticed that the complexity of the fusion operators mainly depends on the cardinality of the largest clique in Gc (a clique is a set of vertices V such that for every two vertices in V, there exists an edge connecting the two). The number l is closely related to the cardinality of the largest clique in Gc and consequently to the complexity of the fusion operation. Finally, the hybrid model ℳ(θ) is built from Gc by identifying all the cliques in Gc (see figure 4).
  • The degree of relevance: to compute the degree of relevance DRj(ci, cq) that a case ci is relevant for cq, given the feature Fj, we first define a finite number of states fjk for Fj. Then we compare the membership degree of ci (αjk(ci)) and of cq (αjk(cq)) to each state fjk. If Fj is a discrete variable, we associate a state with each possible value for Fj and αjk is a Boolean function. If Fj is an image attribute, αjk(c) is the membership degree of a case c to the kth image cluster (see section II-B - the same clustering algorithm is used for simple continuous variables such as age). We assume that the state of the cases in the same class are predominantly in a subset of states for Fj. So, in order to estimate the conditional probabilities, we use a correlation measure Sjk1k2 between two feature states fjk1 and fjk2, regarding the class of the cases at these states. To compute Sjk1k2, we first compute the mean membership Djk1c (resp. Djk2c) of cases in a given class c to the state fjk1 (resp. fjk2) (equation 2):
    the expression of DRj(ci, cq) is given by the following equation:
  • The generalized basic belief assignments (gbba): The gbba mj is then defined as follows:
    • We identify the cases (ci)i=1..M′≤M such that DRj(ci, cq) > τj,
    • We set ,
    • We set .
To find τj, we define the following test Tj=“if DRj(ci, cq) > τj then Ci is true, otherwise Ci is false”. The sensitivity (resp. the specificity) of Tj represents the degree of confidence in a positive (resp. negative) answer to Tj. Tj is relevant if it is both sensitive and specific. As τj increases, sensitivity increases and specificity decreases. So, we set τj as the intersection of the two curves “sensitivity according to τj” and “specificity according to τj ”, using a dichotomic search. We set mj0 as the sensitivity of Tj. is assigned the degree of uncertainty of Tj. Note that a single setup (τj, mj0) is used for each feature, whatever ci and cq.

The mean precision at five (mp5) of the system, i.e. the mean number of relevant cases among the top five results, is 78.6% for DRD and 82.1% for DDSM. As a comparison, the mp5 obtained by CBIR (when cases are made up of a single image), with the same image signatures, is 46.1% for DRD and 70.0% for DDSM [6]. To evaluate the contribution of the proposed system for the retrieval of heterogeneous and incomplete cases, it is compared to the linear combination of heterogeneous distance functions [8] that was used to build the model. A mp5 of 52.3% was achieved by this method for DRD and of 71.4% for DDSM. To assess the robustness of the method regarding missing information, 1) we generated artificial cases from each case in the database by removing attributes, 2) we placed sequentially each artificial case as a query to the system and 3) we plotted on figure 2 the precision at five of these queries according to the number of available attributes.