You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: annotation_pipeline/README.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,13 +8,13 @@ At the IMPC, genetic effects are identified using three statistical analysis pla
8
8
## Continuous data
9
9
Continuous data are typically analysed using a linear mixed model framework. These continuous measurements are particularly informative because the direction of change can be determined through the effect size.
10
10
11
-
However, due to the complexity of the data, not all continuous variables can be analysed using this framework. In such cases, the IMPC often employs the Reference Range Plus (RR) method. Control data are first discretising into three categories: low, normal, and high. Mutant data points are then classified into one of these reference categories. Finally, a Fisher's Exact Test is applied to determine whether there is a statistically significant deviation from the normal category.
11
+
However, due to the complexity of the data, not all continuous variables can be analysed using this framework. In such cases, the IMPC often employs the Reference Range Plus (RR) method. Control data are first discretised into three categories: low, normal, and high. Mutant data points are then classified into one of these reference categories. Finally, a Fisher's Exact Test is applied to determine whether there is a statistically significant deviation from the normal category.
12
12
13
13
## Categorical data
14
14
Categorical data in the IMPC encompasses a range of qualitative measurements and is analysed using Fisher’s Exact Test, as implemented in the R package OpenStats.
15
15
16
16
# How IMPC Annotation Pipeline Works
17
-
The `annotationChooser` function is processes statistical analysis results called statpacket. It determines calls based on significance levels. These calles are then mapped to Mammalian Phenotype (MP) ontology terms using a provided `mp_chooser_file`. Finally, it updates the input statpacket's JSON component with the identified MP terms. If no relevant annotation is found or the statistical result is not significant, it returns the original statpacket with no MP terms added.
17
+
The `annotationChooser` function processes statistical analysis results called statpackets. It determines calls based on significance levels. These calls are then mapped to Mammalian Phenotype (MP) ontology terms using a provided `mp_chooser_file`. Finally, it updates the input statpacket's JSON component with the identified MP terms. If no relevant annotation is found or the statistical result is not significant, it returns the original statpacket with no MP terms added.
18
18
19
19
The annotation pipeline requires a reference table that summarises the available MP terms for a given IMPC parameter. This reference can be retrieved from [IMPReSS](https://www.mousephenotype.org/impress/index).
20
20
The ETL pipeline handles this by generating the `mp_chooser.json` file.
@@ -30,7 +30,7 @@ The ETL pipeline handles this by generating the `mp_chooser.json` file.
30
30
- UNSPECIFIED
31
31
32
32
In the `mp_chooser.json` file each MP term can have different levels:
33
-
- Ontology term levels: ABNORMAL, INCREASE, DECREASE.
33
+
- Ontology term levels: ABNORMAL, INCREASED, DECREASED.
34
34
- Sex levels: FEMALE, MALE and UNSPECIFIED.
35
35
36
36
MP term assignment logic can be seen below:
@@ -48,7 +48,7 @@ MP term assignment logic can be seen below:
48
48
}
49
49
}%%
50
50
graph TD;
51
-
Start{Which method is used for the analysis?} --> |MM| MM[Prioritise INCREASE/DECREASE MP term] --> A
51
+
Start{Which method is used for the analysis?} --> |MM| MM[Prioritise INCREASED/DECREASED MP term, otherwise use ABNORMAL] --> A
52
52
Start --> |FE or RR| FE_RR[Only use ABNORMAL MP term] --> A
53
53
54
54
A{"Is FEMALE/MALE specific MP term available in the mp_chooser file?"}
0 commit comments