SRR8574911
SRR8574890
SRR8574897
...
>ref_seq
ATGGAACACGACCTTGAGAGGGGCCCACCGGGCCCGCGACGGCCCCCTCGAGGACCCCCC
...
@DJB775P1:248:D0MDGACXX:7:1202:12362:49613
TGCTTACTCTGCGTTGATACCACTGCTTAGATCGGAAGAGCACACGTCTGAA
+
JJJJJIIJJJJJJHIHHHGHFFFFFFCEEEEEDBD?DDDDDDBDDDABDDCA
...
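The FASTQ record shown above has four lines: an `@` header, the bases, a `+` separator, and one quality character per base. As a minimal sketch of how such a record can be read, the snippet below parses records and decodes the quality string. It assumes the Phred+33 encoding (typical of Illumina 1.8+ output, and consistent with the `J` characters in the example), and the names `FastqRecord` and `parse_fastq` are hypothetical helpers, not part of this pipeline.

```python
import io
from typing import Iterator, List, NamedTuple, TextIO


class FastqRecord(NamedTuple):
    read_id: str          # header line without the leading "@"
    sequence: str         # base calls
    quality: List[int]    # decoded Phred scores, one per base


def parse_fastq(handle: TextIO, phred_offset: int = 33) -> Iterator[FastqRecord]:
    """Yield records from an uncompressed four-line-per-record FASTQ stream.

    phred_offset=33 is an assumption (Phred+33); adjust it for older encodings.
    """
    while True:
        header = handle.readline()
        if not header:
            return  # end of file
        header = header.rstrip("\n")
        if not header:
            continue  # tolerate blank lines between records
        sequence = handle.readline().rstrip("\n")
        separator = handle.readline().rstrip("\n")
        quality_line = handle.readline().rstrip("\n")
        if not header.startswith("@") or not separator.startswith("+"):
            raise ValueError(f"Malformed FASTQ record near: {header!r}")
        if len(sequence) != len(quality_line):
            raise ValueError(f"Sequence/quality length mismatch in: {header!r}")
        scores = [ord(char) - phred_offset for char in quality_line]
        yield FastqRecord(header[1:], sequence, scores)


# Usage with the example record from this README.
example = io.StringIO(
    "@DJB775P1:248:D0MDGACXX:7:1202:12362:49613\n"
    "TGCTTACTCTGCGTTGATACCACTGCTTAGATCGGAAGAGCACACGTCTGAA\n"
    "+\n"
    "JJJJJIIJJJJJJHIHHHGHFFFFFFCEEEEEDBD?DDDDDDBDDDABDDCA\n"
)
for record in parse_fastq(example):
    print(record.read_id, len(record.sequence), min(record.quality))
```

For real data (often gzip-compressed), an established parser such as Biopython's `SeqIO` is usually a safer choice than a hand-written reader.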
./data/fastqc/
./data/clean/
./data/fastqc_clean/
./public/reference/
./data/fastqbind/
Sequence alignment results of successful splicing: ./result/bwasam/
Sequence alignment results of failed splicing: ./result/unbindsam/
Successful splicing sequence: ./result/cleansample
Failed splicing sequence: ./result/unbindcleansample
Merged file: ./result/finaldata/
Read counts for each site: ./result/finaldata/count/
Feature results: ./feature/data/raw: mutation, count, entropy_nuc, entropy_slide, entropy_group
Number of reads contained in each site: ./feature/data/each
Depth-limited data (limiting the sequencing depth makes it easier to build the optimal machine learning model later): ./feature/data/file
./feature/data/sampledata
| Sample | Group | NC_site | Mutation | Shannon_entropy |
|---|---|---|---|---|
| 8 | NPC | 14 | 9.92950054612253e-05 | 0.0014636622713465 |
| 3 | NPC | 9 | 8.858965272856131e-05 | 0.0013204405924585 |
| 11 | non_NPC | 253 | 1.0 | 0.0240075762881998 |
| 15 | non_NPC | 317 | 1.0 | 0.0073675609537136 |
| 2 | non_NPC | 38 | 1.0 | 0.0188142080168351 |
| 11 | non_NPC | 317 | 1.0 | 0.0 |
| 1 | non_NPC | 317 | 1.0 | 0.0033416393172456 |
| 2 | non_NPC | 317 | 1.0 | 0.0 |
| 16 | non_NPC | 317 | 1.0 | 0.0 |
| 16 | non_NPC | 253 | 1.0 | 0.0291534793859807 |
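The `Shannon_entropy` column measures how mixed the bases are at a site. As a rough illustration only, the sketch below computes Shannon entropy from per-base read counts at one site. The function name `shannon_entropy`, the input layout (a mapping from base to read count), and the choice of log base 2 are all assumptions; the pipeline's own feature scripts may use a different base or normalization.

```python
import math
from typing import Mapping


def shannon_entropy(base_counts: Mapping[str, int]) -> float:
    """Shannon entropy (in bits) of the base distribution at one site.

    base_counts maps each observed base to its read count, e.g. {"A": 990, "G": 10}.
    Log base 2 is an assumption made for this sketch.
    """
    total = sum(base_counts.values())
    if total <= 0:
        return 0.0  # no coverage: treat entropy as zero
    entropy = 0.0
    for count in base_counts.values():
        if count > 0:  # bases with zero reads contribute nothing
            proportion = count / total
            entropy -= proportion * math.log2(proportion)
    return entropy


# A site covered by a single base has zero entropy;
# a small minority allele gives a small positive value.
print(shannon_entropy({"A": 1000}))
print(shannon_entropy({"A": 990, "G": 10}))
```

Under this definition, an entropy of 0.0 means every read at the site carries the same base, whether or not that base matches the reference.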