In this folder you will find the script train_test.sh, with commands illustrating how to train SVM models and test their accuracies.
Please take a look at script run.sh, to see examples on different commands showcasing how to prepare feature tables for training models, as well as making predictions.
Raw data used to train the SVM models (i.e. m6A-modified and unmodified 'curlcakes') can be found at https://trace.ncbi.nlm.nih.gov/Traces/sra/?study=SRP174366
Reference sequences can be found in cc.fasta
In this folder, you will also find error and electric signal features extracted with Epinano_Variants and Epinano_Current scripts.
- middleAs means [GCT][GCT]A[GCT][GCT] kmers
- All A-bases are modified from the mod samples (i.e. curlcakes are modified at 100% stoichiometry).