Before Data School I worked in the Molecular/Micro Laboratory space, where our team studies the effects of different diets on the gut microbiome. I spent the majority of my time developing and optimising molecular assays, so I had no prior background in either R or coding.
The main goal I wanted to achieve with my data was to practise the R skills I have learnt at Data School and to gain experience wrangling 16S sequencing data.
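Example of the code used to create my table
library(readxl)      # read_xlsx()
library(kableExtra)  # kable_styling(); also re-exports the %>% pipe
# Read the 16S sequencing data from the Excel workbook
Data <- read_xlsx("../Data/VAN_Data.xlsx")
# Show the first five rows as a striped HTML table
knitr::kable(head(Data, n = 5), format = "html", caption = "Section of 16S Sequencing VAN data") %>%
  kable_styling("striped")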
Sample | Participant | Method | Replicate | Actinobacteria | Bacteroidetes | Candidatus Saccharibacteria | Cyanobacteria/Chloroplast | Euryarchaeota | Firmicutes | Fusobacteria | Lentisphaerae | Planctomycetes | Proteobacteria | Spirochaetes | Synergistetes | Tenericutes | Verrucomicrobia |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
VAN001 | 1 | Blue_bag | a | 20930 | 1905 | 4 | 0 | 2 | 105421 | 0 | 0 | 0 | 84 | 0 | 0 | 0 | 11 |
VAN002 | 1 | Norgen | a | 12501 | 51928 | 0 | 2 | 3 | 113688 | 0 | 0 | 0 | 3757 | 0 | 0 | 0 | 4 |
VAN003 | 2 | Blue_bag | a | 12383 | 5874 | 17 | 3 | 3362 | 116150 | 3 | 0 | 0 | 984 | 0 | 0 | 0 | 820 |
VAN004 | 2 | Norgen | a | 4643 | 36435 | 15 | 6 | 1614 | 85000 | 11 | 35 | 0 | 2980 | 0 | 0 | 0 | 733 |
VAN005 | 3 | Blue_bag | a | 3769 | 26592 | 16 | 0 | 1 | 70422 | 0 | 0 | 0 | 52 | 0 | 0 | 0 | 1053 |
Rather than demonstrating the results of the project, I wanted to present a glimpse of what I have learnt to do within R.
Example of a Plot from R
Figure 1: ggplot of Data Sample
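A minimal sketch of how a plot like this could be built from the table above. It is not necessarily the code behind Figure 1. The data frame `Data_subset` is a hypothetical stand-in that retypes the five rows shown, limited to four phyla for brevity, and the column names `Phylum` and `Count` are my own choices. I am also assuming the table values are read counts. The wide table is first reshaped to long format, because ggplot2 expects one row per sample and phylum.

```r
library(tidyr)
library(ggplot2)

# Hypothetical stand-in for the real data: the five rows shown in the
# table above, restricted to four of the phyla
Data_subset <- data.frame(
  Sample         = c("VAN001", "VAN002", "VAN003", "VAN004", "VAN005"),
  Participant    = c(1, 1, 2, 2, 3),
  Method         = c("Blue_bag", "Norgen", "Blue_bag", "Norgen", "Blue_bag"),
  Actinobacteria = c(20930, 12501, 12383, 4643, 3769),
  Bacteroidetes  = c(1905, 51928, 5874, 36435, 26592),
  Firmicutes     = c(105421, 113688, 116150, 85000, 70422),
  Proteobacteria = c(84, 3757, 984, 2980, 52)
)

# Reshape from wide (one column per phylum) to long
# (one row per sample and phylum)
Data_long <- pivot_longer(
  Data_subset,
  cols      = Actinobacteria:Proteobacteria,
  names_to  = "Phylum",
  values_to = "Count"   # assumed to be read counts
)

# Stacked bars per sample; position = "fill" rescales each bar to
# proportions so samples with different totals can be compared
p <- ggplot(Data_long, aes(x = Sample, y = Count, fill = Phylum)) +
  geom_col(position = "fill") +
  facet_wrap(~ Method, scales = "free_x") +
  labs(y = "Proportion of the four phyla shown", fill = "Phylum")

p
```

Faceting by `Method` places the Blue_bag and Norgen samples in separate panels, which makes it easier to compare the two collection methods side by side.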
I came into Data School with essentially no coding knowledge and very little experience using data, so I have learnt an incredible amount during my time at Data School. These include:
It really surprised me how long it took to tidy my data and just how important the initial input of data is. I was also surprised by how versatile and powerful R can be once the data is in the correct format.
There are a vast number of ways to develop the skills I learnt at Data School. I hope to refine my coding skills so that I can do more in R, and to explore more of the packages available for it.
I have really enjoyed the Data School experience, particularly the new challenges of learning the R coding language and being exposed to new ways of analysing data. I hope to use the skills I have learnt during my time at Data School to improve the way we manage data. Data School has really changed the way I think about data and has challenged a lot of the methods I previously used to manage it. I feel that Data School has given me better methods of handling and presenting data, which will vastly improve the way I analyse and present my data.