I am a Senior Research Scientist in the Coke Making Team at QCAT in QLD. My team operates two pilot scale coke ovens and characterises the coke that we produce to support metallurgical coal producers with their marketing efforts. My team deals with quite a large amount of data in excel to facilitate the reporting of process data and coke test results to our clients. Prior to data school I had no experience in coding in R.
The Coke Reactivity Test is routinely carried out at QCAT by the coke making team to assess the quality of coke produced in the Research Coke Oven and Non Recovery Coke Oven.
A 20mm sample of coke is reacted with Carbon Dioxide for a period of two hours at a temperature of 1100oC, The mass loss is indicated by the Coke Reactivity Index (CRI). The sample is then tumbled to determine the Coke Strength after Reaction (CSR) which is the mass of coke that remains greater than 10mm in size. The indices indicate the behavior of coke in the iron-making blast furnace. A coke that has a low CRI and high CSR is highly regarded in the market.
My project is to validate the implementation of automated temperature control for the Coke Reactivity Test. The questions that we set out to answer through validation testing were; Is there an improvement in the repeatability of the test results? and Is there a difference in the test results between manual and automated control?
The data collected for this project was both test result data recorded in excel and process data extracted from the control system as a text file. Figure 1 shows the temperature of the sample over the length of the test, the temperature must be controlled within a tight range which is why a PID control system is useful.
Figure 1: Example process plot
Three coke samples with different reactivity were included in the validation program. Figure 2 (below) shows the distinct groupings of results, at first glance it also shows that the grouping of results for tests run in automated mode are tighter, indicating that the automated method improves the repeatability of the results. A student’s t- test should be used to determine if the differences in the means of the CRI and CSR between the automated and manual methods for each sample are significant.
Figure 2: CRI vS CSR for Control Samples
Further plotting in Figure 3 (below) showed that there is an operator effect, this is not an unexpected finding as in the manual method the temperature is controlled by the operator. An additional source of variation in test results between a single coke is sample selection.
Figure 3: CRI vS CSR for Control Samples by operator
Throughout the course of Data School I have learned to use the Tidyverse and the packages within it (e.g.dplyr, readr and tidyr) as well as readxl to import and manipulate data. I have also used ggplot2 to visualise my data. Additionally I have learnt how to use of Git and Github to track changes and collaborate on R projects. I also discovered a fun function called beep()
that makes a noise when a script has sucessfully executed.
I spent quite a bit of time tidying and joining my data into a single data frame using the bind_rows()
functions. I also added additional columns of calculated results data using the mutate()
function. I found it most challenging to deal with time series data, using the Lubridate package.
I’d like to develop my skills in R coding further and utilise R to conduct data analysis and to generate plots in future research projects (and to finalise the commissioning of the automated control system). I’m also interested in using R to make it easier and quicker to plot and present untidy data collected by my team to monitor equipment over time.
I have enjoyed the experience of learning in a supportive and collaborative environment with people so willing to share their skills, the other really wonderful aspect of data school is that it brings together a diverse group of people from across the country and organisation. I have also appreciated the opportunity to spend dedicated time on learning a skill that will set me up for success as a researcher and team leader.