For this post, I’d like to share the thoughts of my colleague and good friend, Dr Brenda Finney as she has a pretty interesting take on the use and re-use of data in the drug development pipeline. The introduction of SEND has contributed to the proliferation of such electronic data and so I thought it would make an interesting addition to the blog. Here are Brenda’s words:
In the vast landscape of the digital world, data touches and flows through every aspect of our connected existence. Although I have heard some people say that data is the new oil, I rather like to imagine this continuous flow of data as a rain cycle, where information is generated, collected, stored, and then reused in a perpetual loop. In this way, I can compare it to the water cycle that sustains life on Earth. In our data-driven ecosystem, data production is akin to rain, and understanding its journey is crucial for harnessing its full potential. This might be a bit metaphoric for some, but please stick with me.
We know that research and development data is generated from various sources – in vitro experiments, in vivo experiments, different analyzers, laboratories and more. This constant downpour of information shapes the environment in which we operate, influencing decisions, innovations, and everyday experiences. Data production, like rainfall, is both diverse and dynamic, contributing to an ever-expanding pool of information.
As rainwater converges into rivers and streams, data from different sources gathers into channels of data acquisition systems, databases and data lakes. Ideally this movement of data from acquisition to collection should be smooth and, if at all possible, automatic rather than manually facilitated. The resulting reservoirs ensure that the valuable information doesn’t get lost. Data lakes can be vast repositories capable of storing raw, unstructured data, reflecting the potential in the rain of information.
Once stored, data reservoirs become a resource waiting to be used or reused. In the same way that rainwater is used to irrigate crops, stored data can be employed to inform additional experiments, drive innovations, and make informed decisions. This reuse of data is a crucial aspect of the cycle, contributing to the sustainability and growth of the digital landscape. Indeed, this is a key reason we are working to bring data together in the Centrus® platform for analysis and improving predictive toxicological assessments. The process of bringing this information together filters and transforms the data, turning it into an even more valuable and accessible resource.
Archived data, like an ancient glacier, holds historical information that is not forgotten, but is dormant and possibly hard to access. These frozen data repositories could hold references for understanding past trends, patterns, and insights. We want to help our collaborators unlock the potential in these frozen resources. The insights that could be gained from this are a tantalizing prospect, allowing us to learn from the past and shape the future.
Understanding the data cycle – from production to storage and reuse – is essential in our data-centric world. Much like the water cycle sustains and influences life on Earth, the data cycle fuels innovation, leads to discoveries, and shapes the way we navigate the digital landscape. As we embrace the continuous downpour of data, let’s not only appreciate its immediate impact, but also recognize the potential it holds when stored, processed, and reused. In this ever-evolving ecosystem, the key to unlocking a future rich in insights and possibilities is to harness the power of the data cycle and, here at Instem, we want to be a collaborative partner making that happen.
