The pharmaceutical R&D landscape continues to evolve rapidly with advances in computational technologies and artificial intelligence. Our recent Q1 2025 scientific webinar brought together industry veterans Dr. Chris Waller and Dr. Dimitris Agrafiotis to discuss the current state and future directions of drug discovery informatics in large pharmaceutical organizations which was led by CDD's Founder and CEO, Dr. Barry Bunin.
This blog post summarizes the critical insights shared during this informative session.
A recurring theme throughout the webinar was the fundamental importance of high-quality, structured data. As Dr. Agrafiotis emphasized, the substrate for effective AI implementation is "clean, good, reliable data in a format that is machine learnable." This seemingly mundane aspect of informatics is often overlooked in discussions dominated by algorithmic advances, yet remains the cornerstone of successful drug discovery platforms.
Dr. Agrafiotis noted that a successful informatics system must "create a structured, organized repository where data would be available at the speed of light in an analysis-ready format." This approach ensures that scientists can access and utilize critical information without unnecessary delays or preprocessing steps.
The discussion highlighted significant differences in how large pharmaceutical companies and biotechs approach informatics innovation:
Despite these differences, both environments share a common challenge: transforming raw data into actionable insights that drive decision-making in drug discovery pipelines.
A significant focus at CDD has been making sophisticated computational tools accessible to bench scientists. Our "complexity simplified" philosophy aims to ensure that any chemist or biologist can interact with data in an intuitive manner.
The webinar demonstrated several practical implementations of this approach, including:
As Chris Waller noted, the goal is to create systems where scientists can access powerful tools "just like surfing the web," without requiring specialized informatics expertise.
The webinar highlighted an important advancement in computational drug discovery: chemically-aware artificial intelligence. CDD's approach ensures that AI-generated outputs (such as bioisosteric replacements) are constrained by chemical feasibility.
This means avoiding problematic structures like strained ring systems or compounds with unfavorable physicochemical properties. By integrating medicinal chemistry knowledge into AI algorithms, the system generates suggestions that align with scientific intuition while still exploring novel chemical space.
The webinar acknowledged the valuable contributions of open-source projects to the advancement of cheminformatics. Examples mentioned included:
These collaborative efforts have accelerated innovation across the industry and provided a foundation for more specialized commercial applications.
Looking ahead, the discussion touched on emerging concepts like agentic workflows, where AI-driven agents could help scientists navigate complex informatics tasks through natural language interaction. This represents the next frontier in democratizing access to sophisticated tools.
An important insight was that AI technologies can simultaneously benefit experts (allowing them to work at higher levels of sophistication) and non-experts (providing guidance and reducing barriers to entry). This dual impact may help address talent gaps in specialized areas of drug discovery.
A key differentiator for modern informatics platforms is their ability to capture and integrate proprietary data across the drug discovery process. As described in the webinar, CDD Vault provides capabilities for:
This comprehensive approach creates a "moat" or barrier to entry by preserving institutional knowledge and facilitating data-driven decision making.
Despite the remarkable advances in computational methods, drug discovery remains fundamentally empirical. The process involves multiple levels of modeling: "a model of a model" - from molecular interactions to cellular effects to animal models and ultimately human clinical outcomes.
The role of informatics is not to replace this experimental cascade, but to make it more efficient, more informed, and ultimately more successful in delivering new therapeutics to patients.
If you would like to explore CDD Vault's capabilities further, please submit a demo request to discuss of how our platform can address your specific drug discovery informatics needs.