Boston Area Group for Informatics and Modeling: November 2020

About BAGIM

BAGIM is an active community of Boston area scientists bringing together people from diverse fields of modeling and informatics to impact life and health sciences. BAGIM strives to create a forum for great scientific discussions covering a wide range of topics including data management, visualization, computational chemistry, drug discovery, protein structure, molecular modeling, structure-based drug design, data mining, software tools, and the sharing of goals and experiences. Our community is made up of participants from academia, government, and industry whose goal is to engage in the discussion of science involving a synthesis of theory and technology. Discussions sponsored by BAGIM are targeted to the needs and interests of informatics scientists, computational chemists, medicinal chemists, and statisticians. BAGIM also provides opportunities for networking within these disciplines as well as an arena for the dissemination of information of specific interest to the membership.

Wednesday, November 4, 2020

Andrew Dalke: Cheminformatics before there was Cheminformatics

Join us on Thursday, October 29, 2020

Cheminformatics Before "Cheminformatics"

The roots of modern cheminformatics, which I regard as the machine-based organization, analysis, and prediction of chemical structures and their properties, especially as applied to drug discovery, stretch back to the 1940s when punched cards were used to find structure-activity relationships and to search for chemical documentation. Computerization started in the 1950s, with large-scale projects taking place in the 1960s and commercial systems by the 1970s. The process required developing new forms of chemical nomenclature, new methods for data entry, and new algorithms like canonicalization and substructure matching. These were primarily funded by the cost savings for finding known chemical information. While it has long been known that similar molecules tend to have similar properties, it wasn't until the 1980s that there was widespread research into different ways to automate similarity and apply it to property prediction. In parallel, Moore's law made it possible to work with corporate-sized data sets in memory, allowing an orders-of-magnitude increase in effective performance and enabling the large-scale clustering and other early machine learning projects of the 1990s, and in turn helping the field transform from chemical documentation and chemical information management to cheminformatics.

My presentation will highlight some of the people, ideas, hardware, software, and economic factors from the 50 years of cheminformatics before the modern label was coined and hopefully show both how far we've come and how little things have changed.

Andrew Dalke started with molecular dynamics in 1992 as a summer programming job at Florida State University before starting graduate school at UIUC. He joined Klaus Schulten's group where he was one of the co-authors of VMD and NAMD. He was the full-time VMD developer for two years then went west to the Bay Area to develop modeling and bioinformatics software for the Molecular Applications Group then to the southwest to Santa Fe to develop cheminformatics software to support machine learning at Bioreason. At the same time he promoted Python and open source software in bioinformatics and cheminformatics, including co-founding the Biopython project and supporting the Open Bioinformatics Foundation. He has been an independent consultant since 1999. His income comes from custom software development for cheminformatics, Python training for cheminformatians, and sales of chemfp, a high-performance fingerprint similarity search package.

Andrew lives in Trollhättan, Sweden

Presentation https://youtu.be/y6dUkCxlrd8

Future Directions in Medicinal Chemistry

September 29: Noon GMT-5 (Eastern Standard Time)

In collaboration with Novartis, BAGIM is pleased to announce a our first multi-continent panel on current issues in the pursuit of medicinal chemistry.

Medicinal chemistry is a challenging research area crossing multiple scientific boundaries. Finding and optimizing chemical entities toward a drug candidate is a process in which complex and multi-factorial problems are addressed. Not only do potent compounds need to be synthetically feasible, but they also need to fulfill an acceptable balance between efficacy, and safety, while being unique enough to be marketable.

Today’s medicinal chemist needs to be akin to a renaissance man, knowledgeable in different fields, able to work effectively with other scientists on drug discovery campaigns, able to hunt for needles in haystacks and communicate disparate with strong communication skills to allow efficient internal and external collaborations.

Much effort, both technical and financial has been made to support the early drug discovery workflows. High throughput experimental techniques, lab automation, and computational prediction tools hold big promises and are still actively developed in drug research programs. The continued development of which still suggests there is still much room for improvement.

In this panel discussion, we discuss:

(1) the main challenges medicinal chemists still face in their research as of today,

(2) among those challenges, which ones would deserve more attention to improve the efficiency of drug discovery research programs.

(3) Overall, what would a happy medicinal chemist look like within the next 10 years?

Keynote Speaker: Derek Lowe (20 minutes)

Panelists (20 minutes each)

* Anthony Donofrio (Merck)

* Peter Pschmidtke (Discngine/Novartis Collaboration)

* Conner Colley (MIT Chemical Engineering)

Followed by a moderated Q/A session.