Categories
2022

LigGPT: Molecular Generation using a Transformer-Decoder Model

U. Deva Priyakumar, Viraj Bagal ,Rishal Aggarwal ,P. K. Vinod

Application of deep learning techniques for the de novo generation of molecules, termed as inverse molecular design, has been gaining enormous traction in drug design. The representation of molecules in SMILES notation as a string of characters enables the usage of state of the art models in Natural Language Processing, such as the Transformers, for molecular design in general. Inspired by Generative Pre-Training (GPT) model that have been shown to be successful in generating meaningful text, we train a Transformer-Decoder on the next token prediction task using masked self-attention for the generation of druglike molecules in this study. 

Categories
2022

Enhanced Sampling of Chemical Space for High Throughput Screening Applications using Machine Learning

Sarvesh Mehta ,Siddhartha Laghuvarapu ,Yashaswi Pathak ,Aaftaab Sethi ,Mallika Alvala ,U. Deva Priyakumar

In drug discovery applications, high throughput virtual screening exercises are routinely performed to determine an initial set of candidate molecules referred to as “hits”. In such an experiment, each molecule from large small-molecule drug library is evaluated for physical property such as the binding affinity (docking score) against a target receptor. In real-life drug discovery experiments, the drug libraries are extremely large but still a minor representation of the essentially infinite chemical space , and evaluation of physical property for each molecule in the library is not computationally feasible.

Categories
2022

Modified Variable Kernel Length ResNets for Heart Murmur Detection and
Clinical Outcome Prediction using Multi-positional Phonocardiogram
Recording

U Deva Priyakumar, Vijay Vignesh Venkataramani , Akshit Garg

In our approach, the PCG recordings are first downsampled to 1000 Hz before being passed through a Butterworth’s low and high pass filter to remove baseline
wanders and high-frequency noise present in the recordings. The PCG recordings are then broken down into
10-second segments and normalized to bring all trainable samples to the same size.

Categories
2022

Carbodicarbenes and Striking Redox Transitions of their Conjugate Acids: Influence of NHC versus CAAC as Donor Substituents

Prof. U. Deva Priyakumar, Dr. Ramapada Dolai, Rahul Kumar, Dr. Benedict J. Elvers, Pradeep Kumar Pal, Dr. Benson Joseph, Dr. Rina Sikari, Mithilesh Kumar Nayak, Dr. Avijit Maiti, Dr. Tejender Singh, Nicolas Chrysochos, Dr. Arumugam Jayaraman, Dr. Ivo Krummenacher, Prof. Jagannath Mondal

Herein, a new type of carbodicarbene (CDC) comprising two different classes of carbenes is reported; NHC and CAAC as donor substituents and compare the molecular structure and coordination to Au(I)Cl to those of NHC-only and CAAC-only analogues. The conjugate acids of these three CDCs exhibit notable redox properties. Their reactions with [NO][SbF6] were investigated. The reduction of the conjugate acid of CAAC-only based CDC with KC8 results in the formation of hydrogen abstracted/eliminated products, which proceed through a neutral radical intermediate, detected by EPR spectroscopy. 

Categories
2022

Efficient and enhanced sampling of drug-like chemical space for virtual screening and molecular design using modern machine learning methods

Manan Goel, Rishal Aggarwal, Bhuvanesh Sridharan, Pradeep Kumar Pal, U. Deva Priyakumar

Drug design involves the process of identifying and designing novel molecules that have desirable properties and bind well to a given target receptor. Typically, such molecules are identified by screening large chemical libraries for desirable physicochemical properties and binding strength with the target protein. This traditional approach, however, has severe limitations as exhaustively screening every molecule in known chemical libraries is computationally infeasible.

Categories
2022

Tetra-Coordinated Boron-Functionalized Phenanthroimidazole-Based Zinc Salen as a Photocatalyst for the Cycloaddition of CO2 and Epoxides

U. Deva Priyakumar, Prakash Nayak, Anna Chandrasekar Murali, Pradeep Kumar Pal

A unique B–N coordinated phenanthroimidazole-based zinc salen was synthesized. The zinc salen thus synthesized acts as a photocatalyst for the cycloaddition of carbon dioxide with terminal epoxides under ambient conditions. DFT study of the cycloaddition of carbon dioxide with terminal epoxide indicates the preference of the reaction pathway when photocatalyzed by zinc salen. We anticipate that this strategy will help to design new photocatalysts for CO2 fixation.

Categories
2022

MO-MEMES: A method for accelerating virtual screening using multi-objective Bayesian optimization

U. Deva Priyakumar, Sarvesh Mehta, Manan Goel

The pursuit of potential inhibitors for novel targets has become a very important problem especially over the last 2 years with the world in the midst of the COVID-19 pandemic. This entails performing high throughput screening exercises on drug libraries to identify potential “hits”. These hits are identified using analysis of their physical properties like binding affinity to the target receptor, octanol-water partition coefficient (LogP) and more. However, drug libraries can be extremely large and it is infeasible to calculate and analyze the physical properties for each of those molecules within acceptable time and moreover, each molecule must possess a multitude of properties apart from just the binding affinity.

Categories
2022

PLAS-5k: Dataset of Protein-Ligand Affinities from Molecular Dynamics for Machine Learning Applications

U. Deva Priyakumar, Divya B. Korlepara, C. S. Vasavi, Shruti Jeurkar, Pradeep Kumar Pal, Subhajit Roy, Sarvesh Mehta, Shubham Sharma, Vishal Kumar, Charuvaka Muvva, Bhuvanesh Sridharan, Akshit Garg, Rohit Modee, Agastya P. Bhati, Divya Nayar

Computational methods and recently modern machine learning methods have played a key role in structure-based drug design. Though several benchmarking datasets are available for machine learning applications in virtual screening, accurate prediction of binding affinity for a protein-ligand complex remains a major challenge. New datasets that allow for the development of models for predicting binding affinities better than the state-of-the-art scoring functions are important. For the first time, we have developed a dataset, PLAS-5k comprised of 5000 protein-ligand complexes chosen from PDB database. The dataset consists of binding affinities along with energy components like electrostatic, van der Waals, polar and non-polar solvation energy calculated from molecular dynamics simulations using MMPBSA (Molecular Mechanics Poisson-Boltzmann Surface Area) method.

Categories
2022

Structure-based drug repurposing: Traditional and advanced AI/ML-aided methods

Chinmayee Choudhury, N Arul Murugan, U Deva Priyakumar

The current global health emergency in the form of the Coronavirus 2019 (COVID-19) pandemic has highlighted the need for fast, accurate, and efficient drug discovery pipelines. Traditional drug discovery projects relying on in vitro high-throughput screening (HTS) involve large investments and sophisticated experimental set-ups, affordable only to big biopharmaceutical companies. In this scenario, application of efficient state-of-the-art computational methods and modern artificial intelligence (AI)-based algorithms for rapid screening of repurposable chemical space [approved drugs and natural products (NPs) with proven pharmacokinetic profiles] to identify the initial leads is a powerful option to save resources and time. Structure-based drug repurposing is a popular in silico repurposing approach. In this review, we discuss traditional and modern AI-based computational methods and tools applied at various stages for structure-based drug discovery (SBDD) pipelines. Additionally, we highlight the role of generative models in generating molecules with scaffolds from repurposable chemical space.

Categories
2022

Benchmark study on deep neural network potentials for small organic molecules

R Modee, S Laghuvarapu, UD Priyakumar

There has been tremendous advancement in machine learning (ML) applications in computational chemistry, particularly in neural network potentials (NNP). NNPs can approximate potential energy surface (PES) as a high dimensional function by learning from existing reference data, thereby circumventing the need to solve the electronic Schrödinger equation explicitly. As a result, ML accelerates chemical space exploration and property prediction compared to quantum mechanical methods. Novel ML methods have the potential to provide efficient means for predicting the properties of molecules. However, this potential has been limited by the lack of standard comparative evaluations. In this work, we compare four selected models, that is, ANI, PhysNet, SchNet, and BAND-NN, developed to represent the PES of small organic molecules. We evaluate these models for their accuracy and transferability on two different test sets (i) Small organic molecules of up to eight-heavy atoms on which ANI and SchNet achieve root mean square error (RMSE) of 0.55 and 0.60 kcal/mol, respectively. (ii) On random selection of molecules from the GDB-11 database with 10-heavy atoms, ANI achieves RMSE of 1.17 kcal/mol and SchNet achieves RMSE of 1.89 kcal/mol. We examine their ability to produce smooth meaningful surface by performing PES scans for bond stretch, angle bend, and dihedral rotations on relatively large molecules to assess their possible application in molecular dynamics simulations. We also evaluate their performance for yielding minimum energy structures via geometry optimization using various minimization algorithms. All these models were also able to accurately differentiate different isomers of the same empirical formula C10H20C10H20 . ANI and PhysNet achieve an RMSE of 0.29 and 0.52 kcal/mol, respectively, on C10H20C10H20 isomers.