Categories
2021

IMLE-Net: An Interpretable Multi-level Multi-channel Model for ECG Classification

U. Deva Priyakumar,Likith Reddy; Vivek Talwar; Shanmukh Alle; Raju. S. Bapi

Early detection of cardiovascular diseases is crucial for effective treatment and an electrocardiogram (ECG) is pivotal for diagnosis. The accuracy of Deep Learning based methods for ECG signal classification has progressed in recent years to reach cardiologist-level performance. In clinical settings, a cardiologist makes a diagnosis based on the standard 12-channel ECG recording. Automatic analysis of ECG recordings from a multiple-channel perspective has not been given enough attention, so it is essential to analyze an ECG recording from a multiple-channel perspective.

Categories
2021

MolGPT: Molecular Generation Using a Transformer-Decoder Model

U. Deva Priyakumar,Viraj Bagal, Rishal Aggarwal, P. K. Vinod

Application of deep learning techniques for de novo generation of molecules, termed as inverse molecular design, has been gaining enormous traction in drug design. The representation of molecules in SMILES notation as a string of characters enables the usage of state of the art models in natural language processing, such as Transformers, for molecular design in general. Inspired by generative pre-training (GPT) models that have been shown to be successful in generating meaningful text, we train a transformer-decoder on the next token prediction task using masked self-attention for the generation of druglike molecules in this study. 

Categories
2021

A Model of Graph Transactional Coverage Patterns with Applications to Drug Discovery

AS Reddy, PK Reddy, A Mondal, UD Priyakumar

2021 IEEE 28th International Conference on High Performance Computing, SUBMITTED.

Facilitating the discovery of drugs by combining diverse compounds is becoming prevalent, especially for treating complex diseases like cancers and HIV. A drug is a chemical compound structure and any sub-structure of a chemical compound is designated as a fragment. A chemical compound or a fragment can be modeled as a graph structure. Given a set of chemical compounds and their corresponding large set of fragments modeled as graph structures, we address the problem of identifying potential combinations of diverse chemical compounds, which cover a certain percentage of the set of fragments.

Categories
2021

MoleGuLAR: Molecule Generation Using Reinforcement Learning with Alternating Rewards

M Goel, S Raghunathan, S Laghuvarapu, UD Priyakumar

The design of new inhibitors for novel targets is a very important problem especially in the current scenario with the world being plagued by COVID-19. Conventional approaches such as high-throughput virtual screening require extensive combing through existing data sets in the hope of finding possible matches. In this study, we propose a computational strategy for de novo generation of molecules with high binding affinities to the specified target and other desirable properties for druglike molecules using reinforcement learning. 

Categories
2021

Stereomutation in Tetracoordinate Centers via Stabilization of Planar Tetracoordinated Systems

K Yadav, U Lourderaj, UD Priyakumar

The quest for stabilizing planar forms of tetracoordinate carbon started five decades ago and intends to achieve interconversion between [R]- and [S]-stereoisomers without breaking covalent bonds. Several strategies are successful in making the planar tetracoordinate form a minimum on its potential energy surface. However, the first examples of systems where stereomutation is possible were reported only recently. In this study, the possibility of neutral and dications of simple hydrocarbons (cyclopentane, cyclopentene, spiropentane, and spiropentadiene) and their counterparts with the central carbon atom replaced by elements from groups 13, 14, and 15 are explored using ab initio MP2 calculations. The energy difference between the tetrahedral and planar forms decreases from row II to row III or IV substituents. Additionally, aromaticity involving the delocalization of the lone pair on the central atom appears to help in further stabilizing the planar form compared to the tetrahedral form, especially for the row II substituents.

Categories
2021

Synthesis and reactivity of NHC-coordinated phosphinidene oxide.

D Dhara, PK Pal, R Dolai, N Chrysochos, H Rawat, BJ Elvers, I Krummenacher, H Braunschweig, C Schulzke, V Chandrasekhar, UD Priyakumar, A Jana,D Dhara, PK Pal, R Dolai, N Chrysochos, H Rawat, BJ Elvers, I Krummenacher, H Braunschweig, C Schulzke, V Chandrasekhar, UD Priyakumar, A Jana,

Here we report the synthesis of an N-heterocyclic carbene (NHC)-stabilised phosphinidene oxide by the controlled oxygenation of a phosphinidene under ambient conditions. This compound can be further oxygenated to a phosphinidene dioxide. The stoichiometric reduction of a phosphinidene oxide with KC8 resembles the pinacol coupling reaction–the reduction of a carbonyl compound. We also looked at the stoichiometric oxidation of NHC-coordinated phosphinidene, phosphinidene oxide and phosphinidene dioxide with [NO][SbF6].

Categories
2021

Desolvation of peptide bond by O to S substitution impacts protein stability .

B Khatri, S Ragunathan, S Chakraborti, Rahisuddin, S Kumaran, R Tadala, P Wagh, UD Priyakumar, J Chatterjee,

A C=O to C=S substitution in the amide bond dramatically alters the water structure around the thioamide bond, ultimately reducing the microenvironment polarity. The increased hydrophobicity of the modified peptide bond is utilized to amplify the thermostability of proteins by this single atom substitution.

Categories
2021

Mining Subgraph Coverage Patterns from Graph transactions.

AS Reddy, PK Reddy, A Mondal, UD Priyakumar,

Pattern mining from graph transactional data (GTD) is an active area of research with applications in the domains of bioinformatics, chemical informatics and social networks. Existing works address the problem of mining frequent subgraphs from GTD. However, the knowledge concerning the coverage aspect of a set of subgraphs is also valuable for improving the performance of several applications. In this regard, we introduce the notion of subgraph coverage patterns (SCPs). Given a GTD, a subgraph coverage pattern is a set of subgraphs subject to relative frequency, coverage and overlap constraints provided by the user. We propose the Subgraph ID-based Flat Transactional (SIFT) framework for the efficient extraction of SCPs from a given GTD.

Categories
2021

SCONES: Self Consistent Neural Network for Protein Stability Prediction Upon Mutation.

Y Samaga BL., S Raghunathan, UD Priyakumar,

Engineering proteins to have desired properties by mutating amino acids at specific sites is commonplace. Such engineered proteins must be stable to function. Experimental methods used to determine stability at throughputs required to scan the protein sequence space thoroughly are laborious. To this end, many machine learning based methods have been developed to predict thermodynamic stability changes upon mutation. These methods have been evaluated for symmetric consistency by testing with hypothetical reverse mutations. In this work, we propose transitive data augmentation, evaluating transitive consistency with our new Stransitive data set, and a new machine learning based method, the first of its kind, that incorporates both symmetric and transitive properties into the architecture. Our method, called SCONES, is an interpretable neural network that predicts small relative protein stability changes for missense mutations that do not significantly alter the structure. 

Categories
2021

DART – Deep Learning Enabled Topological Interaction Model for Energy Prediction of Metal Clusters and its Application in Identifying Unique Low Energy Isomers

R Modee, A Verma, K Joshi, UD Priyakumar,

Recently, machine learning (ML) has proven to yield fast and accurate predictions of chemical properties to accelerate the discovery of novel molecules and materials. The majority of the work is on organic molecules, and much more work needs to be done for inorganic molecules, especially clusters. In the present work, we introduce a simple topological atomic descriptor called TAD, which encodes chemical environment information of each atom in the cluster. TAD is a simple and interpretable descriptor where each value represents the atom count in three shells. We also introduce the DART deep learning enabled topological interaction model, which uses TAD as a feature vector to predict energies of metal clusters, in our case gallium clusters with sizes ranging from 31 to 70 atoms. The DART model is designed based on the principle that the energy is a function of atomic interactions and allows us to model these complex atomic interactions to predict the energy. We further introduce a new dataset called GNC_31–70, which comprises structures and DFT optimized energies of gallium clusters with sizes ranging from 31 to 70 atoms. We show how DART can be used to accelerate the process of identification of low energy structures without geometry optimization.