Browsing by Author "Barigye, Stephen J."
Now showing 1 - 20 of 50
Results Per Page
Sort Options
Item 2D-Discrete Fourier Transform: Generalization of the MIA-QSAR strategy in molecular modeling(Chemometrics and Intelligent Laboratory Systems, 2015) Barigye, Stephen J.; Freitas, Matheus P.Adequate alignment of chemical structure images with respect to the basic scaffold in a series of chemical compounds constitutes an indispensable requirement for constructing multivariate images (MVIs) and subsequent molecular modeling using the Multivariate Image Analysis applied to Quantitative Structure–Activity Relationship (MIA-QSAR) approach. However, up to the moment, this alignment procedure has been manually performed, based on subjective ocular precision. The 2D-Discrete Fourier Transform (2D-DFT) is introduced as a strategy for creating a common base to construct MVIs for chemical structures using their magnitude spectra. The utility of magnitude spectra in QSAR studies has been evaluated through models for the antimalarial, anticancer and trichomonicidal activity of a series of 2, 5-diaminobenzophenone, 4-phenylpyrrolocarbazole and benzimidazole derivatives, respectively, yielding satisfactory results comparable to superior to those reported in the literature. It is anticipated that this strategy should enable the application of the MIA-QSAR approach to structurally diverse datasets other than a series of congeneric datasets.Item Antiprotozoan lead discovery by aligning dry and wet screening: Prediction, synthesis, and biological assay of novel quinoxalinones(Bioorganic & medicinal chemistr, 2014) Martins Alho, Miriam A.; Marrero-Ponce, Yovani; Barigye, Stephen J.; Meneses-Marcel, AlfredoProtozoan parasites have been one of the most significant public health problems for centuries and several human infections caused by them have massive global impact. Most of the current drugs used to treat these illnesses have been used for decades and have many limitations such as the emergence of drug resistance, severe side-effects, low-to-medium drug efficacy, administration routes, cost, etc. These drugs have been largely neglected as models for drug development because they are majorly used in countries with limited resources and as a consequence with scarce marketing possibilities. Nowadays, there is a pressing need to identify and develop new drug-based antiprotozoan therapies. In an effort to overcome this problem, the main purpose of this study is to develop a QSARs-based ensemble classifier for antiprotozoan drug-like entities from a heterogeneous compounds collection. Here, we use some of the TOMOCOMD-CARDD molecular descriptors and linear discriminant analysis (LDA) to derive individual linear classification functions in order to discriminate between antiprotozoan and non-antiprotozoan compounds as a way to enable the computational screening of virtual combinatorial datasets and/or drugs already approved. Firstly, we construct a wide-spectrum benchmark database comprising of 680 organic chemicals with great structural variability (254 of them antiprotozoan agents and 426 to drugs having other clinical uses). This series of compounds was processed by a k-means cluster analysis in order to design training and predicting sets. In total, seven discriminant functions were obtained, by using the whole set of atom-based linear indices. All the LDA-based QSAR models show accuracies above 85% in the training set and values of Matthews correlation coefficients (C) vary from 0.70 to 0.86. The external validation set shows rather-good global classifications of around 80% (92.05% for best equation). Later, we developed a multi-agent QSAR classification system, in which the individual QSAR outputs are the inputs of the aforementioned fusion approach. Finally, the fusion model was used for the identification of a novel generation of lead-like antiprotozoan compounds by using ligand-based virtual screening of ‘available’ small molecules (with synthetic feasibility) in our ‘in-house’ library. A new molecular subsystem (quinoxalinones) was then theoretically selected as a promising lead series, and its derivatives subsequently synthesized, structurally characterized, and experimentally assayed by using in vitro screening that took into consideration a battery of five parasite-based assays. The chemicals 11(12) and 16 are the most active (hits) against apicomplexa (sporozoa) and mastigophora (flagellata) subphylum parasites, respectively. Both compounds depicted good activity in every protozoan in vitro panel and they did not show unspecific cytotoxicity on the host cells. The described technical framework seems to be a promising QSAR-classifier tool for the molecular discovery and development of novel classes of broad—antiprotozoan—spectrum drugs, which may meet the dual challenges posed by drug-resistant parasites and the rapid progression of protozoan illnesses.Item Assessing the Chemical-induced Estrogenicity using in Silico and in Vitro Methods(Environmental Toxicology and Pharmacology, 2021) Goya-Jorge, Elizabeth; Amber, Mazia; Connolly, Lisa; Barigye, Stephen J.Multiple substances are considered endocrine disrupting chemicals (EDCs). However, there is a significant gap in the early prioritization of EDC’s effects. In this work, in silico and in vitro methods were used to model estrogenicity. Two Quantitative Structure-Activity Relationship (QSAR) models based on Logistic Regression and REPTree algorithms were built using a large and diverse database of estrogen receptor (ESR) agonism. A 10-fold external validation demonstrated their robustness and predictive capacity. Mechanistic interpretations of the molecular descriptors (C-026, nArOH,PW5, B06[Br-Br]) used for modelling suggested that the heteroatomic fragments, aromatic hydroxyls, and bromines, and the relative bond accessibility areas of molecules, are structural determinants in estrogenicity. As validation of the QSARs, ESR transactivity of thirteen persistent organic pollutants (POPs) and suspected EDCs was tested in vitro using the MMV-Luc cell line. A good correspondence between predictions and experimental bioassays demonstrated the value of the QSARs for prioritization of ESR agonist compounds.Item Atom Type Independent Modeling of the Conformational Energy of Benzylic, Allylic, and Other Bonds Adjacent to Conjugated Systems(Journal of chemical information and modeling, 2019) Champion, Candide; Barigye, Stephen J.; Labute, Paul; Moitessier, NicolasApplications of computational methods to predict binding affinities for protein/drug complexes are routinely used in structure-based drug discovery. Applications of these methods often rely on empirical force fields (FFs) and their associated parameter sets and atom types. However, it is widely accepted that FFs cannot accurately cover the entire chemical space of drug-like molecules, due to the restrictive cost of parametrization and the poor transferability of existing parameters. To address these limitations, initiatives have been carried out to develop more transferable methods, in order to allow for more rigorous descriptions of any drug-like molecule. We have previously reported H-TEQ, a method which does not rely on atom types and incorporates well established chemical principles to assign parameters to organic molecules. The previous implementation of H-TEQ (a torsional barrier prediction method) only covered saturated and lone pair containing molecules; here, we report our efforts to incorporate conjugated systems into our model. The next step was the evaluation of the introduction of unsaturations. The developed model (H-TEQ3.0) has been validated on a wide variety of molecules containing heteroaromatic groups, alkyls, and fused ring systems. Our method performs on par with one of the most commonly used FFs (GAFF2), without relying on atom types or any prior parametrization.Item Atom Types Independent Molecular Mechanics Method for Predicting the Conformational Energy of Small Molecules(Journal of chemical information and modeling, 2018) Liu, Zhaomin; Barigye, Stephen J.; Shahamat, Moeed; Labute, Paul; Moitessier, NicolasWe previously implemented a well-known qualitative chemical principle into an accurate quantitative model computing relative potential energies of conformers. According to this principle, hyperconjugation strength correlates with electronegativity of donors and acceptors. While this earlier version of our model applies to σ bonds, lone pairs, disregarded in this earlier version, also have a major impact on the conformational preferences of molecules. Among the well-established principles used by organic chemists to rationalize some organic chemical behaviors are the anomeric effect, the alpha effect, basicity, and nucleophilicity. These effects are directly related to the presence of lone pairs. We report herein our effort to incorporate lone pairs into our model to extend its applicability domain to any saturated small molecules. The developed model H-TEQ 2 has been validated on a wide variety of molecules from polyaromatic molecules to carbohydrates and molecules with high heteroatoms/carbon ratios. Interestingly, this method, in contrast to common force field-based methods, does not rely on atom types and is virtually applicable to any organic molecules.Item Aug-MIA-SPR/PLS-DA Classification of Carbonyl Herbicides according to Levels of Soil Sorption(Geoderma, 2016) Freitas, Mirlaine R.; Barigye, Stephen J.; Daré, Joyce K.; Freitas, Matheus P.A major challenge in the design of new herbicides lies in the development of highly active, environmentally friendly compounds. Soil sorption is an ecotoxicological parameter used to probe the prospective environmental fate of persistent organic pollutants, such as some herbicides. This parameter, described in terms of logKOC (the logarithm of the soil/water partition coefficient normalized to organic carbon), is usually estimated using the octanol/water partition coefficient (logP, easily calculated or determined experimentally). However, estimations obtained with the logP are not always accurate. Thus, this work reports the use of molecular descriptors derived from multivariate image analysis of carbonyl herbicides to achieve a predictive classification model based on the partial least squares-discriminant analysis (PLS-DA) method. This model yields 80% accuracy in calibration, 75% in leave-one-out cross-validation and 100% in external validation. In addition, the Y-randomization test reveals that the obtained model is stable from fortuitous correlation, since the accuracy in calibration after shuffling the classes block is only 0.5%. Chemical interpretation in terms of the structural features that affect soil sorption is performed, based on the weights of the selected variables in the classification model. Finally, novel herbicides are rationally designed, based on the inferences arrived at in the structural interpretation experiment and predictions of their qualitative and quantitative soil sorption profiles performed, using the built aug-MIA-SPR and Wang's models, respectively.Item Computational Strategies for the Discovery of Biological Functions of Health Foods, Nutraceuticals and Cosmeceuticals: A Review(Molecular Diversity, 2021) Carpio, Laureano E.; Sanz, Yolanda; Gozalbes, Rafael; Barigye, Stephen J.Scientific and consumer interest in healthy foods (also known as functional foods), nutraceuticals and cosmeceuticals has increased in the recent years, leading to an increased presence of these products in the market. However, the regulations across different countries that define the type of claims that may be made, and the degree of evidence required to support these claims, are rather inconsistent. Moreover, there is also controversy on the effectiveness and biological mode of action of many of these products, which should undergo an exhaustive approval process to guarantee the consumer rights. Computational approaches constitute invaluable tools to facilitate the discovery of bioactive molecules and provide biological plausibility on the mode of action of these products. Indeed, methodologies like QSAR, docking or molecular dynamics have been used in drug discovery protocols for decades and can now aid in the discovery of bioactive food components. Thanks to these approaches, it is possible to search for new functions in food constituents, which may be part of our daily diet, and help to prevent disorders like diabetes, hypercholesterolemia or obesity. In the present manuscript, computational studies applied to this field are reviewed to illustrate the potential of these approaches to guide the first screening steps and the mechanistic studies of nutraceutical, cosmeceutical and functional foods.Item Derivatives in discrete mathematics: a novel graph-theoretical invariant for generating new 2/3D molecular descriptors. I. Theory and QSPR application(Journal of computer-aided molecular design, 2012) Marrero-Ponce, Yovani; Santiago, Oscar Martínez; Barigye, Stephen J.; Torrens, FranciscoIn this report, we present a new mathematical approach for describing chemical structures of organic molecules at atomic-molecular level, proposing for the first time the use of the concept of the derivative (∂ ) of a molecular graph (MG) with respect to a given event (E), to obtain a new family of molecular descriptors (MDs). With this purpose, a new matrix representation of the MG, which generalizes graph’s theory’s traditional incidence matrix, is introduced. This matrix, denominated the generalized incidence matrix, Q, arises from the Boolean representation of molecular sub-graphs that participate in the formation of the graph molecular skeleton MG and could be complete (representing all possible connected sub-graphs) or constitute sub-graphs of determined orders or types as well as a combination of these. The Q matrix is a non-quadratic and unsymmetrical in nature, its columns (n) and rows (m) are conditions (letters) and collection of conditions (words) with which the event occurs. This non-quadratic and unsymmetrical matrix is transformed, by algebraic manipulation, to a quadratic and symmetric matrix known as relations frequency matrix, F, which characterizes the participation intensity of the conditions (letters) in the events (words). With F, we calculate the derivative over a pair of atomic nuclei. The local index for the atomic nuclei i, Δ i , can therefore be obtained as a linear combination of all the pair derivatives of the atomic nuclei i with all the rest of the j′s atomic nuclei. Here, we also define new strategies that generalize the present form of obtaining global or local (group or atom-type) invariants from atomic contributions (local vertex invariants, LOVIs). In respect to this, metric (norms), means and statistical invariants are introduced. These invariants are applied to a vector whose components are the values Δ i for the atomic nuclei of the molecule or its fragments. Moreover, with the purpose of differentiating among different atoms, an atomic weighting scheme (atom-type labels) is used in the formation of the matrix Q or in LOVIs state. The obtained indices were utilized to describe the partition coefficient (Log P) and the reactivity index (Log K) of the 34 derivatives of 2-furylethylenes. In all the cases, our MDs showed better statistical results than those previously obtained using some of the most used families of MDs in chemometric practice. Therefore, it has been demonstrated to that the proposed MDs are useful in molecular design and permit obtaining easier and robust mathematical models than the majority of those reported in the literature. All this range of mentioned possibilities open “the doors” to the creation of a new family of MDs, using the graph derivative, and avail a new tool for QSAR/QSPR and molecular diversity/similarity studies.Item Development of an in Silico Model of DPPH‚ Free Radical Scavenging Capacity: Prediction of Antioxidant Activity of Coumarin Type Compounds(International Journal of Molecular Sciences, 2016) Jorge, Elizabeth Goya; Barigye, Stephen J.; Rodríguez, María Elisa Jorge; Veitía, Maité Sylla-IyarretaA quantitative structure-activity relationship (QSAR) study of the 2,2-diphenyl-l-picrylhydrazyl (DPPH•) radical scavenging ability of 1373 chemical compounds, using DRAGON molecular descriptors (MD) and the neural network technique, a technique based on the multilayer multilayer perceptron (MLP), was developed. The built model demonstrated a satisfactory performance for the training (R2=0.713) and test set (Q2ext=0.654) , respectively. To gain greater insight on the relevance of the MD contained in the MLP model, sensitivity and principal component analyses were performed. Moreover, structural and mechanistic interpretation was carried out to comprehend the relationship of the variables in the model with the modeled property. The constructed MLP model was employed to predict the radical scavenging ability for a group of coumarin-type compounds. Finally, in order to validate the model’s predictions, an in vitro assay for one of the compounds (4-hydroxycoumarin) was performed, showing a satisfactory proximity between the experimental and predicted pIC50 valuesItem Development of Generalized QSAR Models for Predicting Cytotoxicity and Genotoxicity of Metal Oxides Nanoparticles(International Journal of Quantitative Structure-Property Relationships, 2020) Ambure, Pravin; Ballesteros, Arantxa; Barigye, Stephen J.; Gozalbes, RafaelIn recent years, nanomaterials have gained tremendous attention due to their wide variety of industrial applications including food packaging, consumer products, nanomedicines, etc. The fascinating properties of nanoparticles which are responsible for creating several exciting opportunities, however, are also accountable for growing concerns of their toxic effects on humans as well as the environment. Thus, in the present study, the authors have developed generalized models for predicting the cytotoxicity and genotoxicity of seven metal oxide nanoparticles. The models not only take into account the structural features, but also the diverse experimental conditions under which the toxicity of nanoparticles was determined. The diverse experimental conditions were captured in the generalized models using the Box-Jenkins moving average approach. Here, two machine learning techniques, namely, linear discriminant analysis and random forest were utilized to build the final models. Importantly, the validation metrics showed that the developed models have significant discriminatory power.Item Discrete Derivatives for Atom-Pairs as a Novel Graph Theoretical Invariant for Generating New Molecular Descriptors: Orthogonality, Interpretation and QSARs/ QSPRs on Benchmark Databases(Molecular Informatics, 2014) Martínez-Santiago, Oscar; Marrero-Ponce, Yovani; Barigye, Stephen J.; Torrens, Francisco; Pérez-Giménez, FacundoThis report presents a new mathematical method based on the concept of the derivative of a molecular graph (G) with respect to a given event (S) to codify chemical structure information. The derivate over each pair of atoms in the molecule is defined as ∂G/∂S(vi , vj)=(fi−2fij+fj)/fij, where fi (or fj) and fij are the individual frequency of atom i (or j) and the reciprocal frequency of the atoms i and j, respectively. These frequencies characterize the participation intensity of atom pairs in S. Here, the event space is composed of molecular sub-graphs which participate in the formation of the G skeleton that could be complete (representing all possible connected sub-graphs) or comprised of sub-graphs of certain orders or types or combinations of these. The atom level graph derivative index, Δi, is expressed as a linear combination of all atom pair derivatives that include the atomic nuclei i. Global [total or local (group or atom-type)] indices are obtained by applying the so called invariants over a vector of Δi values. The novel MDs are validated using a data set of 28 alkyl-alcohols and other benchmark data sets proposed by the International Academy of Mathematical Chemistry. Also, the boiling point for the alcohols, the adrenergic blocking activity of N,N-dimethyl-2-halo-phenethylamines and physicochemical properties of polychlorinated biphenyls and octanes are modeled. These models exhibit satisfactory predictive power compared with other 0–3D indices implemented successfully by other researchers. In addition, tendencies of the proposed indices are investigated using examples of various types of molecular structures, including chain-lengthening, branching, heteroatoms-content, and multiple bonds. On the other hand, the relation of atom-based derivative indices with 17O NMR of a series of ethers and carbonyls reflects that the new MDs encode electronic, topological and steric information. Linear independence between the graph derivative indices and other 0-3D MDs is demonstrated by using principal component analysis on a dataset of 41 heterogeneous molecules. It is concluded that the graph derivative indices are independent indices containing important structural information to be used in QSPR/QSAR and drug design studies, and permit obtaining easier, more interpretable and robust mathematical models than the majority of those reported in the literature.Item Discrete Fourier Transform based Multivariate Image Analysis: Application to Modeling Aromatase Inhibitory Activity.(ACS Combinatorial Science, 2018) Barigye, Stephen J.; Freitas, Matheus P.; Ausina, Priscila; Castillo-Garit, Juan AlbertoWe recently generalized the formerly alignment-dependent multivariate image analysis applied to quantitative structure–activity relationships (MIA-QSAR) method through the application of the discrete Fourier transform (DFT), allowing for its application to noncongruent and structurally diverse chemical compound data sets. Here we report the first practical application of this method in the screening of molecular entities of therapeutic interest, with human aromatase inhibitory activity as the case study. We developed an ensemble classification model based on the two-dimensional (2D) DFT MIA-QSAR descriptors, with which we screened the NCI Diversity Set V (1593 compounds) and obtained 34 chemical compounds with possible aromatase inhibitory activity. These compounds were docked into the aromatase active site, and the 10 most promising compounds were selected for in vitro experimental validation. Of these compounds, 7419 (nonsteroidal) and 89 201 (steroidal) demonstrated satisfactory antiproliferative and aromatase inhibitory activities. The obtained results suggest that the 2D-DFT MIA-QSAR method may be useful in ligand-based virtual screening of new molecular entities of therapeutic utility.Item Examining the predictive accuracy of the novel 3D N‑linear algebraic molecular codifications on benchmark datasets(Journal of Cheminformatics, 2016) García‑Jacas, César R.; Contreras‑Torres, Ernesto; Barigye, Stephen J.; Cabrera‑Leyva, LissetRecently, novel 3D alignment-free molecular descriptors (also known as QuBiLS-MIDAS) based on two-linear, three-linear and four-linear algebraic forms have been introduced. These descriptors codify chemical information for relations between two, three and four atoms by using several (dis-)similarity metrics and multi-metrics. Several studies aimed at assessing the quality of these novel descriptors have been performed. However, a deeper analysis of their performance is necessary. Therefore, in the present manuscript an assessment and statistical validation of the performance of these novel descriptors in QSAR studies is performed.Item Exploring MIA-QSPR's for the modeling of biomagnification factors of aromatic organochlorine pollutants(Ecotoxicology and Environmental Safety, 2017) Mota, Estella G. da; Duarte, Mariene H.; Barigye, Stephen J.; Ramalho, Teodorico C.; Freitas, Matheus P.Biomagnification of organic pollutants in food webs has been usually associated to hydrophobicity and other molecular descriptors. However, direct information on atoms and substituent positions in a molecular scaffold that most affect this biological property is not straightforward using traditional QSPR techniques. This work reports the QSPR modeling of biomagnification factors (logBMF) of a series of aromatic organochlorine compounds using three MIA-QSPR (multivariate image analysis applied to QSPR) approaches. The MIA-QSPR model based on augmented molecular images (described with atoms represented as circles with sizes proportional to the respective van der Waals radii and having colors numerically proportional to the Pauling's electronegativity) encoded better the logBMF data. The average results for the main statistical parameters used to attest the model's predictability were r2=0.85, q2=0.72 and r2test=0.85. In addition, chemical insights on substituents and respective positions at the biphenyl rings A and B, and dibenzo-p-dioxin and dibenzofuran motifs are given to aid the design of more ecofriendly derivatives.Item Extended GT-STAF Information Indices based on Markov Approximation Models(Chemical Physics Letters, 2013) Barigye, Stephen J.; Marrero-Ponce, Yovani; Alfonso-Reguera, Vitalio; Pérez-Giménez, FacundoA series of novel information theory-based molecular parameters derived from the insight of a molecular structure as a chemical communication system were recently presented and usefully employed in QSAR/QSPRs (J. Comp. Chem, 2013, 34, 259; SAR and QSAR in Environ. Res. 2013, 24). This approach permitted the application of Shannon’s source and channel coding entropic measures to a chemical information source comprised of molecular ‘fragments’, using the zero-order Markov approximation model (atom-based approach). This report covers the theoretical aspects of the extensions of this approach to higher-order models, introducing the first, second and generalized-order Markov approximation models.Item Extending Graph (Discrete) Derivative Descriptors to N-Tuple Atom-Relations(Match-Communications in Mathematical and in Computer Chemistry, 2015) Santiago, Oscar Martínez; Marrero-Ponce, Yovani; Cabrera, Reisel Millán; Barigye, Stephen J.; Martínez, Luis M. ArtilesIn the present manuscript, an extension of the previously defined Graph Derivative Indices (GDIs) is discussed. To achieve this objective, the concept of a hypermatrix, conceived from the calculation of the frequencies of triple and quadruple atom relations in a set of connected sub-graphs, is introduced. This set of subgraphs is generated following a predefined criterion, known as the event (S), being in this particular case the connectivity among atoms. The triple and quadruple relations frequency matrices serve as a basis for the computation of triple and quadruple discrete derivative indices, respectively. The GDIs are implemented in a computational program denominated DIVATI (acronym for DIscrete DeriVAtive Type Indices), a module of TOMOCOMD-CARDD program. Shannon‟s entropy-based variability analysis demonstrates that the GDIs show major variability than others indices used in QSAR/QSPR researches. In addition, it can be appreciated when the indices are extended over n-elements from the graph, its quality increases, principally when they are used in a combined way.Item Gene Prioritization, Communality Analysis, Networking and Metabolic Integrated Pathway to better Understand Breast Cancer Pathogenesis(Scientific reports, 2018) López-Cortés, Andrés; Cabrera-Andrade, Alejandro; Barigye, Stephen J.; Munteanu, Cristian R.; Tejera, EduardoConsensus strategy was proved to be highly efficient in the recognition of gene-disease association. Therefore, the main objective of this study was to apply theoretical approaches to explore genes and communities directly involved in breast cancer (BC) pathogenesis. We evaluated the consensus between 8 prioritization strategies for the early recognition of pathogenic genes. A communality analysis in the protein-protein interaction (PPi) network of previously selected genes was enriched with gene ontology, metabolic pathways, as well as oncogenomics validation with the OncoPPi and DRIVE projects. The consensus genes were rationally filtered to 1842 genes. The communality analysis showed an enrichment of 14 communities specially connected with ERBB, PI3K-AKT, mTOR, FOXO, p53, HIF-1, VEGF, MAPK and prolactin signaling pathways. Genes with highest ranking were TP53, ESR1, BRCA2, BRCA1 and ERBB2. Genes with highest connectivity degree were TP53, AKT1, SRC, CREBBP and EP300. The connectivity degree allowed to establish a significant correlation between the OncoPPi network and our BC integrated network conformed by 51 genes and 62 PPi. In addition, CCND1, RAD51, CDC42, YAP1 and RPA1 were functional genes with significant sensitivity score in BC cell lines. In conclusion, the consensus strategy identifies both well-known pathogenic genes and prioritized genes that need to be further explored.Item Generative Adversarial Networks (GANs) Based Synthetic Sampling for Predictive Modeling(Molecular Informatics, 2020) Barigye, Stephen J.; García-de la Vega, José Manuel; Perez-Castillo, YunierkisIn the present report we evaluate the possible utility of the Generative Adversarial Networks (GANs) in mapping the chemical structural space for molecular property profiles, with the goal of subsequently yielding synthetic (artificial) samples for ligand-based molecular modeling. Two case studies are considered: BACE-1 (β-Secretase 1) and DENV (Dengue Virus) inhibitory activities, with the former focused on data populating and the latter on data balancing tasks. We train GANs using subsamples extracted from datasets for each bioactivity endpoint, and apply the trained networks in generating synthetic examples from the respective bioactivity chemical spaces. Original and synthetic samples are pooled together and employed to build BACE-1 and DENV inhibitory activity classifiers and their performance evaluated over tenfold external validation sets. In both case studies, the obtained classifiers demonstrate satisfactory predictivity with the former yielding accuracy (ACC) and Mathew's correlation coefficient (MCC) values of 0.80 and 0.59, while the latter produces balanced accuracy(BACC) and MCC values of 0.81 and 0.70, respectively. Moreover, the statistics of these classifiers are compared with those of other models in the literature demonstrating comparable to better performance. These results suggest that GANs may be useful in mapping the chemical space for molecular property profiles of interest, and thus allow for the extraction of synthetic examples for computational modeling.Item Identification of NLRP3PYD Homo-Oligomerization Inhibitors with Anti-Inflammatory Activity(International Journal of Molecular Sciences, 2022) Ghafary, Soroush Moasses; Soriano-Teruel, Paula M.; Karami, Fatemeh; Barigye, Stephen J.; Fernández-Pérez, IvánInflammasomes are multiprotein complexes that represent critical elements of the inflammatory response. The dysregulation of the best-characterized complex, the NLRP3 inflammasome, has been linked to the pathogenesis of diseases such as multiple sclerosis, type 2 diabetes mellitus, Alzheimer’s disease, and cancer. While there exist molecular inhibitors specific for the various components of inflammasome complexes, no currently reported inhibitors specifically target NLRP3PYD homo-oligomerization. In the present study, we describe the identification of QM380 and QM381 as NLRP3PYD homo-oligomerization inhibitors after screening small molecules from the MyriaScreen library using a split-luciferase complementation assay. Our results demonstrate that these NLRP3PYD inhibitors interfere with ASC speck formation, inhibit pro-inflammatory cytokine IL1-β release, and decrease pyroptotic cell death. We employed spectroscopic techniques and computational docking analyses with QM380 and QM381 and the PYD domain to confirm the experimental results and predict possible mechanisms underlying the inhibition of NLRP3PYD homo-interactions.Item IMMAN: Free Software for Information theory-based Chemometric Analysis(Molecular diversity, 2015) Urias, Ricardo W. Pino; Barigye, Stephen J.; Marrero-Ponce, Yovani; García-Jacas, César R.; Perez-Gimenez, FacundoThe features and theoretical background of a new and free computational program for chemometric analysis denominated IMMAN (acronym for Information theory-based CheMoMetrics ANalysis) are presented. This is multi-platform software developed in the Java programming language, designed with a remarkably user-friendly graphical interface for the computation of a collection of information-theoretic functions adapted for rank-based unsupervised and supervised feature selection tasks. A total of 20 feature selection parameters are presented, with the unsupervised and supervised frameworks represented by 10 approaches in each case. Several information-theoretic parameters traditionally used as molecular descriptors (MDs) are adapted for use as unsupervised rank-based feature selection methods. On the other hand, a generalization scheme for the previously defined differential Shannon’s entropy is discussed, as well as the introduction of Jeffreys information measure for supervised feature selection. Moreover, well-known information-theoretic feature selection parameters, such as information gain, gain ratio, and symmetrical uncertainty are incorporated to the IMMAN software (http://mobiosd-hub.com/imman-soft/), following an equal-interval discretization approach. IMMAN offers data pre-processing functionalities, such as missing values processing, dataset partitioning, and browsing. Moreover, single parameter or ensemble (multi-criteria) ranking options are provided. Consequently, this software is suitable for tasks like dimensionality reduction, feature ranking, as well as comparative diversity analysis of data matrices. Simple examples of applications performed with this program are presented. A comparative study between IMMAN and WEKA feature selection tools using the Arcene dataset was performed, demonstrating similar behavior. In addition, it is revealed that the use of IMMAN unsupervised feature selection methods improves the performance of both IMMAN and WEKA supervised algorithms.
- «
- 1 (current)
- 2
- 3
- »