A simple QSPR model for predicting soil sorption coefficients of polar and non-polar organic compounds from molecular formula

Similar documents
Chapter 25: The Chemistry of Life: Organic and Biological Chemistry

BIOB111 - Tutorial activities for session 8

is given for the isotopic fingerprinting methodology.

Chem 1075 Chapter 19 Organic Chemistry Lecture Outline

Chapter 20: Identification of Compounds

3. Organic Geochemisty Organic Chemistry is the chemistry... of Carbon -Morrison and Boyd

PSI Chemistry. 3) How many electron pairs does carbon share in order to complete its valence shell? A) 1 B) 2 C) 3 D) 4 E) 8

CHM 292 Final Exam Answer Key

Physical Pharmacy PHR 211. Lecture 1. Solubility and distribution phenomena.

Table 8.2 Detailed Table of Characteristic Infrared Absorption Frequencies

Aromatic Hydrocarbons

Electronegativity Scale F > O > Cl, N > Br > C, H

Deduce the following information from the structure of estradiol, a phenol contain compound.

Course Information. Instructor Information

SUPPLEMENTAL HOMEWORK SOLUTIONS WEEK 10

Atomic weight = Number of protons + neutrons

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME

Prediction and QSAR Analysis of Toxicity to Photobacterium phosphoreum for a Group of Heterocyclic Nitrogen Compounds

5. (6 pts) Show how the following compound can be synthesized from the indicated starting material:

Matter: Elements and Compounds

Chapter 21: Hydrocarbons Section 21.3 Alkenes and Alkynes

Atomic and molecular interaction forces in biology

2: CHEMICAL COMPOSITION OF THE BODY

AGS Chemistry 2007 Correlated to: Prentice Hall Chemistry (Wilbraham) including AGS Differentiated Instruction Strategies

Chemistry Review Unit 4 Chemical Bonding

A Gentle Introduction to (or Review of ) Fundamentals of Chemistry and Organic Chemistry

CHAPTER 2. Structure and Reactivity: Acids and Bases, Polar and Nonpolar Molecules

CHEMISTRY Topic #3: Using Spectroscopy to Identify Molecules: Radicals and Mass Spectrometry (MS) Spring 2018 Dr.

Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems.

Chem 1075 Chapter 14 Solutions Lecture Outline

STATE UNIVERSITY OF NEW YORK COLLEGE OF TECHNOLOGY CANTON, NEW YORK

ORGANIC CHEMISTRY. Classification of organic compounds

Quiz QSAR QSAR. The Hammett Equation. Hammett s Standard Reference Reaction. Substituent Effects on Equilibria

Finding Formulas. using mass information about a compound to find its formula

Detailed Course Content

Organic Chemistry. Organic chemistry is the chemistry of compounds containing carbon.

Alkenes, Alkynes, and Aromatic Compounds

18.8 Oxidation. Oxidation by silver ion requires an alkaline medium

Glendale Community College, AZ

Unit 10: Part 1: Polarity and Intermolecular Forces

Molecular Geometry: VSEPR model stand for valence-shell electron-pair repulsion and predicts the 3D shape of molecules that are formed in bonding.

Aliphatic Hydrocarbons Anthracite alkanes arene alkenes aromatic compounds alkyl group asymmetric carbon Alkynes benzene 1a

Chapter 12: Unsaturated Hydrocarbons

Mining Molecular Fragments: Finding Relevant Substructures of Molecules

INTERNATIONAL JOURNAL OF PHARMACY & LIFE SCIENCES

CHAPTER 6 Intermolecular Forces Attractions between Particles

EXPERIMENT 1: Survival Organic Chemistry: Molecular Models

Chapter 2 Structure and Properties of Organic Molecules. Advanced Bonding: Review

EXPERIMENT 1: Survival Organic Chemistry: Molecular Models

WHAT IS A SOLUTION? PROPERTIES OF SOLUTIONS SOLUTION TYPES. Possible answers BRAINSTORM: CH. 13

Statistical concepts in QSAR.

Metabolic diversity is based on the Electron donors, acceptors, and carbon sources available - thermodynamics

HYDROCARBONS: STRUCTURE & PROPERTIES

Solutions and Organic Chemistry

Name: Period: Date: UNIT 12: Solutions Lesson 2: Electronegativity and Polarity!

Lec.1 Chemistry Of Water

LECTURE WORKSHEETS. Lecture Worksheets

Exam (6 pts) Show which starting materials are used to produce the following Diels-Alder products:

MOLECULAR REPRESENTATIONS AND INFRARED SPECTROSCOPY

Name. CHEM 115 Exam #4 PRACTICE. Complete the following questions/calculations. (Show all of your work.)

CHAPTER 12 REVIEW. Solutions. Answer the following questions in the space provided. b. sea water. c. water-absorbing super gels

Lecture 1 Solubility and Distribution Phenomena

Exercise 9 - Petrochemicals and Climate

Chapter 13 Alkenes and Alkynes Based on Material Prepared by Andrea D. Leonard University of Louisiana at Lafayette

A QSPR STUDY OF NORMAL BOILING POINT OF ORGANIC COMPOUNDS (ALIPHATIC ALKANES) USING MOLECULAR DESCRIPTORS. B. Souyei * and M.

10 Chemical reactions in aqueous solutions

CHAPTER 4 ENVIRONMENTAL FATE

A Quantitative Structure-Property Relationship (QSPR) Study of Aliphatic Alcohols by the Method of Dividing the Molecular Structure into Substructure

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1)

Luckily this intermediate has three saturated carbons between the carbonyls, which again points to a Michael reaction:

Name: Period: Date: solution

Modeling in Solution. Important Observations. Energy Concepts

CHEM 1412 Practice Exam 1 - Chapters Zumdahl

CHAPTER 2: Structure and Properties of Organic Molecules

Biochemistry,530:,, Introduc5on,to,Structural,Biology, Autumn,Quarter,2015,

Doctor of Philosophy

cp final review part 2

Look for absorption bands in decreasing order of importance:

Chapter 4 Carbon and the Molecular Diversity of Life

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Figure 2.1

VOCABULARY. Set #2. Set #1

CHEMISTRY 31 Name: KEY Exam #1 100 pts 1. (6 pts) Provide the complete IUPAC name for each of the following compounds:

Carbon atoms are the most versatile building blocks of molecules

6.1 Intro to Chemical Bonding Name:

The chemical building blocks of life Carbon, water and possible alternatives. Carbon. Hybridization of carbon valence orbitals

All information in full detail is available. 2.9.Availability of another QMRF for exactly the same model: No other QMRF available for the same model

SAMPLE FINAL 3150:110

Synthesis of Nitriles a. dehydration of 1 amides using POCl 3 : b. SN2 reaction of cyanide ion on halides:

Chapter 12. Physical Properties of Solutions. Chemistry, Raymond Chang 10th edition, 2010 McGraw-Hill

ORGANIC - EGE 5E CH. 2 - COVALENT BONDING AND CHEMICAL REACTIVITY

Notes of Dr. Anil Mishra at 1

All organic compounds contain carbon, however, not all carbon containing compounds are classified as organic. Organic compounds covalently bonded

Curriculum Correlation

Paper 12: Organic Spectroscopy

5. Solve the following a) What energy is required to heat 55.5 g of carbon from -10 C to 47 C (Ccarbon = 0.71 J/g C)

Chapter 1. Introduction

CHEMISTRY. 1). Solutions are classified into aqueous and non-aqueous solutions, based on. a) Nature of solute particles

The Chemistry and Energy of Life

2. Separate the ions based on their mass to charge (m/e) ratio. 3. Measure the relative abundance of the ions that are produced

Transcription:

A simple QSPR model for predicting soil sorption coefficients of polar and non-polar organic compounds from molecular formula J. Chem. Inf. Comput. Sci. Eduardo J. Delgado, Joel B. Alderete, and Gonzalo A. Jana (2003) Presented by: Mohamed KASSAB DIIAR, Politecnco di Milano

Outlines Introduction Model description, Data set, Results, Validation set, Physical interpretation, Conclusion. DIIAR-Politecnico di Milano 2

Introduction The term sorption is used frequently in environmental situations to denote the uptake of a solute by a solid (soil or sediment or component of soil) without reference to a specific mechanism. Sorption processes play a major role in determining the environmental fate and impact of organic chemicals. Sorption affects a variety of specific fate processes, including: Volatilization, Bioavailability, Biodegradability, Photolysis, Hydrolysis. DIIAR-Politecnico di Milano 3

Sorption coefficients, Koc: quantitatively describe the organic chemical distribution between an environmental solid, soil, sediment, suspended sediment, and the aqueous phase in contact with at equilibrium CSoil is the concentration of solute per gram of carbon in the soil phase, CW denotes the concentration of solute in the aqueous phase. The experimental measurement of Koc is expensive, time-consuming and often related with considerable experimental error; consequently, there is a great need for reliable calculation methods which can be used for the prediction of Koc. DIIAR-Politecnico di Milano 4

Sorption coefficient prediction There are many approach for predicting sorption coefficient such as: Molecular properties that are obtained from quantum chemical calculations. Universal solvation model SMx based on semiemprical molecular orbital theory in combination with a dielectrical continuum model. Since all the above methods, based either on other experimental properties or topological parameters in conjunction with fragment contributions, or quantum chemical calculations, require information and/or computer programs which are not always readily available, Its important to have a model based on parameters obtained directly from the molecular formula without any further calculation, especially for scientists with no background in QC. DIIAR-Politecnico di Milano 5

Model description The QSPR model was developed using the Microsoft Windows version of the Codessa program which is a chemical multipurpose quantitative activity and structure-property statistical analysis and prediction program, using the best correlation option. Codessa calculates a total of 38 constitutional descriptors including the following: absolute and relative numbers of atoms, number of single, double, triple, and aromatic bonds, number of rings, number of benzene rings, and molecular weight. DIIAR-Politecnico di Milano 6

CHEMICAL DATA The data set of the sorption coefficients was collected from several literature sources. A total of 82 structurally diverse compounds were considered. * The data set contains both polar and nonpolar, saturated, unsaturated, aliphatic, aromatic, and polycyclic aromatic compounds covering a log Koc range from about 1 to 6 log units. * Syracuse Research Corporation, Physical/Chemical Property Database (Physptop); SRC Environmental Science Center: Syracuse, NY, 1994. * Linders, J. B. H. J.; Jansma, J. W.; Mensik, B. J. W. G.; Otermann, K. National Institute of Public Health and Environmental Protection (RIVM), Bilthoven, The Netherlands, Report No. 679101014. DIIAR-Politecnico di Milano 7

82 38 Best correlation number of benzene rings, molecular weight, number of nitrogen atoms, Cal. Log Koc number of oxygen atoms, number of sulfur atoms Est. Log Koc Log Koc DIIAR-Politecnico di Milano 8

For the 82 compounds the best correlation equation involving five constitutional descriptors is: DIIAR-Politecnico di Milano 9

Summary statistics for the correlation of experimental vs calculated log Koc for the 82 compounds are: Where: R is the correlation coefficient, F is the value of the Fisher test, s is the standard deviation of the fit, and RCV is the cross-validated correlation coefficient. For each data point, the regression is recalculated with the same descriptors but for the data set without this point. The obtained regression is used to predict the value of this point, and the set of estimated values calculated in this way is correlated with the experimental values. Positive values in the regression coefficients indicate that those descriptors contribute positively to the value of the soil sorption coefficient, whereas negative values indicate that the greater the value of these descriptors the lower the value of Koc. DIIAR-Politecnico di Milano 10

Model validation To check the predictive capability of the model, it was tested with an external set of chemicals not included in the training set. The validation data set included 43 compounds with a diverse selection of chemical structures. It contained not only many chemicals that are similar in structure to chemicals in the training set but also some chemicals having other functional groups, namely, alcohols and carboxylic acids. Statistical performance for calculated vs experimental log Koc for the validation was as follows: These results confirm the predictive capability of the model. DIIAR-Politecnico di Milano 11

Physical interpretation In the present model, the number of benzene rings and molecular weight are the most important descriptors, for these descriptors indicate the value of Koc goes up as the number of benzene rings and molecular weight increase. This behavior may be explained as follows. The number of benzene rings encode the hydrophobicity of the compound, i.e., its tendency to exclude from water. Thus, an increase in this descriptor leads to an decrease in the solubility of the compound. On the other hand, the special mobility of bi electrons will result in an enhanced polarizability and interaction energy of unsaturated molecules with the solid and therefore favoring the sorption process. Both phenomena, surely operating simultaneously, lead to an increase in the value of Koc. The larger the molecular weight the lower the solubility of the compound, leading to an increase in the value of Koc due to its inversely dependence on solubility. DIIAR-Politecnico di Milano 12

Conclusion The merit of the QSPR model developed in this article lays in its simplicity. Soil sorption coefficients can be predicted straightforwardly from the molecular formula, for both nonpolar and polar compounds, without any calculation of indices or more complicated quantum chemical calculations or without the need of having the values of polar fragment contributions. Therefore the model is applicable to compounds with polar fragments for which no group contributions have been fitted before. Currently, the model is being developed to include organophosphorus aliphatic and aromatic compounds. DIIAR-Politecnico di Milano 13