{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# `molli` Study case: Cladosporin GIAO-DFT NMR prediction\n", "\n", "## Objective\n", "The objective of this task is to use the `molli` workflow to transition from `.CDXML` molecular drawings to full-fledged 3D structures with subsequent CREST//GFN2-XTB conformer generation, ORCA equilibrium structure minimizations and GIAO-DFT NMR predictions.\n", "This workflow is particularly useful in validation of relative stereochemistry where experimental NMR is not conclusive alone (see e. g. [*J. Org. Chem. 2020*, **85**, 5, 3297](doi.org/10.1021/acs.joc.9b03129))\n", "\n", "## Prerequisites\n", "- `molli` installation\n", "- `pandas` installation (it is NOT a dependency of molli and must be installed separately)\n", "- `OpenBabel` installation (it is NOT a dependency of molli and must be installed separately)\n", " Note: if you experience problems with openbabel refusing to optimize the structures, do not forget to set the `BABEL_DATADIR` environment variable to point to a valid location with openbabel data files. With conda installations, it seems to be consistently pointing to `$CONDA_PREFIX/share/openbabel`\n", "- `openpyxl` installation (it is NOT a dependency of molli and must be installed separately)\n", "- `rmsd` installation (it is NOT a dependency of molli and must be installed separately)\n", "- `cladosporin.csv` file that has the experimental NMR chemical shifts\n", "- `cladosporin.cdxml` file that contains the structural drawings\n", "\n", "## Methods\n", "\n", "### Conformer generation\n", "Conformers are generated with iMTD-GC workflow as implemented in CREST. \n", "\n", "### " ] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [], "source": [ "import molli as ml\n", "import pandas as pd\n", "import numpy as np\n", "\n", "# This configures on-screen preview of molecules\n", "ml.visual.configure(bgcolor=\"black\", theme=\"dark\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Step 1. `CDXML` Parsing\n", "\n", "In this step, all of the molecules are read from the CDXML file and converted into their 3D versions.\n", "This can be done from both the command line interface, as well as programmatically. Here, the command line interface will be shown." ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Parsing cladosporin.cdxml: 100%|█████████████████| 4/4 [00:00<00:00, 148.00it/s]\n" ] } ], "source": [ "!molli parse cladosporin.cdxml -o cladosporin_g0.mlib --hadd" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We will now use IPython line magic `mlib_view` to visualize the entries in the resulting library." ] }, { "cell_type": "code", "execution_count": 8, "metadata": {}, "outputs": [ { "data": { "application/3dmoljs_load.v0": "
You appear to be running in JupyterLab (or JavaScript failed to load for some other reason). You need to install the 3dmol extension:
\n jupyter labextension install jupyterlab_3dmol
You appear to be running in JupyterLab (or JavaScript failed to load for some other reason). You need to install the 3dmol extension:
\n",
" jupyter labextension install jupyterlab_3dmol
You appear to be running in JupyterLab (or JavaScript failed to load for some other reason). You need to install the 3dmol extension:
\n jupyter labextension install jupyterlab_3dmol
You appear to be running in JupyterLab (or JavaScript failed to load for some other reason). You need to install the 3dmol extension:
\n",
" jupyter labextension install jupyterlab_3dmol
You appear to be running in JupyterLab (or JavaScript failed to load for some other reason). You need to install the 3dmol extension:
\n jupyter labextension install jupyterlab_3dmol
You appear to be running in JupyterLab (or JavaScript failed to load for some other reason). You need to install the 3dmol extension:
\n",
" jupyter labextension install jupyterlab_3dmol