Calculate Chemical Descriptors (CDs) for a list of molecules. Molecules can appear multiple times in the list.
Arguments
- df
dataframe with two mandatory columns: "NAME" and "SMILES"
- verbose
0: no output, 1: progress, 2: more progress and warnings
- nw
number of workers for parallel processing
- keepdf
If TRUE,
cbind(df, CDs)is returned. ElseCDs.- cache
If TRUE (default), descriptors are cached on disk (an SQLite database under tools::R_user_dir
("FastRet", "cache")): already-known descriptors are read from the cache and newly computed ones are written back, so repeated calls on the same molecules are fast and the cache is shared across processes. If FALSE, the cache is bypassed entirely and every descriptor is (re)computed via rCDK – useful for benchmarking the uncached runtime.
Value
A dataframe with all input columns (if keepdf is TRUE) and chemical
descriptors as remaining columns.
Examples
cds <- getCDs(head(RP, 3), verbose = 1, nw = 1)
#> 2026-07-05 18:22:42.87 Obtaining chemical descriptors for 3 molecules
#> 2026-07-05 18:22:42.89 Finished obtaining chemical descriptors in 0.01s