This file is the authoritative register of covariate column names
used in nlmixr2lib models. Every covariate referenced inside a model’s
model() block must have an entry here. The register is
seeded from a full audit of inst/modeldb/ and extended
whenever a new paper introduces a covariate that isn’t yet
registered.
How to use this register
- Before adding a covariate to a new model, search this file (by canonical name and by source alias) for the concept you need.
-
If the canonical name exists, use it exactly.
Document any source-paper rename in the model’s
covariateData[[name]]$source_namefield. -
If the source paper uses an alias listed under an existing
canonical name, use the canonical name and note in
covariateData[[name]]$noteswhether the values must be transformed (e.g.,SEXM -> SEXFinverts values; the effect coefficient sign / reference category must be inverted as well). - If the covariate is not in this register at all, propose a new entry with a canonical name, description, units, type, scope, and source aliases. Verify with the user before committing. The addition is part of the model’s PR.
- Do not modify existing model files when you discover an alias; simply document the mapping in the register. Retrofitting existing models is a separate effort.
Scope: general vs specific
Each entry has a Scope: field declaring whether it is
general (any model may use it without warning) or
specific (only the models listed under
Example models may use it; other usage is flagged by
checkModelConventions()). This prevents accidentally
reusing a covariate name whose meaning is tied to a particular
paper.
-
Scope: general – the covariate has a stable,
paper-independent meaning and any future model may use it. Examples:
WT,AGE,SEXF,CREAT,ADA_POS,CRP,CRCL. -
Scope: specific – the covariate’s semantics depend
on a particular study’s design (a specific study indicator, a
drug-product variant, a composite race grouping, a tumor-type
decomposition, etc.). If a new paper needs the same concept with
different semantics, register a new canonical name; if the concept
matches, extend the
Example modelslist (and consider promoting to general).
When adding or updating an entry, choose the most conservative scope:
if in doubt, start with specific and promote when a second
model legitimately ratifies the name.
Case convention
Covariate column names should be ALL CAPS. Current non-all-caps
canonical names are dilution and nonECZTRA
(both scope: specific), preserved from their source files with “future
rename” notes. New entries should default to all caps.
Entry schema
- name: <CANONICAL_NAME>
description: <one-sentence definition>
units: <unit string, or "(binary)" / "(categorical)">
type: continuous | binary | categorical | count
scope: general | specific
reference_category: <the 0 group for binary/categorical, or NULL>
source_aliases:
- <ALIAS_NAME> (<transformation if any>) -- used in <model.R>
example_models:
- <model.R>
notes: <free text>Demographics
WT (canonical for body weight (baseline or time-varying))
- Description: Body weight (baseline or time-varying).
- Units: kg
- Type: continuous
- Scope: general
-
Reference category: n/a – used with allometric
scaling
(WT / ref_wt)^exponent. Reference weights observed: 70 kg (adults), 75 kg, 84.8 kg, 5 kg (infants). - Source aliases: none known.
-
Example models:
Clegg_2024_nirsevimab.R,Hu_2026_clesrovimab.R,Zhu_2017_lebrikizumab.R,Kovalenko_2020_dupilumab.R,CarlssonPetri_2021_liraglutide.R,Cirincione_2017_exenatide.R,Grimm_2023_gantenerumab.R,Grimm_2023_trontinemab.R,Kyhl_2016_nalmefene.R,Soehoel_2022_tralokinumab.R,Xie_2019_agomelatine.R,PK_2cmt_mAb_Davda_2014.R,phenylalanine_charbonneau_2021.R,Chua_2025_mirikizumab.R,Jackson_2022_ixekizumab.R,Kotani_2022_astegolimab.R,Ma_2020_sarilumab_anc.R,Ma_2020_sarilumab_das28crp.R,Moein_2022_etrolizumab.R,Tiraboschi_2025_amlitelimab.R,Robbie_2012_palivizumab.R,Bajaj_2017_nivolumab.R,Quartino_2019_trastuzumab.R,Wang_2020_ontamalimab.R,Fau_2020_isatuximab.R,Okada_2025_rocatinlimab.R,Kunisawa_2014_olprinone.R,Xu_2020_daratumumab.R(reference 78.6 kg; power exponents 0.451 on linear CL and 0.375 on V1). - Notes: Universal. Verify time-varying vs. baseline-only against the source paper.
AGE (canonical for subject age)
- Description: Subject age in years.
- Units: years
- Type: continuous
- Scope: general
- Reference category: n/a
- Source aliases: none.
-
Example models:
Archary_2019_lamivudine.R,Budha_2023_tislelizumab.R,Chakraborty_2012_canakinumab.R,Chen_2020_luspatercept.R,Conrado_2014_alzheimer.R,Diepstraten_2013_propofol.R,Gandhi_2021_abatacept.R,Goel_2016_Sonidegib.R,Hennig_2013_tobra.R,Hong_2025_datopotamab.R,Ide_2020_elotuzumab.R,Koopman_2023_factorix.R,Kuchimanchi_2024_dostarlimab.R,Kunarajah_2017_doxorubicin.R,Kyhl_2016_nalmefene.R,Lahu_2010_roflumilast.R,Li_2006_meropenem.R,Li_2017_cediranib.R,Li_2019_abatacept.R,Lin_2024_casirivimab.R,Martinez_2019_alirocumab.R,Melhem_2022_dostarlimab.R,Mulyukov_2018_ranibizumab.R,NA_NA_tte_gompertz.R,NA_NA_tte_lognormal.R,Retlich_2015_linagliptin.R,Rosario_2015_vedolizumab.R,Svensson_2016_bedaquiline.R,Thakre_2022_risankizumab.R,Wu_2024_inotuzumab.R,Yassen_2025_asundexian.R,Yu_2022_ofatumumab.R,Zhong_2026_abatacept.R,Zhou_2021_belimumab.R,Zhu_2017_lebrikizumab.R. -
Notes: Zhu 2017 normalizes as
AGE/40.
LBM (canonical for lean body mass)
- Description: Lean body mass.
- Units: kg
- Type: continuous
- Scope: general
- Reference category: n/a
-
Source aliases:
-
LBW(lean body weight) – synonym; same biological quantity (total body weight minus body fat). Hemophilia popPK literature typically usesLBW(Hume or James formula) where mAb / general literature usesLBM. Used inGarmann_2017_BAY81_8973.R(reference 51.1 kg).
-
-
Example models:
Kyhl_2016_nalmefene.R(reference 56.28 kg, exponent 0.626 on CL),Garmann_2017_BAY81_8973.R(aliasLBW; reference 51.1 kg, exponents 0.610 on CL and 0.950 on Vc),Schoemaker_2017_brivaracetam.R(aliasLBW; paediatric cohort, reference 50 kg adult typical value, fixed theoretical allometric exponents 0.750 on CL/F and 1.00 on V/F).
FFM (canonical for fat-free mass)
- Description: Fat-free mass derived from body weight, height, and sex via the Janmahasatian et al. formula (Clin Pharmacokinet 2005;44:1051-1065).
- Units: kg
- Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(FFM / ref)^exponent. Reference values observed: 40.69 kg (Zhou 2021 belimumab pooled adult+pediatric SLE), 45 kg (Aguiar 2021, Crohn’s disease cohort median). -
Source aliases: none;
FFMis the universal abbreviation. -
Example models:
Zhou_2021_belimumab.R(reference 40.69 kg; exponents 0.673 on CL and 0.891 on V1),Aguiar_2021_ustekinumab.R(reference 45 kg; power exponents 0.598 on CL, 0.590 on Vc, 0.586 on Vp). -
Notes: Distinct from
LBM(lean body mass) which is sometimes computed by the Boer or Hume formulae. When the source paper reports the body-composition formula it used (e.g., Janmahasatian for FFM), record it incovariateData[[FFM]]$notes. FFM is preferred over total body weight when scaling monoclonal-antibody PK because mAb distribution is largely confined to extracellular fluid; muscle / lean tissue tracks extracellular volume better than total weight in heavier patients.
HT (canonical for body height at baseline)
- Description: Subject body height at baseline. Time-fixed unless the source paper states otherwise.
- Units: cm
- Type: continuous
- Scope: specific
-
Reference category: n/a – used with a
linear-deviation form
(HT - ref)or with a power-style scaling. Reference values observed: 167 cm (Naik 2016, vortioxetine adult MDD/GAD population median). -
Source aliases:
-
HGT– height (cm) abbreviation appearing in some NONMEM control streams. -
HEIGHT– spelled-out form.
-
-
Example models:
Naik_2016_vortioxetine.R(reference 167 cm; linear-additive effect 0.40 L/hr per (HT - 167) cm on CL/F, retained over weight and BMI in stepwise selection because it produced the larger reduction in CL IIV). -
Notes: Height is sometimes retained as a size
covariate when allometric scaling on weight performs less well; it is
also an input to BSA, BMI, FFM, and LBM derivations, so a model that
retains
HTalongside one of those derived covariates should document the dependency incovariateData[[HT]]$notes. Specific scope until a second adult-popPK model ratifies the name; at that point promote togeneral.
BSA (canonical for body surface area)
- Description: Body surface area (typically computed by DuBois, Mosteller, or Haycock from height and weight).
- Units: m^2
- Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(BSA / ref)^exponent. - Source aliases: none.
-
Example models:
Yamada_2025_zolbetuximab.R(reference 1.70 m^2; exponents 1.06 on clearances and 0.968 on volumes). - Notes: Oncology mAbs dosed by BSA (mg/m^2) often use BSA in place of body weight for allometric-style scaling. Document the BSA computation formula (DuBois / Mosteller / Haycock) the source paper used; if unstated, record “unspecified.”
BMI (canonical for body mass index)
- Description: Body mass index at baseline.
- Units: kg/m^2
- Type: continuous
- Scope: general
-
Reference category: n/a – used with a
linear-deviation form (
1 + e * (BMI - ref)) or a power form ((BMI / ref)^e). Document the reference value incovariateData[[BMI]]$notes. - Source aliases: none known.
-
Example models:
Chua_2025_mirikizumab.R(reference 24.75 kg/m^2; linear-deviation effect on logit of bioavailability),NA_NA_lidocaine.R(DDMODEL00000281; binary stratification at threshold 27.93 kg/m^2 adding +0.939 to the GX rate constant K30 in the BMI > 27.93 cohort). -
Notes: Universal clinical-trial demographic.
Derived as
WT / (height_m)^2; assume time-fixed at baseline unless the source paper states otherwise.
BMIZ (canonical for body-mass-index z-score (age- and sex-standardised))
-
Description: Age- and sex-standardised
body-mass-index z-score (number of standard deviations above or below
the reference-population mean BMI for the subject’s age and sex).
Distinct from raw
BMI(kg/m^2):BMIZis unitless and centred at 0 in the reference population, so the reference value used in linear-deviation effects is 0 (not a population BMI in kg/m^2). Time-varying when the source paper carries a per-visit z-score; document baseline-vs-time-varying status incovariateData[[BMIZ]]$notes. - Units: unitless (z-score; standard-deviation units)
- Type: continuous
- Scope: specific
-
Reference category: n/a – used with a
linear-deviation form
(1 + e * (BMIZ - 0))so the reference is 0 (population mean for the subject’s age/sex). Effect coefficients are interpreted as fractional change per 1 z-score-unit deviation from the reference. -
Source aliases:
-
BMI– when a paper uses the column nameBMIfor what is actually a z-score (e.g., the Harun 2019 NMTRAN control stream columnBMIis documented in the dataset header as “body-mass index z-score”). The canonical column isBMIZ; the source-paper column name is recorded incovariateData[[BMIZ]]$source_name.
-
-
Example models:
Harun_2019_cysticFibrosis.R(time-varying per-visit BMI z-score; linear-deviation effect on baseline FEV1% predicted with reference 0 and coefficient +0.0382 per z-score unit). -
Notes: Distinct from
BMI(raw kg/m^2 used in adult populations). Paediatric and adolescent studies routinely report BMI as a z-score relative to a growth reference (WHO 2007 Growth Reference for school-aged children, CDC 2000, etc.); document the reference standard the source paper used incovariateData[[BMIZ]]$notes. Specific scope until a second paediatric model ratifies the name; at that point promote togeneral.
SEXF (canonical for sex)
- Description: Biological sex indicator, 1 = female, 0 = male.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (male).
-
Source aliases:
-
SEXM(values inverted:SEXF = 1 - SEXM; effect coefficient sign and reference category both invert) – used inCarlssonPetri_2021_liraglutide.R. -
SEXwith"M"/"F"strings – deriveSEXF = as.integer(SEX == "F"). -
SEXwith1=male /2=female numeric coding – deriveSEXF = as.integer(SEX == 2). Used inNetterberg_2017_docetaxel.RandNA_NA_miridesap.R(DDMODEL00000262 source bundle; Sahota 2015 NONMEM convention). -
FEM(1 = female, 0 = male; same orientation as canonical, no transformation) – used inGuiastrennec_2016_gastric_emptying.R.
-
-
Example models:
Zhu_2017_lebrikizumab.R(canonical),CarlssonPetri_2021_liraglutide.R(aliasSEXM),Bajaj_2017_nivolumab.R(male-indicator source; effect applied asexp(coef * (1 - SEXF))to preserve the paper’s female-reference CL_REF / VC_REF),Fau_2020_isatuximab.R(exponential effect on Vc; reference category 0 = male),Netterberg_2017_docetaxel.R(multiplicative effect on baseline ANC:BACOV *= (1 + theta * SEXF); source columnSEXwith 1 = male, 2 = female encoding, decomposed viaSEXF = as.integer(SEX == 2)),NA_NA_miridesap.R(DDMODEL00000262 / Sahota 2015; multiplicative effect on baseline SAP viaSAP_BASE_ref * (1 + e_sexf_sap0 * SEXF)withe_sexf_sap0 = -0.30; female baseline is ~30% lower than male),Xu_2020_daratumumab.R(additive shift on V1(1 + e_sexf_vc * SEXF)withe_sexf_vc = -0.205: female V1 is 20.5% lower than male, reference category 0 = male),Guiastrennec_2016_gastric_emptying.R(multiplicative +40.7% strengthening of the caloric-feedback slope SLPCAL on gastric emptying in females;SLPCAL_eff = SLPCAL * (1 + 0.407 * SEXF)). -
Notes: When translating a model that used
SEXM, flag the sign/reference-category inversion to the user.
PREG (canonical for pregnancy status indicator)
- Description: 1 = pregnant, 0 = non-pregnant. Time-fixed per subject in trial cohorts that enrol pregnant and non-pregnant women in parallel; not a time-varying flag.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (non-pregnant).
-
Source aliases: none known; source NONMEM control
streams typically use
PREGdirectly. -
Example models:
Birgersson_2019_artesunate.R(multiplicative effect on dihydroartemisinin clearance; the published structural CLM = 190 L/h is reported with the source-paper reference category PREG = 1, so the model file applies the effect via(1 + e_preg_cl_dha * (1 - PREG))withe_preg_cl_dha = -0.214to preserve verbatim source values; non-pregnant women have ~21% lower CLM relative to pregnant women). -
Notes: Use this canonical for adult clinical-trial
models that test a pregnancy-vs-non-pregnancy contrast (typical
settings: malaria-in-pregnancy PK, antiviral-in-pregnancy PK). Trimester
or gestational-age stratification within the pregnant cohort should use
a separate canonical (e.g., gestational-age weeks via
GAor a trimester indicator, ratified separately when needed). The canonical convention is reference category 0 (non-pregnant) following the broader pharmacology default; source papers that use the pregnant cohort as the reference (Birgersson 2019) preserve their published structural values via a(1 - PREG)form on the effect coefficient. Ratified canonically on 2026-05-07.
CHILD (canonical for child age-cohort indicator)
- Description: 1 = subject is a child, 0 = not a child.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (not child, i.e., adult baseline).
-
Source aliases:
-
PED– used in the Schoemaker 2018 LEV / BRV pediatric extrapolation (DDMODEL00000239) as the pediatric-vs-adult indicator that gates the Markov-amplitude term, the overdispersion IIV, and the four pediatric offsets on log baseline rate / mixture / placebo / Emax / EC50.
-
-
Example models:
CarlssonPetri_2021_liraglutide.R,Schoemaker_2018_levetiracetam.R(DDMODEL00000239). -
Notes: Age-group indicator used alongside
ADOLESCENT; paper’s age cutoffs must be captured incovariateData[[CHILD]]$notes.
Pediatric / maturation
PAGE (canonical for postmenstrual age)
-
Description: Postmenstrual age in months
(
GA_weeks / 4.35 + postnatal_months). Time-varying. - Units: months
- Type: continuous
- Scope: general
-
Example models:
Clegg_2024_nirsevimab.R,Robbie_2012_palivizumab.R.
PNA (canonical for postnatal age)
- Description: Postnatal age (chronological since birth). Time-varying.
- Units: months
- Type: continuous
- Scope: general
-
Example models:
Hu_2026_clesrovimab.R.
GA (canonical for gestational age at birth)
- Description: Gestational age at birth. Time-fixed per subject.
- Units: weeks
- Type: continuous
- Scope: general
-
Example models:
Hu_2026_clesrovimab.R,Clegg_2024_nirsevimab.R(folded into PAGE).
WT_BIRTH (canonical for birth weight)
-
Description: Body weight at birth. Time-fixed per
subject. Distinct from current body weight (
WT), which is time-varying. - Units: kg
- Type: continuous
- Scope: general
-
Reference category: n/a – used with
linear-deviation
(1 + e * (WT_BIRTH - ref))or power scaling. Reference value observed: 2.59 kg (Voller 2017 newborn cohort). -
Source aliases:
-
BWEIGHT– used inVoller_2017_phenobarbital.R(Voller 2017 source data column for birth weight in kg).
-
-
Example models:
Voller_2017_phenobarbital.R(linear-deviation effect on CL:clbw = 1 + 0.369 * (WT_BIRTH - 2.59)). -
Notes: Time-fixed at birth; characterises
pre-/term-newborn cohorts. Pairs with
GA(gestational age at birth) when both are reported. The conventional clinical-PK abbreviationBWTis intentionally NOT used as the canonical name because it is already used across the codebase (Gandhi 2021, Li 2019, Chen 2022, Wojciechowski 2022, Lu 2019) as a source-name alias for body weight (WT). TheWT_BIRTHform keeps theWTroot consistent with the existing body-weight canonical and avoids theBWTambiguity.
AGE_DPF (canonical for zebrafish-larval age in days post-fertilization)
- Description: Age of a zebrafish (Danio rerio) larva in days post-fertilization (dpf). Time-fixed per subject in destructive-sampling designs (each larva is harvested at exactly one observation, so its dpf is fixed at the value assigned at exposure-start).
- Units: days post-fertilization (dpf)
- Type: continuous
- Scope: specific
-
Reference category: n/a – used in van Wijk 2019
with a step-form effect on the absorption rate
(
IF (AGE_DPF > 3) k12 = k12_3 * (1 + e_age_dpf_k12)) and a per-day power-form effect on the elimination rate (k25 = k25_3 * (1 + e_age_dpf_k25)^(AGE_DPF - 3)). Reference age is 3 dpf (the youngest cohort in the study). -
Source aliases:
-
AGE– van Wijk 2019 NONMEM column. Renamed to canonicalAGE_DPFbecause the human-PK canonicalAGEdenotes subject age in years; the zebrafish dpf semantic is incompatible and would silently corrupt any future model that mixed them. Same orientation, no value transformation.
-
-
Example models:
vanWijk_2019_paracetamol.R. -
Notes: Specific scope because the meaning is
bounded to zebrafish-larval-development PK studies. Distinct from
canonical
AGE(human age in years),PNA(postnatal age in months),PAGE(postmenstrual age in months), andGA(gestational age in weeks) – none of those are appropriate for a non-mammalian organism whose developmental clock is anchored at fertilization rather than birth. Integer values 3, 4, 5 in the van Wijk 2019 dataset, but treated as a continuous covariate in the elimination-rate power form. Future zebrafish-PK or other non-mammalian-developmental-age models should reuse this canonical only when the covariate is indeed dpf-anchored; other developmental-time conventions (e.g., somite-stage, hpf, dph) would warrant separate canonicals. Ratified canonically on 2026-05-07.
Nutritional status
MAL_NOURISH (canonical for malnutrition status indicator)
-
Description: 1 = subject is malnourished at study
entry per a paper-defined anthropometric criterion (e.g., WHO
height-for-age and weight-for-age Z-scores both < -2.0 in Tikiso
2021); 0 = not malnourished. Subject-level baseline indicator; the
time-decaying recovery during nutritional supplementation is carried by
the paired
T_NUT_SUPPcolumn rather than as a time-varying value ofMAL_NOURISH. - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (not malnourished).
-
Source aliases:
-
MAL– used inTikiso_2021_abacavir.R(the dataset’s paper-defined indicator, 1 = malnourished, 0 = not malnourished).
-
-
Example models:
Tikiso_2021_abacavir.R(gates the time-decaying malnutrition effect:mal_decay = MAL_NOURISH * exp(-T_NUT_SUPP * log(2) / 12.2), which then drives multiplicative shifts of+115%on F and-64%on CL at the start of nutritional supplementation, decaying with a 12.2-day half-life). -
Notes: Specific scope because the malnutrition
definition (WHO Z-score thresholds, mid-upper arm circumference,
weight-for-height vs height-for-age, etc.) is paper-defined; per-model
covariateData[[MAL_NOURISH]]$notesmust document the criterion used. Pairs withT_NUT_SUPP(days on nutritional supplementation) when the model uses a time-decaying recovery function; otherwiseMAL_NOURISHalone serves as a static baseline indicator. Distinct from generic body-weight Z-scores (which are continuous anthropometric metrics rather than a binarised malnutrition indicator).
T_NUT_SUPP (canonical for time on nutritional supplementation)
- Description: Time elapsed since the start of nutritional / refeeding supplementation, in days. 0 at the start of supplementation; increases with time. Used by population PK models that describe the time-varying recovery of PK parameters during nutritional rehabilitation in malnourished cohorts.
- Units: days
- Type: continuous
- Scope: specific
-
Reference category: n/a – enters as the argument of
an exponential decay function
exp(-T_NUT_SUPP * log(2) / T_half)whose half-lifeT_halfis an estimated parameter (12.2 days in Tikiso 2021). -
Source aliases:
-
TNUTRI– used inTikiso_2021_abacavir.R(the dataset’s paper-defined column for days since start of nutritional supplementation; same orientation as the canonical, 0 = start of supplementation, increasing with time on supplementation).
-
-
Example models:
Tikiso_2021_abacavir.R(paired withMAL_NOURISH; drives the recovery decayexp(-T_NUT_SUPP * log(2) / 12.2)of the malnutrition effect on F and CL). -
Notes: Specific scope because the underlying
recovery dynamics are tied to the nutritional-rehabilitation protocol of
the source study. For non-malnourished subjects
(
MAL_NOURISH == 0) the value is irrelevant because the malnutrition effect is gated byMAL_NOURISH; supply 0 as a default. For fully-recovered malnourished subjects, supply a large value (e.g.,>= 100days, well beyond the 12.2-day Tikiso 2021 half-life) so the decay function reaches near zero and the effect vanishes.
DRINK_OGTT (canonical for oral-glucose-tolerance-test-only drink indicator)
- Description: Binary indicator that the postprandial test drink is glucose-only (oral glucose tolerance test, OGTT). 1 = the drink is glucose-only with no fat content (e.g., 25 / 75 / 125 g OGTT); 0 = otherwise (water, or any drink containing fat). Per-occasion (per-test-drink-administration) covariate – a single subject in a crossover challenge protocol receives different drink types across occasions, so the indicator varies per dose-record rather than per subject.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (water or fat-containing
drink). Mutually exclusive with
DRINK_FAT– both indicators cannot be 1 simultaneously. - Source aliases: paper narrative “glucose solution” / “OGTT” cohort labels driving the gastric-emptying-onset T50OGTT selection in Guiastrennec 2016.
-
Example models:
Guiastrennec_2016_gastric_emptying.R(selects the OGTT-specific half-onset time T50OGTT = 15.7 min for the gastric-emptying delay Hill function; water is recovered when DRINK_OGTT = DRINK_FAT = 0 with the onset factor pinned to 1). -
Notes: Specific scope because the OGTT /
fat-containing partition is tied to the Guiastrennec 2016
postprandial-challenge design (Studies B and C OGTT arms). Set to 1 for
Study B’s 25 / 75 / 125 g OGTT drinks and the Study C 75 g OGTT arm; set
to 0 for Study A water and for all fat-containing drinks. Pairs with
DRINK_FAT: the two indicators jointly select the appropriate gastric-emptying-delay T50 parameter (T50OGTT vs T50Fat) for the Hill onset function. Ratified canonically alongside the Guiastrennec 2016 gastric-emptying / CCK / GBE extraction.
DRINK_FAT (canonical for fat-containing test-drink indicator)
- Description: Binary indicator that the postprandial test drink contains fat (any nonzero fat content). 1 = the drink contains fat (e.g., the Study C low / medium / high-fat isocaloric drinks, or the Study D medium-high-fat drink); 0 = otherwise (water or glucose-only OGTT drinks). Per-occasion covariate – a single subject in a crossover challenge protocol receives different drink types across occasions.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (water or glucose-only
drink). Mutually exclusive with
DRINK_OGTT– both indicators cannot be 1 simultaneously. - Source aliases: paper narrative “low-fat” / “medium-fat” / “high-fat” / “medium-high-fat” cohort labels driving the gastric-emptying-onset T50Fat selection in Guiastrennec 2016.
-
Example models:
Guiastrennec_2016_gastric_emptying.R(selects the fat-specific half-onset time T50Fat = 23.1 min for the gastric-emptying delay Hill function). -
Notes: Specific scope because the fat-containing
partition is tied to the Guiastrennec 2016 postprandial-challenge
design. Pairs with
DRINK_OGTT: the two indicators jointly select the appropriate gastric-emptying-delay T50 parameter (T50OGTT vs T50Fat) for the Hill onset function. Ratified canonically alongside the Guiastrennec 2016 gastric-emptying / CCK / GBE extraction.
Pregnancy / hormonal status
TPP (canonical for time postpartum (time after delivery))
- Description: Time elapsed since delivery, in weeks. 0 during pregnancy and at delivery; increases after delivery. Used by popPK models that describe the time-varying postpartum recovery of pregnancy-induced PK changes (e.g., renal blood flow, hepatic enzyme activity) that return gradually to the prepregnant baseline over weeks to months rather than instantaneously at delivery.
- Units: weeks
- Type: continuous
- Scope: general
-
Reference category: n/a – enters as the argument of
a sigmoidal recovery function
(TPP^gamma) / (TPP^gamma + T50^gamma)whoseT50(weeks-to-50%-recovery) andgamma(shape parameter) are estimated. For pregnant visits (TPP = 0) the sigmoid evaluates to 0 and the pregnancy-state PK parameter is unchanged; for far-postpartum visits (TPP >> T50) the sigmoid approaches 1 and the parameter reaches its asymptotic non-pregnant value. -
Source aliases:
-
TPP– de Kock 2017 NONMEM column for time after delivery in weeks; same orientation, no transformation; assigned 0 for samples collected during pregnancy.
-
-
Example models:
deKock_2017_sulfadoxinePyrimethamine.R(sigmoidal effect on sulfadoxine CL with asymptotic fractional change -0.757, T50 = 6.35 weeks, gamma = 4.90; the sigmoid approaches its asymptote ~13 weeks postpartum, consistent with the literature for return of GFR and renal blood flow to prepregnant values within 6-12 weeks postpartum). -
Notes: Paired with
PREG(pregnancy status indicator) when the source paper models pregnancy as a step contrast on one PK parameter and as a sigmoidal time-decay on another. During pregnancyTPP = 0; after deliveryTPP > 0. Document the postpartum sampling window incovariateData[[TPP]]$notesper model. Distinct fromT_NUT_SUPP(time on nutritional supplementation, days) andGA(gestational age at birth, weeks) – those are different timescale covariates anchored to different events. Ratified canonically on 2026-05-18 alongside the de Kock 2017 sulfadoxine/pyrimethamine extraction.
TERM_BIRTH (canonical for term-vs-preterm birth indicator)
-
Description: Binary indicator of term-vs-preterm
birth status;
1= term birth (>= 37 weeks gestation),0= preterm birth (< 37 weeks gestation). Time-fixed per subject. In Allegaert 2015 (paracetamol PK in young women) the indicator is used to select between two typical-value clearances for the sulphate-formation pathway. - Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (preterm). Effect form is the
symmetric
CL = TERM_BIRTH * theta_term + (1 - TERM_BIRTH) * theta_pretermselection – so neither category is “the multiplicative reference”, and both per-stratum clearances are estimated parameters. -
Source aliases:
-
TERM(Allegaert 2015 NONMEM column; same orientation, no transformation) – used inAllegaert_2015_paracetamol.R.
-
-
Example models:
Allegaert_2015_paracetamol.R. -
Notes: Distinct from
GA(continuous gestational age in weeks):TERM_BIRTHis the binarized version with the conventional 37-week cutoff. UseGAwhen the source paper carries gestational age as a continuous covariate; useTERM_BIRTHonly when the paper itself dichotomizes. Do not deriveTERM_BIRTHfromGAprogrammatically insidemodel()– the term-cutoff convention belongs in data assembly, not the model file.
CONMED_BIRTHCONTROL (canonical for oral-contraceptive use indicator)
-
Description: Binary indicator of oral hormonal
contraceptive use;
1= currently taking an oral contraceptive (estrogen-progestin or progestin-only pill),0= not on hormonal contraception. Time-varying as women cycle on/off contraception across study occasions. - Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (no oral contraceptive).
Effect form in Allegaert 2015 is multiplicative:
CL_glucuronide *= theta_CONMED_BIRTHCONTROLwhenCONMED_BIRTHCONTROL == 1, withtheta_CONMED_BIRTHCONTROL = 1.46(estrogen-driven UGT2B7 induction). -
Source aliases:
-
BC(Allegaert 2015 NONMEM column; same orientation, no transformation) – used inAllegaert_2015_paracetamol.R.
-
-
Example models:
Allegaert_2015_paracetamol.R. -
Notes: Specific scope because the canonical
encoding pools all oral contraceptive types (combined / progestin-only)
into a single binary; future models that need to distinguish
formulations should register a finer-grained canonical (e.g.,
CONMED_BIRTHCONTROL_COMBINED,CONMED_BIRTHCONTROL_PROGESTIN). The full-word canonical name was chosen over a shorterBC_USEform for clarity in source traces.
DIS_EOPE (canonical for early-onset pre-eclampsia indicator)
-
Description: Binary indicator of early-onset
pre-eclampsia (eoPE);
1= eoPE diagnosed before 34 weeks gestation,0= not eoPE. Time-fixed per subject within the gestational PK study window. Used by population PK models that compare drug disposition in pregnant women with vs without early-onset pre-eclampsia. - Units: (binary)
- Type: binary
- Scope: general
-
Reference category: 0 (no eoPE). Effect form in
Schoenmakers 2025 is multiplicative on CL:
CL_eoPE = CL * ThetaPE^DIS_EOPEwithThetaPE = 0.617(38% reduction in betamethasone CL when eoPE is present); encoded in nlmixr2 as the log-additive shiftcl <- exp(lcl + etalcl + e_eope_cl * DIS_EOPE)withe_eope_cl = log(0.617). -
Source aliases:
-
PE– common abbreviation in obstetric pharmacology papers when the cohort restriction is to early-onset PE only; used inSchoenmakers_2025_betamethasone.R(paper notation:eoPE/ThetaPE).
-
-
Example models:
Schoenmakers_2025_betamethasone.R(multiplicative effect on CL, encoded via the log-additive form onlcl; reduces apparent betamethasone clearance from 15.6 L/h to 9.6 L/h). -
Notes: Distinct from a broader
PREECLindicator that would pool early-onset, late-onset and postpartum pre-eclampsia. The “early-onset” specifier corresponds to diagnosis before 34 weeks gestation, the conventional clinical cutoff (Phipps 2019 Nat Rev Nephrol). Future papers that enrol mixed early-/late-onset cohorts or that report PE status without the 34-week stratification should register a separate canonical (e.g.,PREECLfor any-onset PE, orLOPEfor late-onset PE) rather than reusingDIS_EOPEwith relaxed semantics. Distinct fromPREG(pregnancy status indicator):DIS_EOPEis a complication-of-pregnancy stratifier within a pregnant cohort, whereasPREGdiscriminates pregnant-vs-non-pregnant subjects. Ratified canonically on 2026-05-11 alongside the Schoenmakers 2025 betamethasone extraction.
Vital signs
BODYTEMP (canonical for body temperature)
-
Description: Subject body temperature (typically
axillary or oral) at the relevant clinical observation. Captured at
study admission in acute-infection PK studies (fever as a marker of
acute illness severity); may be time-varying when serial temperature
measurements are recorded across visits. Document
baseline-vs-time-varying status in
covariateData[[BODYTEMP]]$notesper model. - Units: degC
- Type: continuous
- Scope: general
-
Reference category: n/a – used with
linear-deviation forms
(1 + e * (BODYTEMP - ref)). Reference values observed: 36.9 degC (Kloprogge 2013 lumefantrine; pooled-cohort median in Ugandan pregnant + non-pregnant women with uncomplicated P. falciparum malaria). -
Source aliases:
-
TEMP– common short form in malaria / infectious-disease NONMEM control streams; used inKloprogge_2013_lumefantrine.R(same orientation as the canonical, no value transformation).
-
-
Example models:
Kloprogge_2013_lumefantrine.R(linear-deviation effect on mean absorption transit time MTT:MTT_indiv = TVMTT * (1 + e_bodytemp_mtt * (BODYTEMP - 36.9))withe_bodytemp_mtt = 0.165per degC; mean transit time increases ~16.5% per degC over 36.0-39.8 degC, plausibly reflecting reduced gut motility / prolonged absorption in feverish malaria patients). -
Notes: General scope because body temperature is a
universally applicable vital sign; the Kloprogge 2013 reference value
36.9 degC is cohort-specific (Ugandan malaria cohort median) and future
models should document their own reference in
covariateData[[BODYTEMP]]$notes. Units are degrees Celsius; convert from Fahrenheit (degF) at data-assembly time, not insidemodel(). Distinct fromBODYTEMP_FEBRILE(not yet registered) which would be a binary fever indicator if a future paper dichotomises at the conventional 37.5 / 38.0 degC threshold. Ratified canonically on 2026-05-16 alongside the Kloprogge 2013 lumefantrine extraction.
Renal / hepatic function
URINE_FLOW (canonical for instantaneous urine flow rate)
- Description: Instantaneous urine flow rate (mL/h) measured over the urine collection interval that includes the current observation. Time-varying.
- Units: mL/h
- Type: continuous
- Scope: specific
-
Reference category: n/a – used with a
centered-linear-effect form
CL_renal = base + theta_URINE_FLOW * (URINE_FLOW - URINE_FLOW_ref)withURINE_FLOW_ref = 100 mL/hin Allegaert 2015. A value of0is a sentinel for “no urine collected during the interval” (i.e., the urine pathway contribution is dropped); the linear-effect term is gated byURINE_FLOW > 0and not extrapolated below the centering reference. -
Source aliases:
-
UF(Allegaert 2015 NONMEM column; same orientation, no transformation) – used inAllegaert_2015_paracetamol.R.
-
-
Example models:
Allegaert_2015_paracetamol.R. -
Notes: Specific scope because the centered-linear
effect form with the
URINE_FLOW == 0sentinel-zero rule reflects an Allegaert-specific convention rather than a universally-agreed-upon parameterization. A second model that uses a different effect form (e.g., directURINE_FLOW / URINE_FLOW_refproportional scaling, no zero-sentinel) should register its own canonical (e.g.,URINE_FLOW_PROP) rather than reusingURINE_FLOWwith conflicting semantics. The full-word canonical name was chosen over the bareUFsource-data abbreviation for clarity in source traces.
CRCL (canonical for creatinine-based renal function, BSA-normalized)
-
Description: Creatinine-based renal function
expressed in mL/min/1.73 m^2. Accepts either an MDRD-/CKD-EPI-estimated
glomerular filtration rate or a measured creatinine clearance that has
been BSA-normalized as
1.73 x CrCl / BSA. The per-modelcovariateData[[CRCL]]$descriptionandnotesmust state which method the source paper used. - Units: mL/min/1.73 m^2
- Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(CRCL / ref)^exponent. Reference values observed: 80 mL/min/1.73 m^2 (Cirincione 2017, MDRD eGFR), 90 mL/min/1.73 m^2 (Li 2019, calculated GFR), 100 mL/min/1.73 m^2 (Xu 2019, measured-CrCl BSA-normalized). -
Reference category: n/a – used with power scaling
(CRCL / ref)^exponent. Reference values observed: 80 mL/min/1.73 m^2 (Cirincione 2017, MDRD eGFR), 100 mL/min/1.73 m^2 (Xu 2019, measured-CrCl BSA-normalized), 90 mL/min/1.73 m^2 (Bajaj 2017, CKD-EPI eGFR). -
Source aliases:
-
eGFR– MDRD-estimated glomerular filtration rate; used inCirincione_2017_exenatide.RandKotani_2022_astegolimab.R.Bajaj_2017_nivolumab.Ruses the CKD-EPI variant. -
EGFR– all-caps variant. -
CRCL_BSA– BSA-normalized creatinine clearance (measured CrCl / BSA x 1.73); used inXu_2019_sarilumab.R. -
1.73*CrCl/BSA– the formula form appearing in Xu 2019 Eq. for Vm. -
cGFR– calculated/estimated GFR, BSA-normalized; used inLi_2019_abatacept.R. -
CLCR– raw Cockcroft-Gault creatinine clearance in mL/min (NOT BSA-normalized); used inDelattre_2010_amikacin.R(median 55.5 mL/min in critically ill septic adults).
-
-
Example models:
Cirincione_2017_exenatide.R(MDRD eGFR),Xu_2019_sarilumab.R(measured CrCl BSA-normalized),Kotani_2022_astegolimab.R(MDRD eGFR),Li_2019_abatacept.R(cGFR),Bajaj_2017_nivolumab.R(CKD-EPI eGFR, reference 90 mL/min/1.73 m^2),NA_NA_lidocaine.R(DDMODEL00000281; binary stratification at threshold 52.7 mL/min adding -0.319 to the GX rate constant K30 in the CRCL <= 52.7 cohort; the source.ctldoes not state the BSA-normalisation method),Delattre_2010_amikacin.R(raw Cockcroft-Gault mL/min, NOT BSA-normalized; reference 55.5 mL/min population median; additive linear effect 1.42 L/h per (CRCL/55.5) on CL). -
Notes: The two estimation methods (MDRD/CKD-EPI vs
measured CrCl) produce values in the same units and are operationally
interchangeable as a covariate on clearance. Document the method
explicitly in each model’s
covariateData[[CRCL]]$descriptionso future reviewers can trace the source assay.
CREAT (canonical for serum creatinine)
- Description: Serum creatinine concentration (baseline or time-varying).
-
Units: umol/L or mg/dL – document the unit used in
each model via
covariateData[[CREAT]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(CREAT / ref)^exponent. -
Source aliases:
-
CRE(umol/L, reference 70.73) – used inThakre_2022_risankizumab.R. -
SCR– common clinical-PK abbreviation; also Llanos-Paez 2020 source column for the patient’s individual serum creatinine.
-
-
Example models:
Thakre_2022_risankizumab.R,Hennig_2013_tobra.R(umol/L; paired withCREAT_REFfor the SCR_mean / SCR ratio used in the Hennig 2013 renal-function factor),Llanos_2017_gentamicin.R(umol/L; standardized per-patient againstCREAT_REFrather than a fixed cohort reference),Llanos-Paez_2020_gentamicin.R(umol/L; used as the patient’sSCR_iin the renal-function ratio(CREAT_REF / CREAT)^0.58on CL). -
Notes:
CREATchosen over the shorterCRE/SCRas the NONMEM/clinical-PK convention that is unambiguous. Per-model reference values must be documented incovariateData[[CREAT]]$notes.
CREAT_REF (canonical for sex/age/size-expected normal-mean serum creatinine)
-
Description: Externally-computed reference serum
creatinine for the individual (the expected normal SCR for a healthy
person of the same sex, age and body size). Used as the numerator of a
ratio against the patient’s measured
CREATto define a renal-function factor on clearance. -
Units: umol/L or mg/dL – must match the unit of the
paired
CREATcolumn. Document viacovariateData[[CREAT_REF]]$units. - Type: continuous
- Scope: specific
-
Reference category: n/a – used as
(CREAT_REF / CREAT)^exponentso that a patient with measured SCR equal to the population-expected normal SCR has factor 1. -
Source aliases:
-
SCR_mean– used inHennig 2013(Eq. 5:f_SCR = (SCR_mean / SCR)^theta_SCR); also Llanos-Paez 2020 paper notation for the Ceriotti 2008 age/sex-matched physiological mean SCR. -
Scrmean– Llanos-Paez 2017 paper notation; computed from Ceriotti et al. 2008 age- and sex-stratified medians (Clin Chem 54:559-566, doi:10.1373/clinchem.2007.099648).
-
-
Example models:
Hennig_2013_tobra.R,Llanos_2017_gentamicin.R(umol/L; computed externally per Ceriotti et al. 2008),Llanos-Paez_2020_gentamicin.R(umol/L; ratio(CREAT_REF / CREAT)^0.58multiplies the maturation-scaled CL). -
Notes: Specific scope because the formula used to
derive the reference value is paper-defined (Hennig 2013 cites a
combination of Ceriotti 2008, Junge 2004 and Johansson 2011
reference-interval relationships; Llanos-Paez 2017 and 2020 both use
Ceriotti 2008); a future paper that uses a different reference-SCR
derivation (e.g., a CKD-EPI-style adult-only reference, or a
Schwartz-derived paediatric-only reference) should pin its formula in
covariateData[[CREAT_REF]]$notesso that a user assembling a virtual cohort can reproduce it. When no covariate data are available to computeCREAT_REF, setCREAT_REF = CREATso the renal-function factor evaluates to 1 (matching the Hennig 2013 ‘covariate set to 1 for missing data’ rule). Ratified canonically on 2026-05-08 alongside the Hennig 2013 tobramycin extraction.
BUN (canonical for blood urea nitrogen)
-
Description: Blood urea nitrogen concentration
(baseline or time-varying). Reflects the nitrogenous waste burden being
cleared by the kidneys; rises with reduced glomerular filtration,
dehydration, increased protein catabolism, or GI bleeding. Distinct from
CREAT(the other commonly-reported renal-function marker) because urea reabsorption is flow-dependent in the renal tubule, so BUN is sensitive to volume status as well as GFR. -
Units: mg/dL or mmol/L – document the unit used in
each model via
covariateData[[BUN]]$units(1 mmol/L urea ~= 2.80 mg/dL BUN). - Type: continuous
- Scope: specific
-
Reference category: n/a – used either as a linear /
hinge effect on a PK parameter, or with power scaling
(BUN / ref)^exponent. Reference values observed: 7 mg/dL (Hall 2017 MARS hinge knot for ka; not a population median). - Source aliases: none known.
-
Example models:
Hall_2017_dapsone.R(mg/dL; population median 13 mg/dL [range 7-28]; enters the MARS-based covariate model on the absorption rate constant via the basis functionBF1 = max(0, BUN - 7), which interacts with a weight hinge to driveKa). -
Notes: Scope kept
specificpending a second model that ratifies BUN with consistent semantics; promote togeneralonce corroborated. Hall 2017 enters BUN only through a piecewise-linear MARS hinge (max(0, BUN - 7) * max(0, 63.7 - WT)), not as a power scaling – the per-modelcovariateData[[BUN]]$notesdocuments this is part of a machine-learning-driven hinge model, not a standard popPK covariate transform. Ratified canonically on 2026-05-18 alongside the Hall 2017 dapsone extraction.
HEMODIAL (canonical for intermittent-hemodialysis treatment-status indicator)
- Description: 1 = the subject was undergoing intermittent hemodialysis during the modeled period; 0 = no intermittent hemodialysis. Treatment-status flag rather than a measured renal-function value; used as a multiplicative covariate on PK parameters that change with chronic dialysis (typically CL and Vc – intermittent hemodialysis decreases vancomycin-class CL and reduces interstitial volume overload, lowering Vc). Per-subject indicator in the source data; in Goti 2018 it is treated as time-fixed at the subject level (the cohort either was or was not receiving intermittent hemodialysis during the admission), and Goti 2018 explicitly notes that actual hemodialysis-session timing was not used because of documentation limitations.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no intermittent hemodialysis).
-
Source aliases:
-
DIAL– used inGoti_2018_vancomycin.R(binary indicator on CL and Vc in a 2-compartment vancomycin popPK model). Goti 2018 Methods notes the indicator was created for the routine-TDM cohort (n = 336 hemodialysis subjects of 1812 total) and that all hemodialysis procedures were intermittent and used high-flux membranes.
-
-
Example models:
Goti_2018_vancomycin.R(multiplicative factors on CL and Vc:0.7^HEMODIALon CL and0.5^HEMODIALon Vc, so dialysis subjects have 30% lower CL and 50% lower central volume than non-dialysis subjects). -
Notes: Specific to intermittent hemodialysis (IHD).
Distinct from peritoneal dialysis (PD) and from continuous renal
replacement therapy (CRRT), each of which has different drug-extraction
kinetics and would warrant its own canonical (
PERIT_DIAL,CRRT_STATUS) if a future paper retains them as covariates. Goti 2018 treatsHEMODIALas time-fixed per subject because session-level dialysis timing was not reliably documented in the source EHR data; a future paper that resolves drug clearance during versus between dialysis sessions would use a time-varying form (or a separate per-session covariate) and the per-modelcovariateData[[HEMODIAL]]$noteswould document the time resolution. When pairingHEMODIALwithCRCL, note that the Cockcroft-Gault CRCL of an anuric hemodialysis patient is by convention very low or set per institution to a small floor value (Goti 2018 truncated CRCL > 150 mL/min to 150 mL/min and corrected SCr < 1 mg/dL in elderly subjects); residual renal function in hemodialysis subjects is highly variable and the dialysis indicator captures the bulk PK shift on top of the CRCL covariate. Ratified canonically on 2026-05-16 alongside the Goti 2018 vancomycin extraction.
CRRT_STATUS (canonical for continuous / extended renal-replacement-therapy treatment-status indicator)
- Description: 1 = the subject was undergoing a continuous or extended (long-session) extracorporeal renal-replacement-therapy modality during the modeled period (continuous venovenous hemofiltration CVVH / CVVHF, continuous venovenous hemodiafiltration CVVHDF, sustained low-efficiency dialysis SLED, extended daily diafiltration EDD-f, or similar slow-clearance / long-duration extracorporeal therapies); 0 = no such therapy. Treatment-status flag rather than a measured renal-function value; used as a multiplicative or piecewise covariate on PK parameters that change when slow extracorporeal solute removal is active (typically CL). Per-subject indicator in the source data; in Shekar 2014 it is treated as time-fixed at the subject level (the cohort either was or was not receiving RRT during the entire PK sampling period; all RRT patients were on CVVH or EDD-f continuously / daily during sampling) – the indicator captures the subject-level RRT status, not session-level on/off timing.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no continuous / extended RRT).
-
Source aliases:
-
RRT– used inShekar_2014_meropenem.R(binary indicator selecting between the RRT-fixed-CL term and the CrCL-driven non-RRT CL term in a 2-compartment meropenem popPK model). Shekar 2014 Methods describes the RRT cohort as mixed CVVH (control RRT subjects, true CRRT) and EDD-f (ECMO RRT subjects, extended daily diafiltration; pharmacokinetically CRRT-like for slow-clearance solutes such as meropenem) and the model treats the modalities as a single binary covariate without distinguishing them.
-
-
Example models:
Shekar_2014_meropenem.R(piecewise CL:TVCL = exp(lcl) * CRRT_STATUS + e_crcl_cl * CRCL_in_Lh * (1 - CRRT_STATUS), with CRCL in raw Cockcroft-Gault mL/min converted to L/h insidemodel(); 5/11 ECMO patients and 5/10 controls were on RRT). -
Notes: Distinct from
HEMODIAL(intermittent hemodialysis IHD only) and fromDIAL(per-time-point session gate in within-subject time-varying dialysis-clearance models such as Liesenfeld 2013 dabigatran). Anticipated as a future canonical in theHEMODIALregister entry alongsidePERIT_DIALfor peritoneal dialysis. Shekar 2014 ratification uses a mixed CVVH + EDD-f cohort because the source paper treats them identically as a single binary RRT covariate; a future paper that retains modality as a separate covariate (e.g. CVVH vs SLED vs CVVHDF) would either reuseCRRT_STATUSwith finer per-modality columns layered on top, or warrant its own modality-specific canonical (CVVH_STATUS,SLED_STATUS, etc.). When pairingCRRT_STATUSwithCRCL, note that Cockcroft-Gault CrCL is conventionally not defined / not reported for RRT-dependent subjects; Shekar 2014 records CrCL only for non-RRT subjects and the model formula switches off the CrCL term whenCRRT_STATUS = 1. Ratified canonically on 2026-05-18 alongside the Shekar 2014 meropenem extraction.
ALB (canonical for serum albumin)
- Description: Serum albumin concentration.
-
Units: g/dL or g/L – document the unit used in each
model via
covariateData[[ALB]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(ALB / ref)^exponent. -
Source aliases:
-
BALB(baseline albumin) – used inZhou_2021_belimumab.R. Maps directly toALB; baseline-vs-time-varying status documented in per-model notes.
-
-
Example models:
Fasanmade_2009_infliximab.R(g/dL, reference 4.1),Thakre_2022_risankizumab.R(g/L, reference 45),Chua_2025_mirikizumab.R,Moein_2022_etrolizumab.R,Tiraboschi_2025_amlitelimab.R,Yamada_2025_zolbetuximab.R,Li_2019_abatacept.R(g/dL, reference 4.0; the Li 2019 Methods states ‘mg/dL’ which is a publication typo – see the model’scovariateData[[ALB]]$notes),Quartino_2019_trastuzumab.R(g/dL, reference 4; source columnALBU; negative exponent -0.998 on linear CL),Wang_2020_ontamalimab.R(g/L, reference 39),Zhou_2021_belimumab.R(g/L, reference 40; baseline-only, source columnBALB),Okada_2025_rocatinlimab.R(g/L, reference 44; source columnALBU; power exponent -1.30 on linear CL),Xu_2020_daratumumab.R(g/L, reference 37.0; power exponent -1.149 on linear CL). -
Notes: Ratified canonically on 2026-04-19 after
cross-model review. Unit varies by paper (g/dL in US-convention papers,
g/L in SI-convention papers); the per-model
covariateData[[ALB]]$unitsfield is load-bearing. Effect-coefficient magnitude is meaningless without the unit.
TPRO (canonical for total serum protein)
- Description: Total serum protein concentration (sum of albumin + globulins; baseline or time-varying).
-
Units: g/L or g/dL – document the unit used in each
model via
covariateData[[TPRO]]$units(1 g/dL = 10 g/L). - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(TPRO / ref)^exponent. Reference value observed: 74 g/L (Frey 2010 pooled-cohort median). -
Source aliases:
-
PROT– Frey 2010 abbreviation in the final-model equation. -
TP– common clinical-chemistry abbreviation.
-
-
Example models:
Frey_2010_tocilizumab.R(g/L, reference 74; exponent -1.1 on V1). -
Notes: Distinct from
ALB(serum albumin, the largest single component of total protein). Frey 2010 retains bothTPROandALBon V1 as separate covariates with opposite signs (TPRO negative, ALB positive) and notes there is no clear mechanistic explanation; the joint effect may reflect serum-volume modifications.TPROratified canonically on 2026-04-28 alongside the Frey 2010 extraction.
IGG (canonical for serum immunoglobulin G)
- Description: Serum total immunoglobulin G concentration (baseline or time-varying). Used in mAb PK analyses as a competition-for-FcRn-recycling covariate on therapeutic-mAb clearance – high endogenous IgG is hypothesized to displace the therapeutic mAb from FcRn salvage and increase its catabolic clearance.
-
Units: g/L (typical in SI-convention papers); also
reported as mg/dL in US-convention papers – document the unit used in
each model via
covariateData[[IGG]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(IGG / ref)^exponent. Reference values observed: 14.8 g/L (Zhou 2021), 9.65 g/L (Yang 2021). -
Source aliases:
-
BIGG(baseline IgG) – used inZhou_2021_belimumab.R. -
IGGBL(baseline IgG) – used inYang_2021_cemiplimab.R.
-
-
Example models:
Zhou_2021_belimumab.R(g/L, reference 14.8; baseline-only; exponent 0.293 on CL),Yang_2021_cemiplimab.R(g/L, reference 9.65; small positive exponent 0.184 on shared CL/Q). -
Notes: Mechanistically meaningful for
monoclonal-antibody PK because endogenous IgG competes with the
therapeutic mAb for FcRn-mediated recycling. The per-model
covariateData[[IGG]]$unitsfield is load-bearing (1 g/L ~= 100 mg/dL). Baseline-vs-time-varying status documented incovariateData[[IGG]]$notes. Distinct fromlIgG0/ IgG-as-a-state in mechanistic FcRn-competition TMDD models (e.g.,Valenzuela_2025_nipocalimab.R), where IgG is a dynamic state, not a baseline covariate; useIGGonly when the source paper treats IgG as a static (baseline) covariate column.
IGM (canonical for serum immunoglobulin M)
- Description: Serum total immunoglobulin M (IgM) concentration (baseline). Used in IgRT population-PK analyses as a proxy for B-cell antibody-producing capacity / humoral function – IgM is the first antibody produced after B-cell activation, so circulating IgM reflects ongoing B-cell activity prior to class-switching to IgG.
-
Units: g/L (typical SI-convention reporting); also
reported as mg/dL in US-convention papers – document the unit used in
each model via
covariateData[[IGM]]$units. - Type: continuous
- Scope: specific
-
Reference category: n/a – used with power scaling
(IGM / ref)^exponent. Reference values observed: 0.21 g/L (Cheng 2026, pooled PID + SAD pediatric cohort median). - Source aliases: none known.
-
Example models:
Cheng_2026_immunoglobulin.R(g/L, reference 0.21; baseline-only; power exponent 0.11 on baseline IgG (CBAS) – IgM enters as a humoral-capacity proxy that informs the endogenous-IgG baseline rather than directly modifying clearance). - Notes: IgM is the immune-globulin class produced by activated B cells before class-switching, so it remains detectable in patients with hypogammaglobulinaemia who still have residual B-cell function. Scope: specific because the relevance of IgM as a covariate depends on the paper’s mechanistic interpretation (in Cheng 2026 it acts on the endogenous-IgG baseline; future use cases may differ). Promote to general if a second paper retains IgM with consistent semantics. Ratified canonically on 2026-04-28.
TBILI (canonical for total bilirubin)
- Description: Total serum bilirubin concentration.
-
Units: mg/dL or umol/L – document the unit used in
each model via
covariateData[[TBILI]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(TBILI / ref)^exponent. -
Source aliases:
-
BIL(legacy NONMEM short label for total bilirubin) – used inNA_NA_lidocaine.R(DDMODEL00000281; binarised at threshold 0.53 mg/dL withBIL_HIGH = as.integer(BIL > 0.53)). -
BILT(Urien 2005 capecitabine paper’s NONMEM short label for “total bilirubin”) – used inUrien_2005_capecitabine.R(umol/L, reference 8.8; power scaling on the capecitabine non-transformation CL10 and on the 5’-DFUR -> 5-FU rate constant K34).
-
-
Example models:
Yamada_2025_zolbetuximab.R(mg/dL, reference 0.38; small positive exponent 0.0347 on V1),NA_NA_lidocaine.R(mg/dL, source columnBIL; binary effect at threshold 0.53 mg/dL on the GX elimination rate constant K30),Urien_2005_capecitabine.R(umol/L, reference 8.8; source columnBILT; positive exponent +0.32 on capecitabine non-transformation CL10 and negative exponent -0.36 on the 5’-DFUR -> 5-FU rate constant K34). -
Notes: Hepatic-function marker. Unit varies by
paper (US convention mg/dL, SI convention umol/L; 1 mg/dL ~= 17.1
umol/L). The per-model
covariateData[[TBILI]]$unitsfield is load-bearing.
DBIL (canonical for direct (conjugated) bilirubin)
-
Description: Direct (conjugated) serum bilirubin
concentration. Distinct from
TBILI: direct bilirubin is the water-soluble glucuronide-conjugated fraction processed by hepatocytes and excreted in bile, so a rise in DBIL specifically flags impaired biliary excretion / cholestasis or intrahepatic shunting, whereas total bilirubin also captures unconjugated (indirect) hyperbilirubinaemia from haemolysis or Gilbert-type conjugation defects. -
Units: mg/dL or umol/L – document the unit used in
each model via
covariateData[[DBIL]]$units. - Type: continuous
- Scope: specific
-
Reference category: n/a – used with power scaling
(DBIL / ref)^exponent. Reference values observed: 2.6 umol/L (Chen 2015 voriconazole Chinese ICU cohort population median). - Source aliases: none known.
-
Example models:
Chen_2015_voriconazole.R(umol/L, reference 2.6; negative exponent -0.40 on CL:CL = TVCL * (DBIL / 2.6)^-0.40). -
Notes: Hepatic-function / cholestasis-specific
marker. Unit varies by paper (US convention mg/dL, SI convention umol/L;
1 mg/dL ~= 17.1 umol/L). Distinct entry from
TBILIbecause direct vs total are not interchangeable: total = direct + indirect, and the two fractions track different pathophysiologic processes. Scope keptspecificpending a second model that ratifies DBIL with consistent semantics; promote togeneralonce corroborated.
AST (canonical for aspartate aminotransferase)
- Description: Serum aspartate aminotransferase activity (baseline or time-varying).
-
Units: U/L (IU/L; the two labels are used
interchangeably in the clinical-PK literature). Document per-model via
covariateData[[AST]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(AST / ref)^exponent. -
Source aliases:
-
SGOT(serum glutamic-oxaloacetic transaminase; the legacy clinical-chemistry name for AST) – used inQuartino_2019_trastuzumab.R.
-
-
Example models:
Lu_2014_trastuzumabemtansine.R(U/L, reference 27; small positive exponent 0.071 on CL),Quartino_2019_trastuzumab.R(IU/L, reference 24; source columnSGOT; positive exponent 0.205 on linear CL). -
Notes: Hepatic-function marker. Commonly reported
alongside
ALTandTBILI; register a separateALTcanonical if a future paper requires it.SGOTis the older lab-reporting name; values and units are identical toAST.
ALT (canonical for alanine aminotransferase)
- Description: Serum alanine aminotransferase activity (baseline or time-varying).
-
Units: U/L (IU/L; the two labels are used
interchangeably in the clinical-PK literature). Document per-model via
covariateData[[ALT]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(ALT / ref)^exponent. -
Source aliases:
-
SGPT(serum glutamic-pyruvic transaminase; the legacy clinical-chemistry name for ALT, parallelingSGOT->AST) – used inNA_NA_lidocaine.R(DDMODEL00000281; binarised at threshold 11 withSGPT_HIGH = as.integer(SGPT > 11)).
-
-
Example models:
Nikanjam_2019_siltuximab.R(U/L, reference 19; small negative exponent -0.096 on CL),Melhem_2022_dostarlimab.R(U/L, reference 18; small negative exponent -0.0585 on CL, time-varying),NA_NA_lidocaine.R(source columnSGPT; binary effects at threshold 11 on the GX rate constant K30 and on the 2,6-xylidide rate constant K40). -
Notes: Hepatic-function marker. Commonly reported
alongside
ASTandTBILI. Ratified canonically on 2026-04-24.SGPTis the older lab-reporting name; values and units are identical toALT.
ALP (canonical for alkaline phosphatase)
- Description: Serum alkaline phosphatase activity (baseline or time-varying). Liver-function / cholestasis marker; often used in popPK covariate models either as a continuous concentration with power scaling or as a binary above/below upper-limit-of-normal (ULN) indicator. When binarized inline, document the ULN threshold used (typically ~120 U/L for adults; varies by lab, age, and sex).
-
Units: U/L (IU/L; the two labels are used
interchangeably in the clinical-PK literature). Document per-model via
covariateData[[ALP]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(ALP / ref)^exponent, with a linear-deviation form, or binarized inline asalp_high <- (ALP > uln)for a binary >ULN indicator. - Source aliases: none known.
-
Example models:
Gupta_2016_lenvatinib.R(binarized inline asalp_high <- (ALP > 120); the source paper entersALPas a 0/1 NONMEM indicator withALP = 1when the ratio ALP/ULN > 1; multiplicative effect on CL/F:0.883^alp_high). -
Notes: Liver-function / cholestasis marker; routine
clinical-chemistry covariate. Commonly tested alongside
ALT/AST/GGT/TBILI. Ratified canonically alongside the Gupta 2016 lenvatinib extraction.
GGT (canonical for gamma-glutamyltransferase)
- Description: Serum gamma-glutamyltransferase activity (baseline or time-varying); hepatic / cholestatic biliary-enzyme marker.
-
Units: U/L (IU/L; the two labels are used
interchangeably in the clinical-PK literature). Document per-model via
covariateData[[GGT]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with
linear-deviation form
1 + theta * (GGT - ref)or power scaling(GGT / ref)^exponent. Reference values observed: 33 U/L (Retlich 2015 popPK linagliptin median), 32.3 U/L (Retlich 2015 popPK/PD linagliptin median). - Source aliases: none known.
-
Example models:
Retlich_2015_linagliptin.R(U/L, reference 33; linear-deviation effect on linagliptin CL with coefficient -0.0339 % per U/L deviation. The PK/PD layer uses GGT (reference 32.3 U/L) as a piecewise covariate on baseline DPP-4 activity BSL with a linear-deviation effect below GGT = 175 U/L and a constant +21.3% effect above the threshold). -
Notes: Liver-function / cholestasis marker; routine
clinical-chemistry covariate. Commonly tested alongside
ALT/AST/ALP/TBILI. The piecewise above/below-threshold form in Retlich 2015 reflects empirical saturation of the GGT-vs-DPP-4-activity relationship at extreme values. Ratified canonically alongside the Retlich 2015 linagliptin extraction.
LDH (canonical for serum lactate dehydrogenase)
- Description: Serum lactate dehydrogenase activity (baseline or time-varying). General-purpose marker of tissue / cellular turnover; in oncology PK analyses it is interpreted as a disease-burden / cell-turnover proxy.
-
Units: U/L (IU/L; interchangeable). Document
per-model via
covariateData[[LDH]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(LDH / ref)^exponentor with an additive linear-on-log formexp(coef * (log(LDH) - log(ref)))(algebraically equivalent to(log(LDH) / log(ref))^coef). Reference values observed: 217 U/L (Sanghavi 2020). -
Source aliases:
-
BLDH(baseline LDH) – used inSanghavi_2020_ipilimumab.R.
-
-
Example models:
Sanghavi_2020_ipilimumab.R(linear-on-log form on CL with reference 217 U/L; coefficient 0.703),NA_NA_lidocaine.R(DDMODEL00000281; binary stratification at threshold 195 U/L switching the typical-value baseline of the 2,6-xylidide rate constant K40). -
Notes: Universal lab marker. Sanghavi 2020
log-transforms LDH because the distribution is heavily right-skewed
(range 74-6,245 U/L over a median of 217); other papers may use a simple
(LDH/ref)^exponentform. Document the functional form incovariateData[[LDH]]$notes.
HEPIMP_MILD (canonical for mild hepatic impairment indicator)
- Description: 1 = mild hepatic impairment per the National Cancer Institute Organ Dysfunction Working Group (NCI ODWG) criteria, 0 = normal hepatic function or non-mild category. NCI ODWG mild = total bilirubin <= ULN with AST > ULN, OR total bilirubin > 1.0xULN to <= 1.5xULN with any AST.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (normal hepatic function; the moderate / severe categories are typically pooled into the reference for population PK analyses where mild impairment is the only category with non-trivial sample size).
-
Source aliases:
-
HEPIMP(with values1 = mild / 0 = others) – used inLin_2024_casirivimab.R.
-
-
Example models:
Lin_2024_casirivimab.R(multiplicative fractional change on CL),Lu_2022_patritumab.R(paired withHEPIMP_MOD_MISSING; multiplicative fractional effect 0.706 on CLDXd for mild impairment vs the normal-hepatic-function reference). -
Notes: Use this column when a model dichotomizes
hepatic-impairment status as “mild vs. others” (i.e., normal + the rare
moderate/severe cases pooled into the reference). For models that test
moderate or severe as separate categories, register additional
canonicals
HEPIMP_MOD/HEPIMP_SEVrather than overloading this entry.
HEPIMP_MOD_MISSING (canonical for composite moderate-or-data-missing hepatic impairment indicator)
- Description: 1 = moderate hepatic impairment per the NCI ODWG criteria OR baseline hepatic-function data missing/unknown; 0 = normal hepatic function or any other (non-moderate, non-missing) category. Composite indicator used by source papers that pool the moderate-impairment subgroup with patients whose hepatic-function data are missing because both subgroups are individually too small to estimate as separate effects.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (normal hepatic function;
mild impairment is typically captured by a separate
HEPIMP_MILDindicator paired with this column, so all-zero corresponds to NCI ODWG group 1 = normal). -
Source aliases:
-
HEPATIC/HEPATIC_MOD_MISSING– informal NONMEM names for the composite group.
-
-
Example models:
Lu_2022_patritumab.R(paired withHEPIMP_MILD; multiplicative fractional effect 0.532 on CLDXd; the composite group pools n = 6 moderate-impairment patients with n = 6 missing/unknown patients per Lu 2022 Table S5). -
Notes: Specific scope because the composition of
the “moderate or missing” group is paper-defined and the missing/unknown
subgroup may have a different distribution of true hepatic-function
status across studies. Use only when the source paper explicitly pools
the moderate-impairment cases with missing-data cases under a single
coefficient; for models that estimate moderate impairment separately
(without pooling missing data), register a
HEPIMP_MODcanonical instead. Ratified canonically on 2026-04-28.
B2M (canonical for serum beta-2-microglobulin)
- Description: Serum beta-2-microglobulin concentration. Low-molecular-weight (~12 kDa) protein freely filtered at the glomerulus and reabsorbed in the proximal tubule; serum levels rise with renal impairment, with increased plasma-cell turnover in multiple myeloma, and with broader lymphoid-cell turnover. Used in oncology PK analyses both as a renal-function proxy and as a tumor-burden / disease-severity covariate.
- Units: mg/L
- Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(B2M / ref)^exponent. Reference values observed: 3.90 mg/L (Fau 2020 multiple-myeloma cohort median). -
Source aliases: none;
B2Mis the universal abbreviation. -
Example models:
Fau_2020_isatuximab.R(mg/L, reference 3.90; exponent 0.343 on the steady-state linear clearance CLinf). -
Notes: In multiple myeloma B2M is part of the
International Staging System (ISS); in routine PK analyses it is
interpreted simultaneously as a renal-function and disease-burden
marker. Document the interpretation per-model via
covariateData[[B2M]]$notes.
HEPIMP (canonical for hepatic-impairment indicator (NCI ODWG classification))
- Description: Baseline hepatic-impairment indicator per the National Cancer Institute Organ Dysfunction Working Group (NCI ODWG) classification: 1 = mild or worse hepatic impairment (group >= 2 = mild, moderate, or severe), 0 = normal hepatic function (group 1).
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (normal hepatic function, NCI ODWG group 1).
-
Source aliases:
-
BHPTGRPN(categorical: 1 = normal, 2 = mild, 3 = moderate, 4 = severe; 9999 = missing) – used inLu_2019_polatuzumab.R. Decompose:HEPIMP = as.integer(BHPTGRPN > 1.5 & BHPTGRPN != 9999). -
HEP_IMP– retired canonical name; replaced byHEPIMPfor consistency with theHEPIMP_MILDfamily.
-
-
Example models:
Lu_2019_polatuzumab.R(multiplicative effect on FRAC_NS = 1.19, applied as1.19^HEPIMP). -
Notes: NCI ODWG classification (Ramalingam SS et
al., J Clin Oncol 2010;28:4507) groups subjects by total bilirubin and
AST: group 1 = normal, group 2 = mild (TBILI <= ULN and AST > ULN,
or TBILI > 1-1.5 x ULN), group 3 = moderate (TBILI > 1.5-3 x ULN),
group 4 = severe (TBILI > 3 x ULN). Source papers typically pool
groups 2-4 versus group 1 for a binary indicator because the
impaired-liver subgroups are individually small. If a future model needs
finer resolution (separate effects for mild vs moderate-or-worse), add a
parallel
HEPIMP_MODcanonical rather than overloading this one.
NASF (canonical for nonalcoholic steatohepatitis severity score combining NAS and fibrosis staging)
- Description: Integer composite severity score for nonalcoholic steatohepatitis (NASH), summing the NAFLD activity score (NAS: steatosis 0-3 + hepatocyte ballooning 0-2 + lobular inflammation 0-3, total 0-8) and the fibrosis staging score (0 = absent, 1 = perisinusoidal / pericellular, 2 = periportal, 3 = bridging, 4 = cirrhosis). Total range 0-12; healthy subjects without biopsy-confirmed NASH are assigned NASF = 0 by convention. Scores below 5 reflect a benign form of NAFLD; scores >= 5 reflect biopsy-confirmed NASH.
- Units: (count, 0-12)
- Type: count
- Scope: specific
-
Reference category: n/a – used with a linear effect
on
log(NASF / 4)for NASF >= 4 and zero contribution for NASF < 4 (so NASF = 4 and any NASF < 4 reduce to the typical-value reference). The cutoff of 4 distinguishes patients with a benign form of NAFLD from those with biopsy-confirmed NASH (Pierre 2017 Methods ‘Covariate analysis’ and references 31 and 34). -
Source aliases:
-
NASF– used inPierre_2017_morphine.R(Pierre 2017 Methods ‘Covariate analysis’).
-
-
Example models:
Pierre_2017_morphine.R(linear effect onlog(NASF / 4)for NASF >= 4 with coefficient -0.628 on M3G clearance:CL_M3G_i = CL_M3G_pop * (1 + e_nasf_cl_m3g * log(NASF / 4))for NASF >= 4 andCL_M3G_i = CL_M3G_popfor NASF < 4; higher NASF reduces M3G clearance via reduced biliary excretion and increased basolateral efflux of M3G into systemic circulation). -
Notes: The NAFLD activity score (NAS) component is
the histology score described by Bondini 2007 / Kleiner 2005 and
references therein; the fibrosis staging is the Brunt / NASH-CRN system.
The combined NASF score is the noninvasive staging proposed by
Santiago-Rolon 2015 (Proc R Health Sci J 34:189-194) and used by Angulo
2007 as the NAFLD Fibrosis Score cutoff. Scope: specific because the
precise cutoff (NASF >= 4) and the linear-on-log functional form are
Pierre 2017’s modeling choice; future papers may model NASF or its
components differently. Distinct from
HEPIMP*(NCI ODWG oncology-trial hepatic-impairment categories) and from continuous liver enzymes (ALT, AST, ALP) – NASF is a biopsy-derived disease-severity ordinal specific to NAFLD / NASH. Ratified canonically on 2026-05-18 alongside the Pierre 2017 morphine extraction.
HEPIMP_SEV (canonical for severe hepatic impairment indicator)
-
Description: 1 = severe hepatic impairment, 0 =
normal hepatic function or less-than-severe category. The classification
scheme that defines “severe” is paper-specific and must be documented in
per-model
covariateData[[HEPIMP_SEV]]$notes. Two schemes are commonly encountered:- NCI ODWG group 4: total bilirubin > 3 x ULN with any AST (Ramalingam SS et al., J Clin Oncol 2010;28:4507).
- Child-Pugh Class C: composite score 10-15 across bilirubin, albumin, INR, ascites, and encephalopathy.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (any non-severe category: normal, mild, or moderate; the model typically uses HEPIMP_SEV alongside other severity-specific indicators that partition the non-severe pool further).
-
Source aliases:
-
Child-Pugh Class C– used invanderWalt_2013_dapagliflozin.R(covariate effects on CLP_M15 and V2M; the paper dichotomizes severe hepatic impairment per the Child-Pugh classification).
-
-
Example models:
vanderWalt_2013_dapagliflozin.R(Child-Pugh Class C; multiplicative fractional effects -0.422 on the dapagliflozin -> D3OG metabolic clearance and +1.33 on the D3OG central volume of distribution; paper text “With severe HI (Child-Pugh Class C), CLP M15 decreased by 41% and V2M increased by 134%”). -
Notes: Use this column when a model dichotomizes
severe hepatic impairment as a separate indicator from milder
categories. The classification scheme (NCI ODWG vs Child-Pugh vs other)
is paper-specific and must be documented per-model. For composite
“moderate-or-severe” pooled indicators, use the parallel
HEPIMP_MODSEVcanonical rather than overloading this entry. Companion toHEPIMP_MILD(mild only) andHEPIMP_MODSEV(moderate + severe pooled); the SKILL.md anticipates each severity level as its own canonical when the source paper tests them as separate covariates.
HEPIMP_MODSEV (canonical for composite moderate-or-severe hepatic impairment indicator)
-
Description: 1 = moderate or severe hepatic
impairment, 0 = normal hepatic function or mild impairment. Composite
indicator used by source papers that pool the moderate and severe
subgroups because the severe subgroup alone is too small to support a
separate covariate-effect estimate. Distinct from
HEPIMP_MOD_MISSING(which pools moderate cases with missing-data cases, not with severe cases). The classification scheme that defines the cut points is paper-specific and must be documented in per-modelcovariateData[[HEPIMP_MODSEV]]$notes. Two schemes are commonly encountered:- NCI ODWG groups 3-4 pooled: total bilirubin > 1.5 x ULN with any AST.
- Child-Pugh Class B or C pooled: composite score >= 7.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (normal hepatic function or mild impairment; the indicator is mutually exclusive with HEPIMP_MILD, so all-zero on both indicators corresponds to the normal-function reference and HEPIMP_MILD=1 with HEPIMP_MODSEV=0 corresponds to mild-only).
-
Source aliases:
-
Child-Pugh Class B,C– used invanderWalt_2013_dapagliflozin.R(covariate effects on V3P and CLM; the paper dichotomizes moderate-or-severe hepatic impairment per the Child-Pugh classification).
-
-
Example models:
vanderWalt_2013_dapagliflozin.R(Child-Pugh Class B or C; multiplicative fractional effects -0.600 on the dapagliflozin peripheral volume of distribution V3P and -0.293 on the D3OG renal clearance CLM; paper text “Moderate or severe HI (Child-Pugh Class B or C) decreased CLM and the peripheral volume of distribution of dapagliflozin (V3P) by 29 and 60%, respectively”). -
Notes: Use this column when a model pools
moderate-and-severe hepatic impairment under a single coefficient
(typically because the severe subgroup alone is too small to estimate as
its own effect). The classification scheme (NCI ODWG vs Child-Pugh vs
other) is paper-specific and must be documented per-model. Companion to
HEPIMP_MILD(mild only) andHEPIMP_SEV(severe only). Distinct fromHEPIMP_MOD_MISSING(which pools moderate cases with subjects whose hepatic-function data are missing/unknown, not with severe cases). The composite mod-or-sev pooling is a different load-bearing convention than the mod-or-missing pooling, so the two canonicals must remain separate.
CPK (canonical for serum creatine phosphokinase / creatine kinase)
- Description: Serum creatine phosphokinase (also called creatine kinase, CK) activity (baseline or time-varying). Skeletal-muscle / cardiac-muscle injury and turnover marker; in macrophage-targeted PK/PD analyses (axatilimab, anti-CSF-1R) it is interpreted as a Kupffer-cell / tissue-macrophage clearance surrogate because Kupffer cells participate in the elimination of circulating muscle-derived enzymes.
-
Units: U/L (IU/L; interchangeable). Document
per-model via
covariateData[[CPK]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(CPK / ref)^exponent. Reference values observed: 63 U/L (Yang 2024 axatilimab; pooled-cohort median). -
Source aliases:
-
BLCPK(baseline CPK) – informal usage in Yang 2024.
-
-
Example models:
Yang_2024_axatilimab.R(baseline-only covariate on baseline NCMC concentrationBL_NCMCwith power exponent 0.376; reference 63 U/L). -
Notes: Muscle-origin enzyme distinct from
AST/ALT(hepatic) andLDH(general tissue turnover). Yang 2024 uses CPK alongsideASTandLDHas tracked safety biomarkers. Per-modelcovariateData[[CPK]]$notesshould document baseline-vs-time-varying status and the clinical interpretation in the source population (skeletal-muscle injury, macrophage-clearance surrogate, or both). Distinct from any model state variable representing CPK time-course dynamics – covariate column is the pre-dose laboratory observation.
DIAL (canonical for hemodialysis-active indicator (time-varying))
- Description: Within-subject time-varying indicator for whether an extracorporeal renal-replacement-therapy session (intermittent hemodialysis, hemofiltration, or hemodiafiltration) is currently running. 1 during the session; 0 in the interdialytic interval and in non-dialysed subjects.
- Units: (binary)
- Type: binary
- Scope: general
-
Reference category: 0 (no dialysis running). Models
that compose an additional
CL_dialysisterm should gate it byDIAL = 1so that the interdialytic clearance reduces to the intrinsic body clearance. -
Source aliases: none known; the source paper for
the ratification model uses
DIALas the data column name directly (Liesenfeld 2013 Methods). -
Example models:
Liesenfeld_2013_dabigatran.R(Michaels-equation gate;cl_total <- cl + DIAL * Michaels(BFR, DFR, KoA)). -
Notes: Distinct from a renal-impairment indicator
(subject-level baseline class) and from a renal-replacement-therapy
modality indicator (categorical: IHD vs CRRT vs SLED) –
DIALis the per-time-point gate that turns the dialysis-clearance term on and off. Pair withBFRandDFRwhen the dialysis clearance depends on flow rates; pair with a filter-specific mass-transfer coefficient (estimatedlkoain the model, not a covariate) when the Michaels parameterisation is used. Ratified canonically on 2026-05-16 alongside the Liesenfeld 2013 dabigatran extraction.
BFR (canonical for blood flow rate through the extracorporeal circuit during dialysis)
-
Description: Instantaneous blood flow rate through
the extracorporeal circuit during an active dialysis session.
Time-varying within subject; meaningful only when
DIAL = 1– in the interdialytic period the value is sentinel and the Michaels-equation term is gated off byDIAL. - Units: mL/min
- Type: continuous
- Scope: general
-
Reference category: n/a – enters the Michaels
equation together with
DFRand a hemodialyzer mass-transfer-area coefficient. Values investigated in the ratification source were 200, 300, and 400 mL/min (Liesenfeld 2013 Methods, Study Design; Table 1). - Source aliases: none known.
-
Example models:
Liesenfeld_2013_dabigatran.R. -
Notes: Pairs with
DIAL(binary on/off gate) andDFR(dialysate flow rate). Ratified canonically on 2026-05-16 alongside the Liesenfeld 2013 dabigatran extraction.
DFR (canonical for dialysate flow rate through the extracorporeal circuit during dialysis)
-
Description: Instantaneous dialysate flow rate
through the extracorporeal circuit during an active dialysis session.
Time-varying within subject; meaningful only when
DIAL = 1. - Units: mL/min
- Type: continuous
- Scope: general
-
Reference category: n/a – enters the Michaels
equation together with
BFR. The ratification source fixed DFR at 700 mL/min throughout (Liesenfeld 2013 Methods, Study Design) and additionally simulated 500 mL/min (Methods, Simulations). - Source aliases: none known.
-
Example models:
Liesenfeld_2013_dabigatran.R. -
Notes: Pairs with
DIALandBFR. Ratified canonically on 2026-05-16 alongside the Liesenfeld 2013 dabigatran extraction.
Hematology
HGB (canonical for hemoglobin)
- Description: Blood hemoglobin concentration.
-
Units: g/L or g/dL – document the unit used in each
model via
covariateData[[HGB]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(HGB / ref)^exponent. -
Source aliases: none;
HGBis the common NONMEM / clinical-PK abbreviation. -
Example models:
Yamada_2025_zolbetuximab.R(g/L, reference 118; exponent -0.374 on V1). -
Notes: Unit varies by paper (SI g/L, US g/dL; 1
g/dL = 10 g/L). The per-model
covariateData[[HGB]]$unitsfield is load-bearing.
WBC (canonical for white blood cell count)
- Description: Total white blood cell count (baseline or time-varying). In chronic lymphocytic leukaemia (CLL) populations the value is elevated because circulating leukaemic B-cells make up the majority of the count, so WBC can serve as a biomarker of target-cell burden rather than general hematology.
-
Units: 10^9 cells/L (equivalent to 10^3 cells/uL).
Document per-model via
covariateData[[WBC]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(WBC / ref)^exponent. Reference values observed: 10 x 10^9/L (Mould 2007, typical CLL Vmax normalization). -
Source aliases: none known (
WBCis the universal clinical-PK abbreviation). -
Example models:
Mould_2007_alemtuzumab.R(reference 10 x 10^9/L; exponent 0.194 on Vmax). -
Notes: Time-varying in treatment studies where the
drug depletes the leukaemic clone (e.g., alemtuzumab in CLL): WBC must
be supplied at every observation time in the event dataset. In diseases
where WBC is not therapeutically targeted the column can be treated as a
baseline-only covariate; record the per-model convention in
covariateData[[WBC]]$notes.
NLR (canonical for neutrophil-to-lymphocyte ratio)
- Description: Ratio of absolute neutrophil count to absolute lymphocyte count from a complete blood count with differential. Used as a peripheral inflammation marker. May be reported as baseline only or as a time-varying covariate.
- Units: ratio (unitless)
- Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(NLR / ref)^exponentor exponential effects. Reference values observed: 2.11 (Lin 2024, median in pooled COVID-19 + non-infected cohort). -
Source aliases: none;
NLRis the universal abbreviation in clinical-PK and inflammation-biomarker literature. -
Example models:
Lin_2024_casirivimab.R(time-varying; reference 2.11; small positive exponent +0.029 on CL). -
Notes: Document baseline-vs-time-varying status in
covariateData[[NLR]]$notes. Although it derives fromWBCdifferential counts, register it as its own canonical because the ratio (not the absolute counts) is what the model uses.
HCT (canonical for hematocrit)
- Description: Hematocrit – packed red blood cell volume fraction (baseline or time-varying).
-
Units: % (volume fraction times 100). Document
per-model via
covariateData[[HCT]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(HCT / ref)^exponent. Reference values observed: 45 % (Nestorov 2014, study-population median for severe hemophilia A adults). -
Source aliases: none;
HCTis the universal NONMEM / clinical-PK abbreviation. -
Example models:
Nestorov_2014_factorviii.R(reference 45 %, exponent -0.419 on V1). -
Notes: Higher HCT (more red-cell volume) leaves a
smaller plasma fraction within total body volume; for plasma-restricted
distribution (e.g., factor VIII activity, which circulates in plasma)
the central volume of distribution decreases as HCT rises, so the
exponent is negative. Document baseline-vs-time-varying status in
covariateData[[HCT]]$notes. Distinct fromHGB(mass concentration of hemoglobin); the two correlate but enter different mechanistic relationships.
THB_MASS (canonical for total hemoglobin mass)
-
Description: Subject-level total hemoglobin mass in
grams. Plasma-volume-independent quantity measured by the optimised
CO-rebreathing method (Schmidt 2005, Pottgiesser 2008); distinct from
HGB(mass concentration in plasma) andHCT(volume fraction). Used in erythropoiesis / RBC-regeneration models as the steady-state set point Base that drives the negative-feedback term and seeds the steady-state initial conditions of the precursor compartments. - Units: g
- Type: continuous
- Scope: specific
- Reference category: n/a – subject-level baseline supplied as a covariate column. Reference values observed: 885.42 g (Tetschke 2018 Table 1 example subject); Pottgiesser 2008 cohort mean ~870 g across 28 estimable adult-male volunteers.
-
Source aliases:
Base(Tetschke 2018 paper symbol). -
Example models:
Tetschke_2018_erythropoiesis.R(reference 885.42 g; Pottgiesser 2008 dataset of 29 healthy adult male volunteers). -
Notes: Specific scope because total hemoglobin mass
requires the optimised CO-rebreathing method to obtain (Schmidt 2005),
which is a specialised technique not present in routine clinical labs;
promote to
generalif a second model registers this quantity. Distinct fromHGB(g/L or g/dL plasma concentration) andHCT(RBC volume fraction):THB_MASSis the absolute body-pool mass and is not perturbed by short-term plasma-volume fluctuations (Pottgiesser 2008 Section 3.2 explicitly motivates the choice of mass over concentration). Sex-dimorphic: typical value in adult males is meaningfully higher than in adult females; document the sex composition of the population incovariateData[[THB_MASS]]$notes.
NEUT (canonical for absolute neutrophil count)
-
Description: Absolute neutrophil count, typically
as a baseline covariate (entered via centred-deviation
(NEUT - ref)or power scaling(NEUT / ref)^exponent) or, in semi-mechanistic myelosuppression models, as a per-subject initial-condition value for the proliferation, transit, and circulating compartments. -
Units: cells/mm^3 (equivalent to cells/uL; i.e.,
the same value reported in 10^9/L x 1000). Document per-model via
covariateData[[NEUT]]$unitsif the source paper uses a different unit (e.g.,10^9 cells/LforOzawa_2007_docetaxel.Rper the paper’s Table-3 reporting unit). - Type: continuous
- Scope: general
-
Reference category: n/a – used in centred-deviation
form
exp(coef * (NEUT - ref)), in power scaling(NEUT / ref)^exponent, or as a direct per-subject initial-condition assignment in semi-mechanistic Friberg-family models. Reference values observed: 4133 cells/mm^3 (BAST PTTE 2017 simulated cohort median;NA_NA_tte_gompertz.REvent 1 base hazard model); 5 x 10^9/L (Ozawa 2007 typical Japanese cancer cohort, used as the initial condition for the proliferation, transit, and circulation compartments). -
Source aliases:
-
BASE– per-subject baseline ANC supplied as a NONMEM data column (used inOzawa_2007_docetaxel.R; Appendix I $INPUT).
-
-
Example models:
NA_NA_tte_gompertz.R(BAST PTTE 2017 / DDMODEL00000243 Event 1 hazard model; centred at NEUT = 4133/mm^3; coefficient -1.56e-4 on the NONMEM rescaled scale, equivalent toexp(-1.56e-4 * (NEUT - 4133))on the hazard),Ozawa_2007_docetaxel.R(Friberg-extension myelosuppression PD; per-subject baseline ANC supplied via theNEUTcolumn, used as the initial condition for the proliferating, three transit, and circulating compartments per the Methods text ‘Circ (t = 0) was fixed at its observed value’). -
Notes: General scope because absolute neutrophil
count is a routine clinical-laboratory measurement that recurs across
cytotoxic-chemotherapy myelosuppression models (centred-deviation hazard
models, Friberg-family per-subject baseline initial conditions, and
time-varying ANC outputs). Promoted to general scope on 2026-05-10 to
support
Ozawa_2007_docetaxel.R. The NEUT canonical units are cells/mm^3, but the reporting unit10^9 cells/L(numerically NEUT_per_mm3 / 1000) is also common in oncology papers; per-modelcovariateData[[NEUT]]$unitsdocuments the per-paper unit. Distinct fromWBC(total white blood cell count, of which neutrophils are the largest fraction in healthy adults) –NEUTis a specific differential-count subfraction. Also distinct fromNLR(neutrophil-to-lymphocyte ratio), which is a derived ratio.
Coagulation / hemostasis biomarkers
INR_BASE (canonical for baseline international normalized ratio)
-
Description: Pre-medication baseline INR
(international normalized ratio of prothrombin time). Time-fixed per
subject (measured once, before the first warfarin dose). Used directly
in the warfarin K-PD INR equation as an additive constant
(
INR = INR_BASE + inrmax * (1 - (coag_s3 + coag_l3)/2)per Xia 2024 supplement Section 1.1) so the simulated INR returns to the subject-specific baseline when the drug is removed. - Units: (unitless ratio; INR has no units)
- Type: continuous
- Scope: general
-
Reference category: n/a – subject-specific
baseline. Default simulation value documented per-model in
covariateData[[INR_BASE]]$notes; the Xia 2024 simulation uses the total-cohort mean of 1.13 (Table 1). -
Source aliases:
-
INR_BASE,BL_INR,INRBASE– pre-medication INR column in NONMEM data sets; document the source-column name per-model incovariateData[[INR_BASE]]$source_name.
-
-
Example models:
Xia_2024_warfarin.R(additive baseline in the INR observation equation; cohort mean 1.13, SD 0.59 per Xia 2024 Table 1). -
Notes: Distinct from a time-varying INR observation
(the model’s observed
INRvariable). Healthy subjects with no anticoagulation typically have INR around 1.0; the Hamberg / Xia 2024 model treats deviations from 1.0 as a subject-specific covariate rather than an estimated parameter so the model returns to the observed baseline when warfarin is withdrawn. Ratified canonically on 2026-05-16 alongside the Xia 2024 warfarin extraction.
VWF (canonical for von Willebrand factor concentration)
- Description: Plasma concentration (or activity) of von Willebrand factor (VWF) – the multimeric carrier protein that binds and protects circulating factor VIII (FVIII) from proteolytic degradation and rapid clearance. Used as a covariate on FVIII (and FVIII-Fc) clearance because the vast majority (>95%) of circulating FVIII is in complex with VWF.
-
Units: IU/dL (equivalent to % of pooled normal
plasma); document per-model via
covariateData[[VWF]]$units. Some sources reportVWF:Ag(antigen) versusVWF:RCo(ristocetin cofactor activity); record which assay was used incovariateData[[VWF]]$notes. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(VWF / ref)^exponent. Reference values observed: 118 IU/dL (Nestorov 2014, study-population median). -
Source aliases: none;
VWFis the universal abbreviation. Source papers may writevWF(lowercase v) or specify the assay (VWF:Ag). -
Example models:
Nestorov_2014_factorviii.R(reference 118 IU/dL, exponent -0.343 on CL; VWF antigen). -
Notes: Higher VWF protects FVIII from clearance, so
the exponent on CL is negative. VWF is time-varying within an individual
(acute-phase response, age, blood group, etc.), but most published
population PK models use baseline-only VWF when the within-subject
dynamics are not characterized; document the per-model convention in
covariateData[[VWF]]$notes.
DDIMER (canonical for plasma D-dimer concentration)
- Description: Plasma D-dimer protein concentration, the fibrin-degradation peptide produced by plasmin-mediated cleavage of cross-linked fibrin. Used in vascular / coagulation-pathology models as a circulating biomarker of fibrin turnover, intra-aneurysmal thrombus burden, or systemic fibrinolytic activity.
- Units: ng/mL
- Type: continuous
- Scope: specific
-
Reference category: n/a – used either with
log10-transformed proportional scaling
log10(DDIMER) / median(log10(DDIMER))(Sherer 2012) or with categorical strata (Sherer 2012 sensitivity analysis groups: <=150, 151-300, 301-900, >900 ng/mL). Reference values observed: 326 ng/mL (Sherer 2012 cohort median; log10 approx 2.513). -
Source aliases:
-
C^(D-dimer)– used inSherer_2012_AAA.R(the symbol in Sherer 2012 Methods equation page 2).
-
-
Example models:
Sherer_2012_AAA.R(proportional log10-transformed covariate on the baseline AAA growth ratebeta1(e_ddimer_b1 = 0.90 mm/year) and on the first derivative of growth rate with sizebeta2(e_ddimer_b2 = 0.37/year)). - Notes: Specific scope until a second model registers the canonical. Time-fixed (baseline-only) in Sherer 2012 because the HIMS cohort had a single follow-up D-dimer measurement; the source paper flags this as a limitation. Cohort interquartile range 142-785 ng/mL; extrapolation outside this range is not validated by the source. The log10 transformation reflects Sherer 2012’s finding that “differences in AAA growth were predominantly driven by patients with the highest plasma D-dimer concentrations.” Distinct from the time-varying biomarker columns in indirect-response / TMDD models – DDIMER enters Sherer 2012 as a baseline regression covariate, not as a dynamic exposure / response variable. Ratified canonically on 2026-05-16 alongside the Sherer 2012 extraction.
AAA_DIAM (canonical for baseline abdominal aortic aneurysm diameter)
- Description: Abdominal aortic aneurysm (AAA) maximum infrarenal diameter, ascertained by ultrasound at study entry. Used in vascular disease-progression models as the per-subject baseline severity covariate that anchors the typical-value regression for individual-level growth parameters.
- Units: mm
- Type: continuous
- Scope: specific
-
Reference category: n/a – used in proportional form
AAA_DIAM / median(AAA_DIAM)so the effect coefficients represent the contribution at the cohort median. Reference value observed: 32.7 mm (Sherer 2012 cohort median; q1 30.8, q3 36.0). -
Source aliases:
-
Y(0)– used inSherer_2012_AAA.R(the symbol in Sherer 2012 Methods equation page 2; the baseline screening ultrasound diameter).
-
-
Example models:
Sherer_2012_AAA.R(proportional covariate on all three individual-level parameters:e_aaadiam_b0 = 32.6 mmon baseline size beta0,e_aaadiam_b1 = 2.03 mm/yearon baseline growth rate beta1, ande_aaadiam_b2 = 0.59/yearon the first derivative of growth rate with size beta2). - Notes: Specific scope until a second model registers the canonical. Time-fixed (baseline-only) per subject – the value is the single screening ultrasound diameter; the time-evolving AAA diameter during follow-up is the model’s observation, not the covariate. Sherer 2012 inclusion criterion (HIMS cohort): 30-49 mm small AAA, so extrapolation outside this range to <30 mm (non-aneurysmal aorta) or >=50 mm (surgical-referral threshold) is not validated by the source. Ratified canonically on 2026-05-16 alongside the Sherer 2012 extraction.
Disease severity scores
EASI (canonical for Eczema Area and Severity Index)
- Description: Eczema Area and Severity Index score (atopic-dermatitis severity composite; bounded continuous, scale 0-72 with higher values = more severe disease).
- Units: (score)
- Type: continuous
- Scope: general
-
Reference category: n/a – healthy volunteers have
EASI = 0. Effect enters as an additive term in models that pool AD
patients with HV (e.g.,
Tiraboschi_2025_amlitelimab.R). -
Source aliases:
-
BEASI(baseline EASI) – used inTiraboschi_2025_amlitelimab.R.
-
-
Example models:
Tiraboschi_2025_amlitelimab.R. -
Notes: When used as a time-invariant baseline
covariate (
BEASI), document incovariateData[[EASI]]$notes. Canonical name isEASIwithout theBprefix to match theAGE/WT/ALBpattern where baseline vs time-varying status is recorded in notes rather than the column name.
MGADL (canonical for Myasthenia Gravis Activities of Daily Living score)
- Description: Myasthenia Gravis Activities of Daily Living score – eight-item patient-reported outcome measure (each item 0-3), total 0-24, higher values = greater symptom severity and functional limitation.
- Units: (score)
- Type: continuous
- Scope: general
-
Reference category: n/a – healthy participants (no
gMG) have
MGADL = 0by definition. Effect enters as a baseline covariate on MG-ADL response parameters in gMG cohorts. - Source aliases: none known.
-
Example models:
Valenzuela_2025_nipocalimab.R(reference 7 points; power-form effect onIDecplaceboand on the slope between MG-ADL change and IgG reduction). -
Notes: Baseline-only in Valenzuela 2025 (the
observation is the absolute change from baseline MG-ADL). When used
time-varying (e.g., in pure PD models driven by disease-progression
dynamics), document in
covariateData[[MGADL]]$notes. Canonical name isMGADLwithout aBLprefix to match theEASI/AGE/WT/ALBpattern.
BCVA (canonical for best-corrected visual acuity)
- Description: Best-corrected visual acuity score measured on the Early Treatment Diabetic Retinopathy Study (ETDRS) chart, expressed as the number of letters read correctly (0-100; higher values = better vision). Used as a baseline severity covariate in ophthalmology PK/PD models of anti-VEGF treatment.
- Units: ETDRS letters (0-100)
- Type: continuous
- Scope: specific
- Reference category: n/a – used as a baseline input to set the initial condition of an indirect-response BCVA state or as a power-form effect on response parameters. Reference value observed: 55 letters (Mulyukov 2018 narrative: mean study-population baseline BCVA).
-
Source aliases:
-
BVA(baseline visual acuity) – used inMulyukov_2018_ranibizumab.R.
-
-
Example models:
Mulyukov_2018_ranibizumab.R(baseline BCVA used as the center for the initial-condition drawg0 = BCVA + eta_g0). -
Notes: Ophthalmology-specific. Baseline-only in
Mulyukov 2018 (carried once per subject and used only as the starting
BCVA for the indirect-response model). Canonical name drops the
Bprefix to match theEASI/AGE/WT/ALBpattern (baseline-vs-time-varying status recorded incovariateData[[BCVA]]$notes). Scope isspecificuntil a second ophthalmology model ratifies the name; at that point promote togeneral.
PREV_AE_SCORE (canonical for previous-time-step ordinal adverse-event score)
- Description: Ordinal adverse-event score recorded at the immediately preceding observation time, used as a Markov-state covariate that conditions the current-time logit / probability calculation on the previous outcome. Integer 0..N where N is the maximum AE grade in the source-paper grading scheme; convention is that the value is 0 at the first observation (no prior AE).
-
Units: (ordinal score; document per-model the scale
in
covariateData[[PREV_AE_SCORE]]$notes) - Type: count
- Scope: specific
-
Reference category: n/a – typically used as a
categorical conditioner (
IF (PREV_AE_SCORE == 0) ...) or via piecewise-FPS indicator decomposition; the natural reference isPREV_AE_SCORE = 0(no prior AE). -
Source aliases:
-
PREVSCOR– used inGirard_2012_pimasertib.R(CTCAE 0..3 ocular-AE score).
-
-
Example models:
Girard_2012_pimasertib.R(Markov-state covariate that selects per-previous-score logit thresholdsb01/b11/b21andb02/b12/b22and per-previous-scoreemaxlevels; reset to 0 at TIME = 0 per sourceIF (TIME.EQ.0) PREVSCOR=0). -
Notes: Specific scope because the ordinal scale is
paper-specific (different AE-grading schemes, different number of
categories, different grouping rules – Girard 2012 collapses CTCAE
grades 1+2 into a single “1-2” category and treats grades >=3 as a
third stratum). When assembling the simulation event table, set
PREV_AE_SCORE = 0at the first observation of every subject and update each subsequent observation to the previous observation’s sampled score – matching the NONMEMIF (TIME.EQ.0) PREVSCOR=0/PREVSCOR = DVcarry-forward idiom. Distinct fromPAIN(continuous baseline pain score) and fromMGADL/EASI(continuous severity scores not modelled as Markov states).
CDR_SOB (canonical for Clinical Dementia Rating - Sum of Boxes score)
- Description: Clinical Dementia Rating scale - Sum of Boxes (CDR-SOB) score, the unweighted sum of the six CDR-box scores (memory, orientation, judgement and problem solving, community affairs, home and hobbies, personal care; each scored 0 / 0.5 / 1 / 2 / 3). Total ranges 0-18; higher values indicate more severe cognitive and functional impairment. Widely used as a primary efficacy endpoint in Alzheimer’s-disease (AD) clinical trials and as a disease-progression biomarker in mild-cognitive-impairment (MCI) / prodromal-AD populations.
- Units: (CDR-SOB units, 0-18 score)
- Type: continuous
- Scope: general
- Reference category: n/a – the source paper centres covariate effects on dataset medians (e.g., Delor 2013 uses CDR_SOB / 2 in the DOT power-form and CDR_SOB - 1 in the mixture-logit additive form).
-
Source aliases:
-
CDR_bsl– used inDelor_2013_alzheimer.R(baseline CDR-SOB at study entry). -
CDR– alternative bare-name often seen in ADNI / CAMD-style NONMEM datasets.
-
-
Example models:
Delor_2013_alzheimer.R(time-fixed baseline covariate; enters both the per-subject DOT power form ((CDR_SOB / 2)^e_cdr_sob_dotwithe_cdr_sob_dot = -0.072) and the per-subject slow-progression mixture-logit additive form (+ e_cdr_sob_slow * (CDR_SOB - 1)withe_cdr_sob_slow = -1.27)). -
Notes: Canonical name is
CDR_SOBwithout a_BLsuffix to match theEASI/MGADL/BCVApattern (baseline-vs-time-varying status recorded incovariateData[[CDR_SOB]]$notes). The CDR sum-of-boxes form is distinct from the global CDR rating (CDR_GLOBAL, a 0 / 0.5 / 1 / 2 / 3 ordinal); the sum-of-boxes is preferred in disease-progression modelling for its finer granularity. Ratified canonically on 2026-05-16 alongside the Delor 2013 extraction.
ADAS_COG (canonical for ADAS-cog total cognitive subscale score)
- Description: Alzheimer’s Disease Assessment Scale - cognitive subscale (ADAS-cog) total score. The total-11 form ranges 0-70; the modernised total-13 form ranges 0-85. Higher values = more cognitive impairment. The ADAS-cog is the most widely used cognitive endpoint in AD clinical trials and disease-progression modelling.
-
Units: (ADAS-cog units; document the form (total-11
/ total-13) per-model in
covariateData[[ADAS_COG]]$units) - Type: continuous
- Scope: general
- Reference category: n/a – covariate effects typically centred on a dataset median (e.g., Delor 2013 centres on ADAS_COG = 12.67).
-
Source aliases:
-
ADAS_bsl– used inDelor_2013_alzheimer.R(baseline ADAS-cog total-11 at study entry). -
ADAS,ADAS_COG_11,ADAS_COG_13– alternative bare-name forms seen across ADNI / CAMD datasets.
-
-
Example models:
Delor_2013_alzheimer.R(time-fixed baseline covariate; ADAS-cog total-11 form; enters the per-subject DOT power form(ADAS_COG / 12.67)^e_adas_cog_dotwithe_adas_cog_dot = -0.0439). -
Notes: Canonical name is
ADAS_COG(no_BLsuffix; baseline-vs-time-varying recorded in notes). The total-11 vs total-13 form must be documented per-model incovariateData[[ADAS_COG]]$unitsbecause the same numeric ADAS_COG value has different clinical interpretation across the two forms. Conrado 2014 uses ADAS-cog as the modelled observation (response variable) rather than as a baseline covariate; that model file therefore does not list ADAS_COG in itscovariateData. Ratified canonically on 2026-05-16 alongside the Delor 2013 extraction.
MMSE (canonical for Mini Mental State Examination score)
- Description: Mini Mental State Examination total score (0-30; higher values = better cognitive function). A widely used cognitive-screening instrument in AD and MCI populations.
- Units: (MMSE units, 0-30 score)
- Type: continuous
- Scope: general
- Reference category: n/a – covariate effects typically centred on a dataset median (e.g., Delor 2013 centres on MMSE = 26).
-
Source aliases:
-
MMSE_bsl– used inDelor_2013_alzheimer.R(baseline MMSE at study entry).
-
-
Example models:
Delor_2013_alzheimer.R(time-fixed baseline covariate; modifies the per-subject disease-progression acceleration parameter alpha via a power form(MMSE / 26)^e_mmse_alphawithe_mmse_alpha = -2.01). -
Notes: Canonical name is
MMSE(no_BLsuffix; baseline-vs-time-varying recorded in notes). MMSE is the inverse-direction counterpart of CDR_SOB / ADAS_COG (MMSE high = healthy; CDR_SOB / ADAS_COG high = impaired); covariate-effect coefficient signs are therefore typically opposite to those for CDR_SOB / ADAS_COG. Ratified canonically on 2026-05-16 alongside the Delor 2013 extraction.
FAQ (canonical for Functional Assessment Questionnaire score)
- Description: Functional Assessment Questionnaire (Pfeffer FAQ) total score: sum of ten functional-activities items (each scored 0 = normal to 3 = dependent), total 0-30; higher values = greater functional impairment. Used as a functional-status covariate alongside cognitive scores in AD and MCI populations.
- Units: (FAQ units, 0-30 score)
- Type: continuous
- Scope: general
- Reference category: n/a – covariate effects typically centred on a dataset median (e.g., Delor 2013 centres on FAQ = 1).
-
Source aliases:
-
FAQ_bsl– used inDelor_2013_alzheimer.R(baseline FAQ at study entry).
-
-
Example models:
Delor_2013_alzheimer.R(time-fixed baseline covariate; enters the per-subject slow-progression mixture-logit additive form+ e_faq_slow * (FAQ - 1)withe_faq_slow = -0.341). -
Notes: Canonical name is
FAQ(no_BLsuffix; baseline-vs-time-varying recorded in notes). Distinct from the cognitive scores (CDR_SOB / ADAS_COG / MMSE): FAQ measures instrumental activities of daily living rather than cognitive performance, and adds incremental information about disease-stage severity in MCI cohorts. Ratified canonically on 2026-05-16 alongside the Delor 2013 extraction.
RHPNM (canonical for normalized hippocampal volume (head-size and age-adjusted))
- Description: Normalised hippocampal volume: the subject’s average left+right hippocampal volume divided by the value expected for a healthy subject of the same age and estimated intracranial volume (head size). 1.0 corresponds to the healthy reference; values below 1.0 indicate hippocampal atrophy. Derived from MRI volumetry and a healthy-subject regression on age and intracranial volume.
- Units: (unitless ratio; 1.0 = healthy reference)
- Type: continuous
- Scope: general
- Reference category: n/a – covariate effects typically centred on RHPNM = 1 (the healthy reference, e.g., Delor 2013).
-
Source aliases:
-
RHPNM– used inDelor_2013_alzheimer.R(baseline normalized hippocampal volume; Delor 2013 derivation:RHPNMbsl_i = HIPVbsl_i / HPNMbsl_iwhereHPNMbsl_i = Age_i * (-26.6268 + EICVbsl_i * 0.0016 + 3340.4395)).
-
-
Example models:
Delor_2013_alzheimer.R(time-fixed baseline covariate; enters the per-subject slow-progression mixture-logit additive form+ e_rhpnm_slow * (RHPNM - 1)withe_rhpnm_slow = 7.5, a strongly positive effect indicating that less atrophic hippocampi (RHPNM closer to 1) are associated with a higher probability of being in the slow-progressing subpopulation). -
Notes: Distinct from the raw hippocampal volume
(which would be a
HIPVcanonical not yet registered; raw HIPV is confounded with head size and age, hence the need for the normalisation). The Delor 2013 paper notes that the same effect is only marginally significant with unnormalised HIPV (P = 0.02) but strongly significant with the normalised form (RHPNM). The exact age / EICV regression coefficients are paper-specific and any future model adopting this canonical should re-derive the normalisation for its own population or document why the Delor 2013 regression is reused. Ratified canonically on 2026-05-16 alongside the Delor 2013 extraction.
ACUTE_MED_DAYS (canonical for baseline number of days/month of acute migraine medication use)
- Description: Baseline number of days per month on which acute migraine medication (triptans or ergot compounds) was used during the 28-day run-in period prior to first dose. Enters as a piecewise-linear shift on baseline migraine or moderate-to-severe headache days in migraine exposure-response models.
- Units: days/month
- Type: continuous
- Scope: specific
-
Reference category: n/a – piecewise-linear shift
with breakpoint at 5 d/mo: contributes 0 below 5 and
slope * (ACUTE_MED_DAYS - 5)above 5 (Fiedler-Kelly 2020). The 5-day breakpoint reflects the clinical guideline for medication-overuse headache. -
Source aliases: “Baseline days/month of acute
medications” – used in
FiedlerKelly_2020_fremanezumab_em.RandFiedlerKelly_2020_fremanezumab_cm.R. -
Example models:
FiedlerKelly_2020_fremanezumab_em.R(slope 0.438 d/d, episodic migraine),FiedlerKelly_2020_fremanezumab_cm.R(slope 0.460 d/d, chronic migraine). -
Notes: Specific scope because the variable is
migraine-domain-bound. Time-fixed per subject (baseline-only). When
future migraine E-R models register additional aliases or alternative
breakpoints, document them per-model and consider promoting to
general.
AMLOAD (canonical for systemic-amyloidosis whole-body amyloid load grade)
- Description: Ordinal whole-body amyloid load score in patients with systemic amyloidosis. Integer 0-3: 0 = no amyloid (healthy volunteers), 1 = small amyloid load, 2 = moderate amyloid load, 3 = large amyloid load. The grading combines organ-by-organ amyloid presence (liver, spleen, bone, adrenals, gut, heart) and SAP-scintigraphy uptake into a single per-patient severity grade at baseline. Time-fixed per subject.
- Units: (categorical 0-3)
- Type: categorical
- Scope: specific
-
Reference category: 0 (no amyloid). Sahota 2015 Eq.
2 also treats category 1 as part of the reference for the V4 effect
(categories 0 and 1 share a V4 multiplier of 1); only categories 2 and 3
carry a non-zero effect via the cumulative parameters
e_amload2_vp_sapande_amload2_vp_sap + e_amload3_vp_saprespectively. -
Source aliases: none beyond the source-paper
AMLOADcolumn. -
Example models:
NA_NA_miridesap.R(DDMODEL00000262; Sahota 2015 Eq. 2 multiplicative effect on SAP peripheral volume V4: V4 = V4_ref * (1 + e_amload2_vp_sap * I(AMLOAD>=2) + e_amload3_vp_sap * I(AMLOAD>=3)); reported effects e_amload2_vp_sap = 6.39 / e_amload3_vp_sap = 26.39 yielding ~7.4x V4 at moderate load and ~33.8x at large load),Sahota_2015_miridesap.R(paper-only extraction of the same Sahota 2015 final model with identical Eq. 2 effect on V4; values 6.39 / 26.39 taken from Table 2). -
Notes: Scope: specific because the grading scheme
is amyloidosis-specific (Sahota 2015 Methods: “The whole body amyloid
load covariate, AMLOAD, was a categorical score: 0 for no amyloid in
healthy volunteers, 1 for small, 2 for moderate, and 3 for large”). The
cumulative monotonic parameterisation in Sahota 2015 Eq. 2 encodes a
positive-only step at each grade increment. Co-used with
AMLIVERfor the binary hepatic-involvement modifier. Ratified canonically on 2026-05-15 alongside the DDMODEL00000262 / Sahota 2015 extraction.
AMLIVER (canonical for hepatic amyloid involvement indicator)
- Description: Binary indicator for the presence of amyloid in the liver as a separate organ involvement from the overall whole-body amyloid load. 1 = liver amyloid present at baseline; 0 = no liver amyloid. Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (no liver amyloid).
-
Source aliases: none beyond the source-paper
AMLIVERcolumn. -
Example models:
NA_NA_miridesap.R(DDMODEL00000262; Sahota 2015 Eq. 2 multiplicative effect on SAP intercompartmental clearance Q4: Q4 = Q4_ref * (1 + e_amliver_q4 * AMLIVER); reported effect 4.01, yielding ~5x Q4 in patients with hepatic amyloid),Sahota_2015_miridesap.R(paper-only extraction of the same Sahota 2015 final model with identical Eq. 2 effect on Q4; value 4.01 from Table 2). -
Notes: Scope: specific because the covariate is
amyloidosis-specific (Sahota 2015 Methods names AMLIVER alongside
AMSPLEEN / AMHEART as organ-specific amyloid-involvement binary
indicators; only AMLIVER was retained as a covariate in the final
model). Used in combination with the global
AMLOADgrade so the model can express both general amyloid burden and the specific hepatic-clearance modifier (the SAP-CPHPC complex is cleared by the liver, motivating the hepatic-amyloid-specific Q4 effect). Ratified canonically on 2026-05-15 alongside the DDMODEL00000262 / Sahota 2015 extraction.
Critical-illness severity
ORG_FAIL_COUNT (canonical for number of organs failing in critically ill patients)
- Description: Integer count of failing organs in a critically ill patient at a given observation day, ascertained per-day and reported as the worst-of-day count. The count is decomposed into mutually exclusive strata 0 / 1 / 2 / 3 / >=4 (Vet 2016 used the strata 0, 1, 2, 3, and 4-or-5) that select per-stratum typical clearance values; the strata are not collapsed onto a single linear or power covariate effect because the underlying organ-failure mechanisms (cardiovascular, pulmonary, renal, hepatic, neurologic, hematologic) impair drug elimination heterogeneously.
- Units: (count)
- Type: categorical
- Scope: specific
- Reference category: 0 (no organs failing). Per-stratum typical CL values are estimated for ORG_FAIL_COUNT = 1, 2, 3, and >=4; ORG_FAIL_COUNT = 0 sets the baseline typical CL (frequently FIXED at the baseline value, as in Vet 2016).
-
Source aliases:
-
ORGF– used inVet_2016_midazolam.R(DDMODEL00000249 NMTRAN$INPUTcolumn; values 0..>=4). Renamed to canonicalORG_FAIL_COUNTwhen assembling input data for the packaged model.
-
-
Example models:
Vet_2016_midazolam.R(per-stratum typical CL values: ORG_FAIL_COUNT=0 fixed at 1.6 L/h for a 5 kg child with CRP=32 mg/L; ORG_FAIL_COUNT=1 -> 1.29 L/h; ORG_FAIL_COUNT=2 -> 0.957 L/h; ORG_FAIL_COUNT=3 -> 0.842 L/h; ORG_FAIL_COUNT>=4 -> 0.678 L/h). -
Notes: Specific scope because the variable is
critical-care-population-bound (PICU / ICU). Time-varying within subject
(re-evaluated each ICU day). The Vet 2016 organ-failure ascertainment
follows the Wilkinson 1987 paediatric multiple organ system failure
(MOSF) criteria – operator-confirmed per-paper criteria should be
documented in each model’s
covariateData[["ORG_FAIL_COUNT"]]$notes. Decompose insidemodel()into binary indicators (orgf1 <- (ORG_FAIL_COUNT == 1),orgf2 <- (ORG_FAIL_COUNT == 2),orgf3 <- (ORG_FAIL_COUNT == 3),orgf_ge4 <- (ORG_FAIL_COUNT >= 4)) and select per-stratum CL with mutually-exclusive multiplicative-flag arithmetic. Ratified canonically on 2026-05-06.
SAPS_II (canonical for new Simplified Acute Physiology Score II at ICU admission)
- Description: New Simplified Acute Physiology Score II (SAPS II) at intensive-care-unit admission. Validated 17-item ICU severity-of-illness score (Le Gall, Lemeshow & Saulnier, JAMA 1993;270:2957-2963) computed from age, vital signs, laboratory values, type of admission, and chronic-disease history during the first 24 hours after ICU admission; higher scores indicate greater severity and a higher predicted hospital mortality. Range theoretically 0-163; in adult ICU cohorts admission scores typically span ~10-100 with cohort means in the 35-65 range. Time-fixed per ICU stay.
- Units: points
- Type: continuous
- Scope: specific
-
Reference category: n/a – used with power scaling
(SAPS_II / ref)^exponent. Reference value observed: 50 points (Abboud 2009 typical-subject reference for the septic-shock cohort, mean SAPS II = 64 +/- 23). -
Source aliases:
-
SAPS II(with whitespace, as printed in the source paper’s prose) – used inAbboud_2009_epinephrine.R.
-
-
Example models:
Abboud_2009_epinephrine.R(power exponent -0.67 on epinephrine CL with reference 50; higher SAPS II is associated with lower clearance). -
Notes: Specific scope because the column is
critical-care-population-bound (adult ICU) and the score’s clinical
meaning depends on the SAPS-II derivation rules; future ICU models
reusing the score with the same definition can extend
Example modelsrather than registering a new canonical. Should not be confused with SAPS I or SAPS 3 (different scoring rules / item sets) – register those under separate canonicals if a future paper uses them. Ratified canonically on 2026-05-18 alongside the Abboud 2009 epinephrine extraction. ### RACHS1 (canonical for Risk Adjustment for Congenital Heart Surgery 1 (RACHS-1) category) - Description: Integer 1-6 RACHS-1 surgical-risk category, ascertained pre-operatively from the type of congenital heart defect and the planned operation. Higher categories indicate greater perioperative risk of in-hospital mortality (Jenkins 2002 Pediatrics). Time-fixed per subject.
- Units: (categorical; 1-6 integer)
- Type: categorical
- Scope: specific
- Reference category: Paper-specific. Oualha 2014 uses RACHS-1 = 2 as the low-risk reference stratum on SV*SVR_max (categories 1 are absent from the cohort; 3 and 4 are pooled as the higher-risk stratum).
-
Source aliases:
-
RACHS-1– the publication’s printed form with a hyphen, not a valid R identifier; renamed toRACHS1when assembling input data.
-
-
Example models:
Oualha_2014_epinephrine.R(decomposed insidemodel()into a binary indicatorrachs1_high <- (RACHS1 >= 3)that selects an additive log-shift on SV*SVR_max from 0.44 to 0.26 for the high-risk pool). -
Notes: Specific scope because the variable is
paediatric-cardiac-surgery-population-bound and the reference category
depends on which RACHS-1 strata the cohort contains (Oualha 2014 has
categories 2-4 only; a paper with categories 1-6 would need a different
decomposition). Decompose inside
model()into mutually exclusive binary indicators matching the source’s pooling (e.g.,rachs1_high <- (RACHS1 >= 3)) and document the pooling rule incovariateData[["RACHS1"]]$notes.
CVP (canonical for central venous pressure)
- Description: Central venous pressure (CVP), measured in mmHg through a central venous catheter. Time-varying when monitored continuously; in cardiovascular Emax PD models it enters the mean-arterial-pressure equation (MAP = HR * SV*SVR + CVP) as an additive constant rather than a fitted covariate effect.
- Units: mmHg
- Type: continuous
- Scope: specific
- Reference category: n/a – used additively in the MAP equation. Cohort medians observed: 11 mmHg (Oualha 2014, range 8-15).
- Source aliases: none.
-
Example models:
Oualha_2014_epinephrine.R(enters Eq. 7 as the additive offset in MAP = HR * SV*SVR + CVP; the vignette defaults to the cohort median 11 mmHg when CVP is not supplied per subject). - Notes: Specific scope because CVP is meaningful only for haemodynamic-PD models that resolve mean arterial pressure into its component cardiac-output and venous-return terms. The Oualha 2014 final model does not test CVP as a fitted covariate on PK or PD parameters; the column is used only as the structural offset in the MAP equation. Future haemodynamic models that fit a covariate effect on CVP itself can re-use the canonical name.
Interferon / biomarker panels
BGENE21 (canonical for 21-gene type I interferon signature score)
- Description: Baseline 21-gene type I interferon signature score – a composite transcriptomic score summarising the expression of 21 interferon-regulated genes in whole blood relative to a healthy-donor reference, used as a biomarker of type I IFN pathway activation in SLE and related autoimmune conditions.
- Units: unitless fold-change score (relative to healthy-donor reference).
- Type: continuous
- Scope: specific
-
Reference category: n/a – used with power scaling
(BGENE21 / ref)^exponent. Reference values observed: 32 in Narwal 2013 (study-population median was 33), 12.04 in Zheng 2016 (median of the SLE phase IIb cohort, range 0.32-38.59). - Source aliases: none.
-
Example models:
Narwal_2013_sifalimumab.R(reference 32, exponent 0.0558 on CL),Zheng_2016_sifalimumab.R(reference 12.04, power effect on CL with exponent 0.09). -
Notes: Specific to drugs whose mechanism targets
the type I IFN pathway (e.g., anti-IFN-alpha antibodies like
sifalimumab, anifrolumab). Higher BGENE21 indicates stronger target
engagement / disease activity and is associated with increased drug
clearance via target-mediated mechanisms. The 21-gene panel composition
is tied to the MedImmune/AstraZeneca SLE development programme; a
different IFN gene signature (e.g., a 4-gene or 5-gene panel) should be
registered under its own canonical name (
BGENE4,IFN_SIG, …) to avoid conflating panel definitions.
BGENE21_HIGH (canonical for binary high-vs-low IFN-21-gene indicator)
- Description: 1 = subject’s baseline 21-gene type I IFN signature score is at or above the paper-specified high/low cut-off, 0 = below cut-off.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (low).
- Source aliases: none.
-
Example models:
Almquist_2022_anifrolumab.R(binary high-IFN indicator on CL). -
Notes: Pair with continuous
BGENE21when the paper reports both. The high/low cut-off is paper-specific (commonly the population median) and must be documented incovariateData[[BGENE21_HIGH]]$notesfor every model that uses this covariate. Operator decision (2026-04-28): useBGENE21_HIGH(notIFNGS_HIGH) so the link to the existingBGENE21register entry is explicit while the binary nature stays visible in the column name.
Inflammation markers
EOS (canonical for blood eosinophil count)
- Description: Blood eosinophil count (baseline or time-varying).
- Units: cells/uL
- Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(EOS / ref)^exponent. -
Source aliases:
-
BEOS(baseline EOS) – used inKotani_2022_astegolimab.R.
-
-
Example models:
Kotani_2022_astegolimab.R(reference 180 cells/uL, baseline). -
Notes: Used as a surrogate of inflammatory burden
that correlates with protein turnover and therefore mAb clearance.
Canonical name drops the
Bprefix to match theEASI/AGE/WT/ALBpattern; baseline-vs-time-varying status is documented incovariateData[[EOS]]$notes.
BLBCELL (canonical for baseline CD19+ B cell count)
- Description: Baseline CD19+ B cell count (cells/uL) measured by fluorescence-activated cell sorting (FACS) prior to first dose. Used as a covariate / scaling biomarker for B-cell-targeted antibody PK-PD models (e.g., anti-CD20 mAbs in multiple sclerosis or B cell malignancies).
- Units: cells/uL
- Type: continuous
- Scope: specific
-
Reference category: n/a – used with power scaling
(BLBCELL / ref)^exponent. Reference value observed: 200 cells/uL (Yu 2022, median of the pooled five-study cohort). -
Source aliases:
-
Bcell0– used inYu_2022_ofatumumab.R. -
BBCC(NHL Phase I/Ib/II convention; values in 10^6 cells/L = cells/uL) – used inLu_2019_polatuzumab.R.
-
-
Example models:
Yu_2022_ofatumumab.R(power effect on the maximum B-cell-lysis stimulatory effect Emax, exponent 0.275, reference 200 cells/uL),Lu_2019_polatuzumab.R(two distinct effects: power on CL_INF with input floored at 1 cell/uL, and a thresholded power on CL_T with the BLBCELL/121-cells/uL ratio floored at 1). -
Notes: Distinct from a time-varying B cell
count, which is the PD response variable rather than a covariate. Scope:
specific because the clinically relevant baseline depends on the surface
marker (CD19, CD20, CD22) and whether the panel reports total B cells or
memory/naive subsets – register a new canonical name if a future paper
uses a different marker. Both Yu 2022 (anti-CD20 ofatumumab) and Lu 2019
(anti-CD79b polatuzumab vedotin) use CD19+ counts, so the canonical is
reused; subtype-specific differences are documented in each model’s
covariateData[[BLBCELL]]$notes.
BL_PARP_PBL (canonical for baseline poly(ADP-ribose) polymerase activity in peripheral blood lymphocytes)
- Description: Subject-specific baseline (pre-dose) poly(ADP-ribose) polymerase (PARP) activity in peripheral blood lymphocytes (PBL), measured as picomoles of PAR polymer formed per 10^6 PBL by an enzyme-activity assay. Used as a covariate / scaling biomarker on the maximal-inhibition residual activity (Emin) parameter in PARP-inhibitor PK/PD models.
-
Units: pmol/10^6 PBL (document per-model via
covariateData[[BL_PARP_PBL]]$units). - Type: continuous
- Scope: specific
-
Reference category: n/a – used with power scaling
(BL_PARP_PBL / ref)^exponent. Reference value observed: 90.8 pmol/10^6 PBL (Wang 2015 rucaparib; population typical baseline E0 used as a stand-in for the unreported study-cohort median because the paper reports only the typical E0 and the exponent value, not the numeric BLB_median). -
Source aliases:
-
BLB(Wang 2015’s notation for “baseline level in blood / PBL”) – used inWang_2015_rucaparib.R.
-
-
Example models:
Wang_2015_rucaparib.R(power effect on residual maximum-inhibition parameter Emin; exponent 0.620 with the formEmin = TV(Emin) * (BL_PARP_PBL / BLB_median)^alpha; PBL paired with a separate tumor-tissue PARP activity covariate that is not yet a registered canonical because the units differ – pmol/mg protein for tumor vs pmol/10^6 PBL for blood). -
Notes: Specific scope because the column is
meaningful only for PARP-inhibitor PK/PD models (rucaparib, olaparib,
niraparib, talazoparib, veliparib, etc.) and the units are tied to the
PBL-specific assay format. A future tumor-tissue PARP activity covariate
would need a separate canonical because the units differ (pmol/mg
protein) and the biology of cellular PARP activity per mg of protein is
not numerically interchangeable with PBL-normalized PARP activity. The
Wang 2015 model uses BL_PARP_PBL only on Emin (residual maximum
inhibition) and not on E0 or IC50; per-paper effects must be documented
in each model’s
covariateData[[BL_PARP_PBL]]$notes. The paper does not publish the numeric study-cohort median of BLB used to center the covariate; the model file uses 90.8 pmol/10^6 PBL (the population typical baseline E0 reported in Wang 2015 Table 2) as a defensible default reference and documents the assumption in the vignette’s Assumptions and deviations section.
CSF1 (canonical for colony-stimulating factor 1 / macrophage-colony-stimulating factor concentration)
- Description: Plasma colony-stimulating factor 1 (CSF-1, also known as macrophage colony-stimulating factor, M-CSF) concentration (baseline or time-varying). The hematopoietic cytokine that signals through CSF-1R to drive monocyte / macrophage differentiation and survival; used as both a target-engagement biomarker (anti-CSF-1R mAbs increase circulating free CSF-1) and a baseline covariate on PK / PD parameters in CSF-1R-pathway PopPK/PD models.
-
Units: pg/mL (= ng/L; the two labels are
numerically equivalent). Document per-model via
covariateData[[CSF1]]$units. - Type: continuous
- Scope: specific
-
Reference category: n/a – used with power scaling
(CSF1 / ref)^exponent. Reference values observed: 549 pg/mL (Yang 2024 axatilimab; pooled-cohort median). -
Source aliases: none formally; informal aliases
include
BLCSF1(baseline CSF-1) andBL_CSF1(model-parameter notation in Monolix / NONMEM control streams). -
Example models:
Yang_2024_axatilimab.R(baseline-only covariate on linear clearanceCLwith power exponent 0.912 and on the model parameterBL_CSF1with power exponent 0.656; reference 549 pg/mL). -
Notes: Specific scope because the column is
meaningful only for CSF-1R-pathway-targeting drugs (axatilimab and
future anti-CSF-1R molecules). Distinct from any CSF-1 model state
representing time-course dynamics – covariate column is the pre-dose
laboratory observation, typically measured by an ELISA assay (Yang 2024
used the R&D Systems Quantikine ELISA). Per-model
covariateData[[CSF1]]$notesshould document the assay used and any LOQ-related imputation for samples below the assay’s limit of detection.
CRP (canonical for C-reactive protein)
-
Description: C-reactive protein concentration.
Covers both standard and high-sensitivity (hs-CRP) assays and both
baseline and time-varying usages. Each model’s
covariateData[[CRP]]$descriptionandnotesmust state the assay type (standard vs hs-CRP) and whether the column carries a baseline-only or time-varying value, including the paper-specific reference value used for power scaling. -
Units: mg/L (document per-model via
covariateData[[CRP]]$units). - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(CRP / ref)^exponentor exponential effectsexp(coef * (CRP - ref)). Reference values observed: 4.23 mg/L (Moein 2022, IBD standard assay), 4.31 mg/L (Moein 2022 Table 3 median), 5.21 mg/L (Thakre 2022, baseline hs-CRP), 7.41 mg/L (Chua 2025, baseline standard assay), 14.2 mg/L (Xu 2019, baseline standard assay), 15.7 mg/L (Ma 2020, baseline standard assay), 0.837 mg/dL = 8.37 mg/L (Wang 2020, IBD standard assay; the model carries the source unit mg/dL). -
Source aliases:
-
hsCRP– high-sensitivity CRP (mixed-case preserved from earlier register drafts). -
HSCRP– all-caps variant. -
CRPHS– used inThakre_2022_risankizumab.R(baseline, high-sensitivity assay). -
BLCRP– baseline CRP; used inXu_2019_sarilumab.RandMa_2020_sarilumab_das28crp.R.
-
-
Example models:
Thakre_2022_risankizumab.R,Xu_2019_sarilumab.R,Chua_2025_mirikizumab.R,Moein_2022_etrolizumab.R,Ma_2020_sarilumab_das28crp.R,Wang_2020_ontamalimab.R(mg/dL, reference 0.837). -
Notes: The prior separate
hsCRP,BLCRP, and standard-assayCRPcanonicals were merged on 2026-04-20 to a single general-scopeCRPcanonical. Assay type (standard vs hs-CRP), baseline-vs-time-varying status, and the paper-specific reference value all live in each model’scovariateData[[CRP]]$description/notes. Only aggregate values from hs-validated assays as CRP when the downstream analysis relies on low-range sensitivity; for most inflammatory-disease cohorts (IBD, RA/PsA), baseline CRP is well above the hs-sensitivity range and the distinction is moot.
AAG (canonical for alpha-1 acid glycoprotein concentration)
- Description: Serum alpha-1 acid glycoprotein (AAG; orosomucoid; ORM1) concentration, an acute-phase plasma glycoprotein that binds basic and lipophilic drugs (including taxanes such as docetaxel and paclitaxel). Elevated in cancer, inflammation, and infection; influences free-drug fraction and downstream PD effects in cytotoxic-chemotherapy myelosuppression models.
-
Units: g/L (= mg/mL; 1 g/L is the conventional
clinical-PK reporting unit). Document per-model via
covariateData[[AAG]]$unitsif a different unit is used. - Type: continuous
- Scope: general
-
Reference category: n/a – used in piecewise-linear,
power, or exponential effect forms (e.g., the Kloft 2006
cytotoxic-chemotherapy myelosuppression family fits a piecewise-linear
effect with breakpoint at the cohort median 1.34 g/L: separate slopes
apply for
AAG <= 1.34andAAG > 1.34). Reference values observed: 1.34 g/L (Kloft 2006 / Netterberg 2017, cohort median in mixed adult-cancer cohort). -
Source aliases:
-
AAG– used inNetterberg_2017_docetaxel.R(per the bundle’s NM-TRAN $INPUT block; matching Kloft 2006). -
AGP1– used inOzawa_2007_docetaxel.R(Appendix I $INPUT block); reported in mg/dL with conversion to canonical g/L viaAAG_g_per_L = AGP_mg_per_dL / 100.
-
-
Example models:
Netterberg_2017_docetaxel.R(piecewise-linear effects on baseline ANC with separate low-AAG and high-AAG slopes around median 1.34 g/L; linear effect on the drug-effect slope SL via(1 + theta * (AAG - 1.34))),Ozawa_2007_docetaxel.R(multiplicative power-form effect on the linear drug-effect slope:SLOPE = theta_SLOPE * (AAG / 0.94)^e_aag_slopewithe_aag_slope = -1.38; reference value 0.94 g/L from the published NONMEM control streamAGPm = 94mg/dL). -
Notes: General scope because serum AAG is a routine
clinical-laboratory measurement that recurs across
cytotoxic-chemotherapy population-PK / PD analyses (Bruno 1996/1998
docetaxel popPK uses AAG as a CL covariate; Kloft 2006 and downstream
Friberg-family myelosuppression models use it on baseline ANC and
drug-effect slope; Ozawa 2007 uses a power-form effect on the
drug-effect slope only). Time-fixed at baseline unless the source paper
states otherwise. The Kloft 2006 piecewise-linear breakpoint at 1.34 g/L
corresponds to the population median in their pooled cancer cohort; the
Ozawa 2007 normalisation reference 0.94 g/L is the cohort median used
inside that paper’s NONMEM control stream. Future papers may use
different breakpoints / reference values, so document the per-model
reference in
covariateData[[AAG]]$notes. Distinct fromCRP(a different acute-phase reactant with different binding properties).
IL6 (canonical for serum interleukin-6 concentration)
-
Description: Serum (or plasma) interleukin-6 (IL-6)
concentration. Pro-inflammatory cytokine; elevated in rheumatoid
arthritis, Castleman’s disease, sepsis, COVID-19, and other inflammatory
conditions. Both baseline and time-varying usages are covered; document
per-model in
covariateData[[IL6]]$noteswhether the column is baseline-only or time-varying. - Units: pg/mL (= ng/L; the two labels are numerically equivalent).
- Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(IL6 / ref)^exponentor with the log-transformed form(log(IL6 * 1000) / log(ref * 1000))^exponentthat some legacy NONMEM analyses adopt. Reference values observed: 20 pg/mL (Frey 2013 baseline; the formula(log(IL-6 * 1000)/9.9)^exponentis the algebraic equivalent of(log(IL-6) / log(20))^exponentafter the constant-factor rescaling that ties the reference log to 9.9 = log(20000)). -
Source aliases:
-
BLIL6,bIL6,IL6_BASE– baseline IL-6 (used in some NONMEM control streams; canonical drops theBLprefix per theEOS/EASI/AGEconvention with baseline-vs-time-varying status documented in per-model notes). -
IL-6,IL_6– punctuation variants seen in publication tables and figures.
-
-
Example models:
Frey_2013_tocilizumab.R(baseline IL-6, reference 20 pg/mL; log-transformed power effects on EC50, BASE, and the DMARD background-effect parameter). -
Notes: Anti-IL-6 / anti-IL-6R drugs (tocilizumab,
sarilumab, siltuximab) often see large transient increases in measured
IL-6 after dosing – when the column is time-varying for these drugs, the
assay typically detects total (free + drug-bound) IL-6 because the
antibody complex slows clearance of IL-6. Document the assay type (free
vs. total IL-6) and whether the value is pre-dose (baseline-only) or
post-dose / time-varying in each model’s
covariateData[[IL6]]$notes. Frey 2013’s formula relies on the relative log-IL-6 ratio rather than the linear concentration, so the column units must be pg/mL exactly (not ng/mL) for the published exponents to apply unchanged.
Cardiometabolic / target biomarkers
HDLC (canonical for high-density lipoprotein cholesterol)
- Description: Serum high-density lipoprotein cholesterol concentration (baseline or time-varying).
-
Units: mg/dL or mmol/L – document the unit used in
each model via
covariateData[[HDLC]]$units(1 mmol/L ~= 38.67 mg/dL for cholesterol). - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(HDLC / ref)^exponent. Reference value observed: 54 mg/dL (Frey 2010 pooled-cohort median). -
Source aliases:
-
HDL-C– Frey 2010 spelling with hyphen. -
HDL_C– common alternative spelling.
-
-
Example models:
Frey_2010_tocilizumab.R(mg/dL, reference 54; small negative exponent -0.2 on linear CL; the paper interprets the effect as a body-size surrogate rather than a mechanism). - Notes: Cardiometabolic lipid-panel covariate. In Frey 2010 it correlates with body size (HDL-C is lower in larger patients) and the small CL effect (-14% to +15% across the observed 23-135 mg/dL range) was retained but not interpreted as mechanistic.
TCHOL (canonical for total serum cholesterol)
-
Description: Serum (or plasma) total cholesterol
concentration (baseline or time-varying). Sum of HDL, LDL, VLDL, and
other lipoprotein-associated cholesterol fractions; distinct from
HDLCwhich captures only the HDL fraction. -
Units: mmol/L or mg/dL – document the unit used in
each model via
covariateData[[TCHOL]]$units(1 mmol/L ~= 38.67 mg/dL for cholesterol). - Type: continuous
- Scope: general
-
Reference category: n/a – used with
linear-deviation forms
1 + theta * (TCHOL - ref)or power scaling(TCHOL / ref)^exponent. Reference value observed: 3 mmol/L (Archary 2018, severely malnourished pediatric LPV cohort, baseline mean 2.7-2.9 mmol/L). -
Source aliases:
-
CHOL– Archary 2018 NONMEM column abbreviation; the universal short form. -
TC– alternative abbreviation common in lipid-panel literature.
-
-
Example models:
Archary_2018_lopinavir.R(mmol/L, reference 3; linear effect on apparent CL/F:1 + 0.207 * (TCHOL - 3); serves as a surrogate for nutritional / hepatic-function recovery rather than a mechanistic effect). -
Notes: In severe acute malnutrition, total
cholesterol tracks lipid-pool repletion and hepatic recovery and may
serve as a surrogate covariate when the actual driver of bioavailability
or clearance variability is not directly measured. Lopinavir is highly
protein-bound (to albumin and AAG) and lipophilic, so circulating
cholesterol can correlate with binding capacity for lipophilic drugs.
Distinct from
HDLC(HDL fraction only) and fromCRP/ALB/AAG(separate canonicals for related malnutrition / inflammation markers).
TRIG (canonical for serum triglyceride concentration)
- Description: Serum triglyceride concentration (baseline or time-varying).
-
Units: mmol/L or mg/dL – document the unit used in
each model via
covariateData[[TRIG]]$units(1 mmol/L ~= 88.5 mg/dL for triglyceride). - Type: continuous
- Scope: general
-
Reference category: n/a – used with
linear-deviation form
(1 + theta * (TRIG - ref))or power scaling(TRIG / ref)^exponent. Reference value observed: 5.3 mmol/L (Archary 2019 lamivudine equation centring – the equation centring in the source is reported as the cohort average; cohort median in Archary 2019 Table 1 is 2.2-2.3 mmol/L, see model-file Errata). - Source aliases: none known.
-
Example models:
Archary_2019_lamivudine.R(mmol/L, reference 5.3; linear-deviation effect on Vc/F with coefficient -0.13 per mmol/L deviation from the reference; lower triglyceride implies higher apparent central volume). -
Notes: Cardiometabolic lipid-panel covariate. In
Archary 2019 the inverse triglyceride–Vc relationship is interpreted as
a nutritional-status / hydrophilic-drug-distribution surrogate:
triglycerides rise during nutritional rehabilitation, and lamivudine (a
hydrophilic drug) shows decreasing apparent volume as nutritional status
improves. The linear-deviation form preserves the source’s published
parameterization; per-model
covariateData[[TRIG]]$unitsis load-bearing because the centring reference and slope are unit-specific.
LDLC (canonical for low-density lipoprotein cholesterol)
- Description: Serum low-density lipoprotein cholesterol concentration (baseline or time-varying). The pharmacologically meaningful endpoint for lipid-lowering therapies (statins, PCSK9 inhibitors, ANGPTL3 inhibitors); also serves as a baseline covariate or as the time-varying PD response in indirect-response exposure-response models.
-
Units: mg/dL or mmol/L – document the unit used in
each model via
covariateData[[LDLC]]$units(1 mmol/L ~= 38.67 mg/dL for cholesterol). - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(LDLC / ref)^exponentfor the baseline-LDLC covariate role, or with no reference (used directly as the PD response state) when it is the modelled output. Reference values observed: 211 mg/dL (Pu 2021 HoFH typical-patient definition). -
Source aliases:
-
LDL-C– common spelling with hyphen. -
LDL_C– common alternative spelling. -
LDLBL(baseline LDL-C) – used inPu_2021_evinacumab.R(Pu 2021 NM-TRAN $INPUT column for centred baseline LDL-C as a covariate on IC50).
-
-
Example models:
Pu_2021_evinacumab.R(mg/dL, baseline reference 211 mg/dL; power exponent -1.17 on IC50, where higher baseline LDL-C predicts a smaller IC50 and therefore greater sensitivity to evinacumab; LDL-C is also the PD output state initialised at the baseline value). -
Notes: Cardiometabolic lipid-panel covariate.
Distinct from
HDLC(high-density lipoprotein cholesterol) and from any total-cholesterol or non-HDL-C derivation. When LDL-C is both the response variable AND a covariate (as in Pu 2021, where baseline LDLC drives IC50 and the time-varying state is the modelled PD), document the dual role incovariateData[[LDLC]]$notes.
ANGPTL3 (canonical for angiopoietin-like protein 3 concentration)
- Description: Total serum angiopoietin-like protein 3 (ANGPTL3) concentration. ANGPTL3 is the pharmacological target for anti-ANGPTL3 monoclonal antibodies (evinacumab) and antisense oligonucleotides (vupanorsen). Baseline ANGPTL3 acts as a soluble-target biomarker that contributes to target-mediated drug disposition; higher baseline target predicts a higher saturable Vmax.
-
Units: mg/L (equivalent to ug/mL). Document
per-model via
covariateData[[ANGPTL3]]$units. - Type: continuous
- Scope: specific
-
Reference category: n/a – used with power scaling
(ANGPTL3 / ref)^exponent. Reference value observed: 0.08 mg/L (Pu 2021 typical-patient median). -
Source aliases:
-
ANGBL(baseline ANGPTL3) – used inPu_2021_evinacumab.R(Pu 2021 NM-TRAN $INPUT column for centred baseline ANGPTL3). -
ANGBL0– alternative Pu 2021 raw column.
-
-
Example models:
Pu_2021_evinacumab.R(mg/L, baseline reference 0.08 mg/L; power exponent +0.405 on Vmax, where higher baseline target predicts a faster saturable elimination – biologically consistent with evinacumab being co-cleared along with bound ANGPTL3). -
Notes: Specific scope because the column is
meaningful only for drugs whose mechanism involves ANGPTL3 (anti-ANGPTL3
mAbs / ASOs). Reusing the name for another anti-ANGPTL3 agent is
acceptable (extend the example-models list). The assay in Pu 2021
detects both free and target-bound ANGPTL3 after acid pretreatment of
serum; document the assay type (free vs. total) per model in
covariateData[[ANGPTL3]]$notes.
FPCSK9 (canonical for free (unbound) proprotein convertase subtilisin/kexin type 9 concentration)
- Description: Free (unbound, non-drug-bound) serum proprotein convertase subtilisin/kexin type 9 (PCSK9) concentration. For anti-PCSK9 monoclonal antibodies (alirocumab, evolocumab, bococizumab) the free-PCSK9 pool is the pharmacologically active target fraction; drug-target binding reduces FPCSK9 relative to total PCSK9.
- Units: ng/mL (document per-model if a paper reports a different unit).
- Type: continuous
- Scope: specific
-
Reference category: n/a – used with
linear-deviation forms
TVPARAM + theta * (FPCSK9 / ref)or power-form(FPCSK9 / ref)^theta. Reference values observed: 72.9 ng/mL (Martinez 2019 time-varying median). - Source aliases: none known.
-
Example models:
Martinez_2019_alirocumab.R(time-varying; additive-linear effect onKmwith slope -0.541 per (FPCSK9/72.9), reference 72.9 ng/mL). -
Notes: Specific scope because the column is
meaningful only for drugs whose mechanism is PCSK9 inhibition; reusing
the name for a different anti-PCSK9 agent is acceptable (add to Example
models). For non-PCSK9 drugs that use a similar target-concentration
biomarker, register a new canonical (e.g.,
FIL6R,FTNF) rather than overloadingFPCSK9. Per-modelcovariateData[[FPCSK9]]$notesshould state whether the value is baseline-only or time-varying and how missing values were imputed (Martinez 2019 used LOCF).
SBCMA (canonical for soluble B-cell maturation antigen concentration)
- Description: Baseline serum (or plasma) concentration of soluble B-cell maturation antigen (sBCMA), the shed extracellular domain of the BCMA receptor (TNFRSF17). Serves as a soluble-target biomarker for BCMA-directed therapeutics (anti-BCMA antibody-drug conjugates such as belantamab mafodotin, BCMA-targeted bispecifics, and BCMA CAR-T) – sBCMA is elevated in multiple-myeloma and reflects tumour burden, and contributes to target-mediated drug disposition by sequestering circulating drug.
-
Units: ng/mL (equivalent to ug/L; 1 ng/mL = 1
ug/L). Document per-model via
covariateData[[SBCMA]]$units. - Type: continuous
- Scope: specific
-
Reference category: n/a – used with power scaling
(SBCMA / ref)^exponent. Reference value observed: 50 ng/mL (Papathanasiou 2025 typical-patient definition). -
Source aliases:
-
SBCMABL(baseline soluble BCMA) – used inPapathanasiou_2025_belantamab.R.
-
-
Example models:
Papathanasiou_2025_belantamab.R(ng/mL, reference 50; power exponents on initial CL +0.113, on ADC Vc +0.0401, on Imax +0.160). -
Notes: Specific scope because the column is
meaningful only for drugs whose mechanism involves the BCMA receptor
(and thus a circulating soluble-target pool). Reusing the name for
another anti-BCMA agent is acceptable (extend the example-models list).
For other oncology TMDD targets register a new canonical (e.g.,
HER2_ECDalready exists for HER2; an analogousSCD20,SCD38would follow the same pattern). Multiple myeloma populations show sBCMA spanning roughly 2 to 2,000 ng/mL, so the (SBCMA/50)^exponent form should be evaluated with care over the full clinical range. ## Drug exposure metrics
CAV (canonical for average drug plasma concentration over a dosing interval)
- Description: Average plasma concentration of the modelled drug over a dosing interval (Cav = AUC_tau / tau). Used as the time-varying or per-period exposure metric in exposure-response models that feed individual empirical-Bayes PK predictions from a previously published population PK model into a downstream PD model.
-
Units: ug/mL (document per-model via
covariateData[[CAV]]$units). - Type: continuous
- Scope: specific
-
Reference category: n/a – used in Emax/EC50 (e.g.,
Emax * CAV / (EC50 + CAV)) or power (e.g.,(CAV / CavMedian)^exponent) drug-effect terms. Set to 0 for placebo periods. -
Source aliases:
CAV,Cav,CAVG. -
Example models:
FiedlerKelly_2020_fremanezumab_em.R,FiedlerKelly_2020_fremanezumab_cm.R,Schoemaker_2018_levetiracetam.R(DDMODEL00000239; LEV plasma concentration in mg/L). -
Notes: Specific scope because the value is
intrinsically tied to the modelled drug – there is no shared meaning
across drugs or studies. Each model’s
covariateData[[CAV]]$notesshould state how the Cav values are derived (e.g., empirical-Bayes from a referenced population PK model) and that the column is set to 0 for placebo periods.
DOSE_PHT_MGKGD (canonical for daily phenytoin dose per kg body weight)
- Description: Patient’s own total daily dose of phenytoin (mg) divided by current body weight (kg), expressed as mg/kg/d. Per-dose-record covariate; constant within an inter-dose interval and updated when the prescriber alters the daily dose.
- Units: mg/kg/d
- Type: continuous
- Scope: specific
-
Reference category: n/a – enters as a
self-dose-rate regressor in the dose-dependent powder bioavailability
formula
F_powder = 1 - exp(-theta / DOSE_PHT_MGKGD). Has no effect when paired withFORM_POWDER = 0(tablet); a non-NA non-zero placeholder must still be supplied. -
Source aliases:
-
Dij– used inYukawa_1990_phenytoin.R(paper’s per-record daily-dose-per-weight regressor, mg/kg/d, in the powder bioavailability equation 4 of Yukawa 1990).
-
-
Example models:
Yukawa_1990_phenytoin.R(powder bioavailabilityF_powder = 1 - exp(-9.92 / DOSE_PHT_MGKGD); F approaches 1 below ~2 mg/kg/d and decreases monotonically as the daily dose increases, reflecting the lower wettability of the Aleviatin brand phenytoin powder formulation). -
Notes: Specific scope because the value is
intrinsically tied to phenytoin (PHT) and the Yukawa 1990
powder-vs-tablet bioavailability contrast. Drug-self-dose covariates for
other drugs should register sibling canonicals (e.g.,
DOSE_<DRUG>_MGKGD) rather than reuse this name – the absolute coefficient (theta_BA2 = 9.92 in Yukawa 1990) is not transferable across drugs. Computed as the total daily dose summed across the 2-3 daily phenytoin doses (mg/d) divided by the patient’s body weight at the dose record (kg). Ratified canonically on 2026-05-10 alongside the Yukawa 1990 phenytoin extraction.
PRED_DOSE (canonical for concomitant oral prednisolone daily dose)
- Description: Concomitant oral prednisolone (or prednisolone-equivalent glucocorticoid) daily dose. Time-varying across the dosing period as the post-transplant conmed_steroid taper progresses.
-
Units: mg/day. Document per-model via
covariateData[[PRED_DOSE]]$unitswhen a paper uses a different unit (mg/kg/day) or a different glucocorticoid (methylprednisolone, dexamethasone, hydrocortisone) – in the latter case convert to prednisolone-equivalent mg/day before populating the column and record the conversion factor incovariateData[[PRED_DOSE]]$notes. - Type: continuous
- Scope: specific
-
Reference category: n/a – continuous, with 0 mg/day
(no prednisolone) the natural reference value. In Storset 2014 the
effect on tacrolimus oral bioavailability is modeled as a sigmoid-Emax
fractional reduction
(1 - Pred_max * PRED_DOSE / (Pred_50 + PRED_DOSE))withPred_max = 0.67andPred_50 = 35 mg/day(Hill = 1); document the per-model functional form incovariateData[[PRED_DOSE]]$notes. -
Source aliases:
-
Prednisolone dose– used inStorset_2014_tacrolimus.R(mg/day).
-
-
Example models:
Storset_2014_tacrolimus.R(Emax-style fractional reduction in tacrolimus oral bioavailability via prednisolone-driven induction of intestinal CYP3A / P-glycoprotein; Storset 2014 Methods Equations 4 + 6 with Hill = 1). -
Notes: Distinct from
PRED_CMAX_FREE(free prednisolone Cmax co-medication exposure) –PRED_CMAX_FREEis the modelled-from-data peak free concentration, whereasPRED_DOSEis the administered daily-dose level supplied directly from the dosing record. Both can coexist in a future model that simultaneously tests dose-driven and exposure-driven effects of prednisolone on a tacrolimus PK parameter. Distinct fromCONMED_STEROID(binary baseline / concomitant corticosteroid use indicator) andPRICORT(binary prior corticosteroid use indicator) –PRED_DOSEcarries the daily dose value, not just an on / off flag. Time-varying because tacrolimus PK depends on the conmed_steroid dose at the time of each tacrolimus observation; the conmed_steroid taper schedule must be supplied as a per-time-row covariate column. The corresponding methylprednisolone single-dose induction-bolus indicator (Storset 2014 binary covariate, not retained in the final model) would warrant a separate canonical (e.g.MPRED_BOLUS) if a future model retains it. Ratified canonically on 2026-05-08.
PRED_CMAX_FREE (canonical for free prednisolone Cmax co-medication exposure)
- Description: Maximum free (unbound, ultrafiltrable) plasma prednisolone concentration over a co-medication dosing interval, used as a co-medication-exposure covariate on a primary modelled drug (e.g., tacrolimus) when concomitant prednisolone is suspected to alter the primary drug’s PK via membrane-permeability or fluid-balance effects. Time-fixed per subject in the source paper (one Cmax value per subject, derived from limited-sampling concentrations).
- Units: nmol/L
- Type: continuous
- Scope: specific
-
Reference category: n/a – used in centred-deviation
form
(1 + e * (PRED_CMAX_FREE - ref)). Reference value observed: 155.5 nmol/L (Bergmann 2014 study median). -
Source aliases:
-
PredCmax,free/PREDCFR– used inBergmann_2014_tacrolimus.R(Bergmann 2014 Table 2 footnote; 162 nmol/L median per Table 1, 155.5 nmol/L centring value per Table 2 equation).
-
-
Example models:
Bergmann_2014_tacrolimus.R(linear deviation effect on tacrolimus apparent central volume V1/F: every 1 nmol/L increase from 155.5 nmol/L decreases V1/F by 0.28%). -
Notes: Specific scope because the value is
intrinsically tied to a particular co-medication (oral prednisolone) and
a particular reference population (kidney transplant recipients on a
tapering conmed_steroid regimen). Future popPK models that test free
prednisolone Cmax as a covariate on a different primary drug should
reuse this canonical; if a future model uses a different prednisolone
exposure metric (e.g.,
PRED_AUC0_12_FREEfor free AUC 0-12 h, orPRED_CMAX_TOTALfor total Cmax), register parallel canonicals rather than overloadPRED_CMAX_FREE. The source-paper Cmax is derived from limited-sampling concentrations at 1 / 2 / 4 hours postdose per Bergmann 2014 Methods (validated against earlier full-profile data from the same cohort). Distinct fromCAV(average concentration of the modelled drug) andCP_MGL(instantaneous concentration of the modelled drug as a time-varying PD driver) –PRED_CMAX_FREEis the maximum concentration of a co-medication, used as a per-subject covariate. Ratified canonically on 2026-05-08 alongside the Bergmann 2014 extraction.
AUC_CARBO (canonical for per-cycle average AUC of carboplatin)
- Description: Per-cycle average AUC of carboplatin used as the time-varying drug-exposure covariate driving cytotoxic tumour-death rates in tumour-size-dynamics models of platinum-based chemotherapy. The value is held step-wise constant over each chemotherapy cycle and resets at the start of the next cycle.
-
Units: carboplatin AUC units (typically
mg*min/mL); document per-model viacovariateData[[AUC_CARBO]]$units. - Type: continuous
- Scope: specific
- Reference category: n/a – set to 0 in cycles where carboplatin is not administered (e.g., post-discontinuation or non-platinum arms).
-
Source aliases:
-
CB(NONMEM$INPUTcolumn in DDMODEL00000217 / DDMODEL00000218) – used inZecchin_2016_tumorovarian.RandZecchin_2016_survival.R. The DDMORE bundles ship the simulated datasets with the column re-labelledAUC0; downstream consumers should mapAUC0->AUC_CARBO.
-
-
Example models:
Zecchin_2016_tumorovarian.R(Zecchin 2016 SLD model for advanced ovarian cancer, DDMODEL00000217),Zecchin_2016_survival.R(Zecchin 2016 OS model, DDMODEL00000218; the OS model integrates the same SLD ODE inline, with the prior IPP-fit subject-level KG/KD0/KD1/IBASE supplied via the dataset). -
Notes: Specific scope because the column meaning is
tied to a particular cytotoxic agent (carboplatin) and a particular
per-cycle averaging convention. Reusing the same column for another
platinum analogue (cisplatin, oxaliplatin) is not appropriate – register
a sibling canonical (
AUC_CISPLATIN,AUC_OXALIPLATIN) when needed. The Zecchin 2016 SLD and OS models use the value directly in the death-rate termkd0 * AUC_CARBO * tumorSize, with an internal/1000numerical scaling carried verbatim from the source$DESblock.
AUC_BAST_FW (canonical for first-week AUC in the BAST PTTE 2017 teaching dataset)
-
Description: Area under the plasma
concentration-time curve over the first week of treatment for the
unspecified hypothetical drug used in the BAST PTTE 2017 teaching
guiding-document (DDMODEL00000243). Per-subject, time-fixed (a single
early-exposure summary value carried as a baseline covariate). Used to
drive the Event 2 Gompertz hazard via centred-deviation form
exp(coef * (AUC_BAST_FW - 3065.5)). - Units: ug*h/L (per BAST PTTE guiding document Section 2.2.2)
- Type: continuous
- Scope: specific
- Reference category: n/a – used in centred-deviation form. Reference value observed: 3065.5 ug*h/L (BAST PTTE 2017 simulated cohort median; runEV2_105 base hazard).
-
Source aliases:
-
AUC– verbatim NM-TRAN$INPUTcolumn name in DDMODEL00000243’s executable .mod files and in the bundle’sSimulated_event_data.csv. Renamed toAUC_BAST_FWin the canonical register so that future models using a genericAUCcolumn with different semantics will not silently collide.
-
-
Example models:
NA_NA_tte_gompertz_ev2.R(BAST PTTE 2017 / DDMODEL00000243 Event 2 hazard model; centred at 3065.5 ug*h/L, coefficient 3.09e-4 on the NONMEM rescaled scale, equivalent toexp(3.09e-4 * (AUC_BAST_FW - 3065.5))on the hazard). -
Notes: Specific scope because the value is
intrinsically tied to an unspecified hypothetical drug and a fictional
simulated population in the BAST PTTE 2017 guiding document – the column
has no shared meaning across drugs or studies. Future models should
register a sibling canonical (e.g.,
AUC_<DRUG>) with explicit drug semantics rather than overload this name. The BAST guiding document Section 2.2.2 defines this as “AUC of drug treatment given within the first week (ug*h/L).”
AUC_GEM (canonical for per-cycle average AUC of gemcitabine)
- Description: Per-cycle average AUC of gemcitabine (sum of parent and active metabolite exposure, per Zecchin 2016 Methods) used as the time-varying drug-exposure covariate driving cytotoxic tumour-death rates in tumour-size-dynamics models of gemcitabine-containing chemotherapy.
-
Units: gemcitabine AUC units (typically
mg*h/Lor the paper’smol*day / 10^6 cellsscaling for the parent-plus-active-metabolite composite); document per-model viacovariateData[[AUC_GEM]]$units. - Type: continuous
- Scope: specific
- Reference category: n/a – set to 0 in cycles where gemcitabine is not administered (e.g., carboplatin-monotherapy arms).
-
Source aliases:
-
G(NONMEM$INPUTcolumn in DDMODEL00000217 / DDMODEL00000218) – used inZecchin_2016_tumorovarian.RandZecchin_2016_survival.R. The DDMORE bundles ship the simulated datasets with the column re-labelledAUC1; downstream consumers should mapAUC1->AUC_GEM.
-
-
Example models:
Zecchin_2016_tumorovarian.R(Zecchin 2016 SLD model for advanced ovarian cancer, DDMODEL00000217),Zecchin_2016_survival.R(Zecchin 2016 OS model, DDMODEL00000218). -
Notes: Specific scope. The Zecchin 2016 SLD and OS
models use the value directly in the death-rate term
kd1 * AUC_GEM * tumorSize, with an internal/100numerical scaling carried verbatim from the source$DESblock.
AUC_GCV (canonical for per-q12h-interval AUC of ganciclovir)
- Description: Time-varying ganciclovir AUC over a q12h dosing interval (AUC_0-12), used as the drug-exposure input to indirect-response viral-turnover PK/PD models of cytomegalovirus (CMV) viral load decline under (val)ganciclovir treatment. The Koloskoff 2025 source computes individual AUC_0-12 from an upstream popPK model (Franck 2021) and feeds it to the PD model as a Monolix “varying input”; the PD model itself does not integrate a PK ODE, so AUC_GCV is supplied to nlmixr2 as a time-varying data column.
-
Units:
mg*h/L(document per-model viacovariateData[[AUC_GCV]]$unitsif a different exposure unit is reported). - Type: continuous
- Scope: specific
-
Reference category: n/a – set to 0 in pre-treatment
/ off-treatment records so the drug-stimulation term
Emax * AUC_GCV / (EC50 + AUC_GCV)vanishes and the viral load returns to thekin / koutsteady-state baseline. -
Source aliases:
-
AUC_0-12– the printed variable name in Koloskoff 2025 (Methods Section 2.3, Eq. 1, and Table 1). Q24h dosing intervals are entered asAUC_0-24 / 2so all data live in the q12h framework (Koloskoff 2025 Methods Section 2.1).
-
-
Example models:
Koloskoff_2025_ganciclovir.R(Koloskoff 2025 indirect viral turnover model for CMV viral load in pediatric SOT / HSCT recipients; AUC_GCV enters the ODE viakout * (1 + Emax * AUC_GCV / (EC50 + AUC_GCV)) * viralLoad). -
Notes: Specific scope – the column meaning is tied
to ganciclovir as the drug and to a q12h interval-averaging convention.
Sibling drug-specific AUC canonicals (
AUC_CARBO,AUC_GEM,AUC_BAST_FW,AUC_PAZO) follow the sameAUC_<DRUG>naming pattern; a future PK/PD model that uses a different exposure metric for ganciclovir (e.g., trough concentration, instantaneous concentration) should register a parallel canonical rather than overloadAUC_GCV. Koloskoff 2025 Monte Carlo simulations are reported under AUC_0-24 (Tables 3 and 4) assuming AUC_0-24 = 2 x AUC_0-12 at steady state; nlmixr2 simulations should set AUC_GCV to the q12h-interval value (i.e., AUC_0-24 / 2).
AUC_PAZO (canonical for per-period mean AUC of pazopanib)
- Description: Per-period (per-dose-group in preclinical xenograft studies; per-subject mean dose-adjusted in clinical studies) mean AUC of pazopanib used as the drug-exposure covariate driving the antiangiogenic and cytotoxic effect rates in semi-mechanistic tumour-growth / angiogenesis-inhibition (TGI) models of pazopanib in renal-cell carcinoma. Time-varying step-wise (held constant within a treatment period and resetting when dose level changes or treatment ends).
-
Units:
ug*h/mL(= mg*h/L). Document per-model viacovariateData[[AUC_PAZO]]$units. - Type: continuous
- Scope: specific
-
Reference category: n/a – enters via power form
a = a0 * AUC_PAZO^e_auc_pazo_aandc = c0 * AUC_PAZO^e_auc_pazo_c. Set to 0 in periods where pazopanib is not administered; the model rate terms reduce to baseline drug-off (a, c -> 0 when AUC_PAZO -> 0) under that convention. -
Source aliases:
-
AUC– used inOuerdani_2015_pazopanib_mouse.RandOuerdani_2015_pazopanib.R(Ouerdani 2015 Methods Equations 2-3; preclinical values 220.2, 656.8, 1140.8 ugh/mL for the 10, 30, 100 mg/kg mouse dose groups; clinical mean 771.6 ugh/mL for 800 mg QD pazopanib in RCC patients, with per-subject values 629.4-802.4 ug*h/mL derived from an Emax fit to mean AUCs at the patient’s dose history).
-
-
Example models:
Ouerdani_2015_pazopanib_mouse.R(preclinical TGI in CAKI-2 xenograft mice; AUC enters asc = c0 * AUC_PAZO^0.332only – the cytotoxic exponente_auc_pazo_awas fixed to 0 because the in-mouse cytotoxic effect did not vary with exposure across the 10-100 mg/kg range),Ouerdani_2015_pazopanib.R(clinical TGI in RCC patients; AUC enters as botha = a0 * AUC_PAZO^0.125andc = c0 * AUC_PAZO^0.142). -
Notes: Specific scope because the column meaning is
tied to a particular drug (pazopanib) and the power-form
parameterisation that the Ouerdani 2015 model uses. Reusing the name for
a different tyrosine-kinase inhibitor is not appropriate – register a
sibling canonical (
AUC_SORAFfor sorafenib,AUC_SUNIfor sunitinib, etc.) when needed. The Ouerdani 2015 paper reports the preclinical AUCs inug*h/mLfrom a separate preclinical PK study (cited as the FDA Pharmacology Review for pazopanib NDA 022465); the clinical AUCs come from an Emax fit (Equation derived from Methods) to pooled mean AUCs at varying daily doses (5 mg to 2000 mg) across five prior pazopanib trials. Ratified canonically on 2026-05-12 alongside the Ouerdani 2015 pazopanib mouse and clinical extractions.
CLI (canonical for individual posthoc clearance from an upstream popPK fit)
- Description: Subject-specific empirical-Bayes (posthoc) total plasma clearance from a separately published population PK model that the current PD model treats as a fixed input. Used as a per-subject (time-fixed) covariate in PD-only models that derive a per-cycle exposure metric (e.g., AUC = DOSE / CLI) without instantiating a PK ODE.
-
Units: L/h (document per-model via
covariateData[[CLI]]$units). - Type: continuous
- Scope: specific
- Reference category: n/a – appears directly in derived-exposure expressions; not a covariate effect coefficient.
-
Source aliases:
-
CL– used in the Hansson 2013 sunitinib biomarker / TGI / fatigue PD-model family (DDMODEL00000197 and siblings, including DDMODEL00000222) and inSchindler_2016_sunitinib.R(DDMODEL00000221) as the posthoc CL column from the paper’s upstream 2-compartment popPK fit.
-
-
Example models:
Hansson_2013a_sunitinib.R(DDMODEL00000197; typical-value reference 32.819 L/h, drawn from the bundle’s simulated dataset for subject 1 – broadly consistent with Houk et al. 2010 typical sunitinib CL),Hansson_2013b_sunitinib.R(DDMODEL00000198; tumor growth inhibition variant; same per-subjectCLcolumn as Hansson 2013a/c),Hansson_2013c_sunitinib.R(DDMODEL00000222; uses a per-recordCLcolumn with subject-specific values 30-43 L/h in the bundle’s three-subject simulated dataset),Schindler_2016_sunitinib.R(DDMODEL00000221; per-subject post-hoc CL fed in as theCLcolumn, vignette uses 50 L/h literature-typical sunitinib CL/F per Houk 2010). -
Notes: Specific scope because the values are
intrinsically tied to a specific upstream popPK fit (sunitinib in this
case; another model adopting CLI would carry its own upstream-PK
lineage). Renamed from the source’s
CLcolumn becauseclis the canonical nlmixr2 PK parameter name (a parameter, not a data column). Each model’scovariateData[[CLI]]$notesshould cite the upstream popPK source (paper or DDMORE ID) and explain how to populate the column for new simulations (typically: simulate the upstream popPK first to obtain individual CL, or set every subject to the typical-value CL for typical-trajectory simulations). Distinct fromDOSE(current administered dose level) – the two columns jointly carry a per-cycle drug-exposure summary (AUC = DOSE / CLI) for PD-only models that consume posthoc PK from an upstream popPK fit instead of instantiating their own PK ODE.
CL_INDIV (canonical for per-subject empirical-Bayes drug clearance)
- Description: Individual point estimate of the modelled drug’s clearance, supplied per-subject as a fixed data column. Used in sequential PK->PD models where the PK structure has been fixed from a previously-published population PK analysis and the per-subject empirical-Bayes (POSTHOC) clearances are passed through to drive the PD layer rather than being re-estimated alongside the PD parameters.
-
Units: L/h (document per-model via
covariateData[[CL_INDIV]]$units). - Type: continuous
- Scope: specific
-
Reference category: n/a – used directly inside
model()asCL_INDIVin place of an estimatedcl <- exp(lcl + etalcl). -
Source aliases:
CLI– used inFriberg_2002_paclitaxel.R(NM-TRAN data column for per-subject paclitaxel CL). -
Example models:
Friberg_2002_paclitaxel.R. -
Notes: Specific scope because the value is
intrinsically tied to the modelled drug – there is no shared meaning
across drugs. Each model’s
covariateData[[CL_INDIV]]$notesshould state which upstream popPK source the EBE values come from (e.g., Henningsson 2001 paclitaxel popPK, fixed in the DDMORE encoding) and whether placebo periods are present. Companion volumes are registered asVC_INDIVandVP_INDIV.
CMAX_M1 (canonical for maximum drug plasma concentration during month 1)
- Description: Empirical-Bayes maximum plasma concentration (Cmax) reached during the first month (or the first cycle) of dosing for the modelled drug, used as a per-subject early-exposure covariate in PD / safety models that condition later-time outcomes on early exposure. Continuous, time-invariant per subject (set once from the month-1 / cycle-1 PK simulation).
-
Units: ng/mL (document per-model via
covariateData[[CMAX_M1]]$unitsif a different concentration unit is reported). - Type: continuous
- Scope: specific
-
Reference category: n/a – used as an additive logit
shift
(CMAX_M1 - CMAX_M1_REF) * theta(Girard 2012) or as a power scaling depending on the source. Reference value observed: 0 ng/mL (Girard 2012 setsMED17 = 0as the centering reference). -
Source aliases:
-
CMAXM1– used inGirard_2012_pimasertib.R.
-
-
Example models:
Girard_2012_pimasertib.R(additive logit shift on the cumulative-logit AE-score model:theta_cmaxm1 * CMAXM1). -
Notes: Specific scope because the value depends on
the upstream drug-specific population-PK model used to derive the
empirical-Bayes Cmax. Sibling of the existing
CAV(average dosing-interval concentration); both are derived exposure metrics fed into downstream PD / safety models. Document the upstream PK model incovariateData[[CMAX_M1]]$notesfor any future user.
VC_INDIV (canonical for per-subject empirical-Bayes central volume of distribution)
-
Description: Individual point estimate of the
modelled drug’s central volume of distribution, supplied per-subject as
a fixed data column. Companion to
CL_INDIVin sequential PK->PD encodings. -
Units: L (document per-model via
covariateData[[VC_INDIV]]$units). - Type: continuous
- Scope: specific
-
Reference category: n/a – used directly inside
model()asVC_INDIVin place of an estimatedvc <- exp(lvc + etalvc). -
Source aliases:
V1I– used inFriberg_2002_paclitaxel.R(NM-TRAN data column for per-subject paclitaxel V1). -
Example models:
Friberg_2002_paclitaxel.R. -
Notes: See
CL_INDIVnotes for the broader convention.
VP_INDIV (canonical for per-subject empirical-Bayes peripheral volume of distribution)
-
Description: Individual point estimate of the
modelled drug’s first peripheral volume of distribution, supplied
per-subject as a fixed data column. Companion to
CL_INDIVandVC_INDIVin sequential PK->PD encodings. -
Units: L (document per-model via
covariateData[[VP_INDIV]]$units). - Type: continuous
- Scope: specific
-
Reference category: n/a – used directly inside
model()asVP_INDIVin place of an estimatedvp <- exp(lvp + etalvp). -
Source aliases:
V2I– used inFriberg_2002_paclitaxel.R(NM-TRAN data column for per-subject paclitaxel V2). -
Example models:
Friberg_2002_paclitaxel.R. -
Notes: See
CL_INDIVnotes for the broader convention. For models requiring a second peripheral compartment, registerVP2_INDIV(and add a follow-on entry to this register) when a second model legitimately needs it.
BAS_SVEGFR3 (canonical for individual posthoc baseline soluble VEGFR-3 concentration from an upstream PD fit)
- Description: Subject-specific empirical-Bayes (posthoc) baseline plasma sVEGFR-3 (soluble vascular endothelial growth factor receptor 3) concentration from a separately published indirect-response biomarker model that the current downstream PD model treats as a fixed input. Used as a per-subject (time-fixed) covariate in fatigue / adverse-event PD models that consume the upstream biomarker dynamics as data covariates without instantiating the biomarker ODE.
-
Units: pg/mL (document per-model via
covariateData[[BAS_SVEGFR3]]$units). - Type: continuous
- Scope: specific
-
Reference category: n/a – appears directly inside
model()as the initial conditionsvegfr3(0) <- BAS_SVEGFR3and inside the relative-change driverbm = (svegfr3 - BAS_SVEGFR3) / BAS_SVEGFR3. -
Source aliases:
-
BAS3– used inHansson_2013b_sunitinib.R(DDMODEL00000198) andHansson_2013c_sunitinib.R(DDMODEL00000222) as the posthoc sVEGFR-3 baseline column from the paper’s upstream Hansson 2013a biomarker indirect-response fit (DDMODEL00000197).
-
-
Example models:
Hansson_2013b_sunitinib.R(DDMODEL00000198; tumor growth inhibition with sVEGFR-3 driven shrinkage),Hansson_2013c_sunitinib.R(DDMODEL00000222; bundle’s three-subject simulated dataset reports BAS_SVEGFR3 values 42554-57365 pg/mL). -
Notes: Specific scope because the value is
intrinsically tied to a specific upstream biomarker model (sVEGFR-3
indirect response under sunitinib in this case). The downstream fatigue
model only consumes individual posthoc baseline / MRT / EC50 of the
upstream biomarker; it does not re-fit them. Each model’s
covariateData[[BAS_SVEGFR3]]$notesshould cite the upstream biomarker-PD source (paper or DDMORE ID) and explain how to populate the column for new simulations (typically: simulate from the upstream biomarker model to obtain individual posthoc baselines, or set every subject to the typical-value baseline for typical-trajectory simulations).
MRT_SVEGFR3 (canonical for individual posthoc mean residence time of soluble VEGFR-3 from an upstream PD fit)
-
Description: Subject-specific empirical-Bayes
(posthoc) mean residence time of plasma sVEGFR-3 from a separately
published indirect-response biomarker model that the current downstream
PD model treats as a fixed input. Used as a per-subject (time-fixed)
covariate in fatigue / adverse-event PD models that consume the upstream
biomarker dynamics as data covariates without instantiating the
biomarker ODE; appears as
kout3 = 1 / MRT_SVEGFR3insidemodel(). -
Units: h (hours) – document per-model via
covariateData[[MRT_SVEGFR3]]$units. - Type: continuous
- Scope: specific
- Reference category: n/a – appears directly in derived rate-constant expressions; not a covariate effect coefficient.
-
Source aliases:
-
MRT3– used inHansson_2013b_sunitinib.R(DDMODEL00000198) andHansson_2013c_sunitinib.R(DDMODEL00000222) as the posthoc sVEGFR-3 MRT column from the paper’s upstream Hansson 2013a biomarker indirect-response fit (DDMODEL00000197).
-
-
Example models:
Hansson_2013b_sunitinib.R(DDMODEL00000198),Hansson_2013c_sunitinib.R(DDMODEL00000222; bundle’s three-subject simulated dataset reports MRT_SVEGFR3 values 313-408 h, broadly consistent with the Hansson 2013a typical sVEGFR-3 MRT of 401 h). -
Notes: Specific scope; same upstream-biomarker
dependency rationale as
BAS_SVEGFR3. The downstream fatigue model consumes the upstream MRT directly without re-fitting it.
EC50_SVEGFR3 (canonical for individual posthoc drug-effect EC50 on soluble VEGFR-3 from an upstream PD fit)
- Description: Subject-specific empirical-Bayes (posthoc) half-maximum-effect concentration of the modelled drug on the production of sVEGFR-3 (or, depending on the upstream model parameterization, on the analogous Imax inhibition pathway) from a separately published indirect-response biomarker model that the current downstream PD model treats as a fixed input. Used as a per-subject (time-fixed) covariate in fatigue / adverse-event PD models that consume the upstream biomarker dynamics as data covariates without instantiating the biomarker ODE.
-
Units: mgh/L (when the per-cycle drug-exposure
summary is
auc = DOSE / CLIin mgh/L, EC50_SVEGFR3 carries the same units; document per-model viacovariateData[[EC50_SVEGFR3]]$units). - Type: continuous
- Scope: specific
-
Reference category: n/a – appears directly inside
model()in the simple-Imax drug-effect termeff3 = auc / (EC50_SVEGFR3 + auc). -
Source aliases:
-
EC53– used inHansson_2013b_sunitinib.R(DDMODEL00000198) andHansson_2013c_sunitinib.R(DDMODEL00000222) as the posthoc sVEGFR-3 EC50 column from the paper’s upstream Hansson 2013a biomarker indirect-response fit (DDMODEL00000197).
-
-
Example models:
Hansson_2013b_sunitinib.R(DDMODEL00000198),Hansson_2013c_sunitinib.R(DDMODEL00000222; bundle’s three-subject simulated dataset reports EC50_SVEGFR3 values 1.0-2.8 mgh/L, consistent with the Hansson 2013a typical sVEGFR-3 IC50 typical value of 1.0 mgh/L). -
Notes: Specific scope; same upstream-biomarker
dependency rationale as
BAS_SVEGFR3. The downstream fatigue model consumes the upstream EC50 directly without re-fitting it.
BAS_SKIT (canonical for individual posthoc baseline soluble KIT concentration from an upstream PD fit)
- Description: Subject-specific empirical-Bayes (posthoc) baseline plasma sKIT (soluble stem cell factor receptor) concentration from a separately published indirect-response biomarker model that the current downstream PD model treats as a fixed input. Used as a per-subject (time-fixed) covariate in tumor-growth-inhibition / fatigue / adverse-event PD models that consume the upstream sKIT dynamics as data covariates without instantiating the biomarker ODE for sKIT in isolation.
-
Units: pg/mL (document per-model via
covariateData[[BAS_SKIT]]$units). - Type: continuous
- Scope: specific
-
Reference category: n/a – appears directly inside
model()as the initial condition for treated and placebo sKIT compartments and inside the relative-change driver(skit_pla - skit_drug) / skit_pla(or the analogous BAS_SKIT-denominated form when only one sKIT compartment is simulated). -
Source aliases:
-
SBAS– used inHansson_2013b_sunitinib.R(DDMODEL00000198) as the posthoc sKIT baseline column from the paper’s upstream Hansson 2013a biomarker indirect-response fit (DDMODEL00000197).
-
-
Example models:
Hansson_2013b_sunitinib.R(DDMODEL00000198; the Hansson 2013 e84 paper Table 2 reports a typical sKIT baseline of 39200 pg/mL with ~50% CV, matchingHansson_2013a_sunitinib’s typical value). -
Notes: Specific scope because the value is
intrinsically tied to a specific upstream biomarker model (sKIT indirect
response under sunitinib in this case). The downstream
tumor-growth-inhibition model only consumes individual posthoc baseline
/ MRT / EC50 / DP-slope of the upstream biomarker; it does not re-fit
them. Each model’s
covariateData[[BAS_SKIT]]$notesshould cite the upstream biomarker-PD source (paper or DDMORE ID) and explain how to populate the column for new simulations (typically: simulate from the upstream biomarker model to obtain individual posthoc baselines, or set every subject to the typical-value baseline for typical-trajectory simulations). Sister covariates:MRT_SKIT,EC50_SKIT,SLOPE_SKIT(companions for the same upstream biomarker fit).
MRT_SKIT (canonical for individual posthoc mean residence time of soluble KIT from an upstream PD fit)
-
Description: Subject-specific empirical-Bayes
(posthoc) mean residence time of plasma sKIT from a separately published
indirect-response biomarker model that the current downstream PD model
treats as a fixed input. Used as a per-subject (time-fixed) covariate in
tumor-growth-inhibition / fatigue / adverse-event PD models that consume
the upstream sKIT dynamics as data covariates without instantiating the
biomarker ODE for sKIT in isolation; appears as
kout_skit = 1 / MRT_SKITinsidemodel(). -
Units: h (hours) – document per-model via
covariateData[[MRT_SKIT]]$units. - Type: continuous
- Scope: specific
- Reference category: n/a – appears directly in derived rate-constant expressions; not a covariate effect coefficient.
-
Source aliases:
-
SMRT– used inHansson_2013b_sunitinib.R(DDMODEL00000198) as the posthoc sKIT MRT column from the paper’s upstream Hansson 2013a biomarker indirect-response fit (DDMODEL00000197).
-
-
Example models:
Hansson_2013b_sunitinib.R(DDMODEL00000198; the Hansson 2013 e84 paper Table 2 reports a typical sKIT MRT of 101 days = 2424 h, matchingHansson_2013a_sunitinib’s typical value of 2430 h). -
Notes: Specific scope; same upstream-biomarker
dependency rationale as
BAS_SKIT. The downstream tumor-growth-inhibition model consumes the upstream MRT directly without re-fitting it.
EC50_SKIT (canonical for individual posthoc drug-effect EC50 on soluble KIT from an upstream PD fit)
- Description: Subject-specific empirical-Bayes (posthoc) half-maximum-effect concentration of the modelled drug on the production of sKIT (or, depending on the upstream model parameterization, on the analogous Imax inhibition pathway) from a separately published indirect-response biomarker model that the current downstream PD model treats as a fixed input. Used as a per-subject (time-fixed) covariate in tumor-growth-inhibition / fatigue / adverse-event PD models that consume the upstream sKIT dynamics as data covariates without instantiating the biomarker ODE for sKIT in isolation.
-
Units: mgh/L (when the per-cycle drug-exposure
summary is
auc = DOSE / CLIin mgh/L, EC50_SKIT carries the same units; document per-model viacovariateData[[EC50_SKIT]]$units). - Type: continuous
- Scope: specific
-
Reference category: n/a – appears directly inside
model()in the simple-Imax drug-effect termeff_skit = auc / (EC50_SKIT + auc). -
Source aliases:
-
SEC5– used inHansson_2013b_sunitinib.R(DDMODEL00000198) as the posthoc sKIT EC50 column from the paper’s upstream Hansson 2013a biomarker indirect-response fit (DDMODEL00000197).
-
-
Example models:
Hansson_2013b_sunitinib.R(DDMODEL00000198; the Hansson 2013 e84 paper Table 2 reports a typical (common across all four biomarkers) IC50 of 1.0 mg*h/L, matchingHansson_2013a_sunitinib’s shared typical value). -
Notes: Specific scope; same upstream-biomarker
dependency rationale as
BAS_SKIT. The downstream tumor-growth-inhibition model consumes the upstream EC50 directly without re-fitting it.
SLOPE_SKIT (canonical for individual posthoc linear disease-progression slope on soluble KIT from an upstream PD fit)
-
Description: Subject-specific empirical-Bayes
(posthoc) linear disease-progression slope on the placebo / untreated
sKIT compartment from a separately published indirect-response biomarker
model that the current downstream PD model treats as a fixed input. Used
as a per-subject (time-fixed) covariate in tumor-growth-inhibition
models that need to simulate the placebo-arm sKIT trajectory as a
comparator for the drug-arm sKIT compartment; appears as
dps_skit = BAS_SKIT * (1 + SLOPE_SKIT * t)andkin_skit = dps_skit * kout_skitinsidemodel(). -
Units: 1/h (document per-model via
covariateData[[SLOPE_SKIT]]$units). - Type: continuous
- Scope: specific
- Reference category: n/a – appears directly in the placebo-arm Kin expression for sKIT; not a covariate effect coefficient on a structural rate.
-
Source aliases:
-
SLO– used inHansson_2013b_sunitinib.R(DDMODEL00000198) as the posthoc sKIT linear disease-progression slope column from the paper’s upstream Hansson 2013a biomarker indirect-response fit (DDMODEL00000197).
-
-
Example models:
Hansson_2013b_sunitinib.R(DDMODEL00000198; the Hansson 2013 e84 paper Table 2 reports a typical disease-progression slope of 0.0261/month shared between VEGF and sKIT, which equals approximately 3.5e-5/h, matchingHansson_2013a_sunitinib’s typical value). -
Notes: Specific scope; same upstream-biomarker
dependency rationale as
BAS_SKIT. Sign convention follows the upstream biomarker fit – positive slope means the placebo / natural-history sKIT trajectory drifts upward over time (capturing disease progression).
CSS_RBV (canonical for individual posthoc ribavirin steady-state plasma concentration from an upstream PK fit)
-
Description: Subject-specific empirical-Bayes
(posthoc) modal estimate of the ribavirin (RBV) steady-state plasma
trough concentration from a separately fitted population PK model (e.g.,
the Laouenan 2015 upstream RBV popPK fit) that the current downstream PD
model treats as a fixed input. Used together with
K_RBVinsidemodel()to reconstruct the individual ribavirin concentration time-course analytically asriba(t) = CSS_RBV * (1 - exp(-K_RBV * t)). -
Units: ng/mL (document per-model via
covariateData[[CSS_RBV]]$units). - Type: continuous
- Scope: specific
-
Reference category: n/a – appears directly inside
model()in the analytical RBV-concentration expressionriba = CSS_RBV * (1 - exp(-K_RBV * t))that drives the inhibition termriba / (riba + ec50). -
Source aliases:
-
css_mode(modal posterior estimate of individual ribavirin Css from the upstream popPK fit, in ng/mL) – used inLaouenan_2015_ribavirin.R(DDMODEL00000285). Renamecss_mode->CSS_RBVbefore passing the dataset torxSolve.
-
-
Example models:
Laouenan_2015_ribavirin.R(DDMODEL00000285; the bundle’sSimulated_Laouenant_2015_CPTPSP_hb_RBV.txtcarriescss_modevalues 2,400-4,000 ng/mL in the 15-subject ANRS-CO20-CUPIC cohort). -
Notes: Specific scope because the value is
intrinsically tied to a specific upstream popPK fit (ribavirin in
HCV-cirrhotic patients on triple therapy in this case) and to the
analytical
Css*(1-exp(-k*t))parameterization; another drug or another popPK parameterization would carry its own canonical. Each model’scovariateData[[CSS_RBV]]$notesshould cite the upstream popPK source (paper or DDMORE ID) and explain how to populate the column for new simulations (typically: simulate the upstream popPK first to obtain individual Css and approach-rate, or set every subject to the typical Css for typical-trajectory simulations). Companion column:K_RBV. Distinct fromCAV(dosing-interval-averaged concentration used in Emax / EC50 PD models with a single per-period exposure number) –CSS_RBVcarries the asymptotic steady-state value paired with the approach-rate constant, supporting the full time-course reconstruction.
K_RBV (canonical for individual posthoc ribavirin approach-to-steady-state rate constant from an upstream PK fit)
-
Description: Subject-specific empirical-Bayes
(posthoc) modal estimate of the first-order rate constant governing the
exponential approach of ribavirin plasma trough concentrations to steady
state, from the same upstream popPK fit that supplies
CSS_RBV. Used together withCSS_RBVinsidemodel()to reconstruct the individual ribavirin concentration time-course analytically asriba(t) = CSS_RBV * (1 - exp(-K_RBV * t)). -
Units: 1/day (document per-model via
covariateData[[K_RBV]]$units). - Type: continuous
- Scope: specific
-
Reference category: n/a – appears directly inside
the analytical RBV-concentration expression alongside
CSS_RBV. -
Source aliases:
-
k_mode(modal posterior estimate of individual ribavirin approach-to-Css rate constant from the upstream popPK fit, in 1/day) – used inLaouenan_2015_ribavirin.R(DDMODEL00000285). Renamek_mode->K_RBVbefore passing the dataset torxSolve.
-
-
Example models:
Laouenan_2015_ribavirin.R(DDMODEL00000285; the bundle’sSimulated_Laouenant_2015_CPTPSP_hb_RBV.txtcarriesk_modevalues 0.013-0.47 day^-1 across the 15-subject cohort, corresponding to approach-to-Css half-lives of 1.5-55 days). -
Notes: Specific scope; same upstream-PK-dependency
rationale as
CSS_RBV. Distinct from any structuralkelPK parameter –K_RBVis an apparent approach-to-Css rate from a lumped exponential parameterization of the trough time-course, not the elimination-rate constant of a one-compartment IV model (the lumped form absorbs absorption, distribution, and elimination into a single first-order rate). Companion column:CSS_RBV.
CP_MGL (canonical for instantaneous drug plasma concentration as a time-varying PD driver)
- Description: Instantaneous (per-event-record) plasma concentration of the modeled drug, supplied directly as a time-varying covariate column rather than computed from a coupled PK model. Used in PD-only myelosuppression / toxicity / response models that consume an upstream PK trajectory as input – typically because the source analysis was done as a sequential PK-then-PD fit, with the PK model fixed from a previously published popPK analysis (e.g., a Bruno-style docetaxel popPK feeding a Friberg / Kloft myelosuppression PD model).
-
Units: mg/L (= ug/mL; the two labels are
numerically equivalent and the canonical reporting convention for
cytotoxic-chemotherapy concentrations in Kloft 2006 / Friberg 2002
family analyses). Document per-model via
covariateData[[CP_MGL]]$unitsif a different unit is used. - Type: continuous
- Scope: specific
-
Reference category: n/a – typically enters as a
linear term in the drug-effect expression (e.g.,
drug = SL * CP_MGL, whereSLhas units of1/(mg/L)). Set to 0 for placebo periods or any time outside the drug-exposure window. Reference values observed: docetaxel typical Cmax ~3 mg/L after a 100 mg/m^2 1-hour IV infusion (Kloft 2006 / Netterberg 2017 simulated dataset). -
Source aliases:
-
CP(Kloft 2006 / Netterberg 2017 NM-TRAN $INPUT convention for “predicted drug concentration”; values in mg/L) – used inNetterberg_2017_docetaxel.R.
-
-
Example models:
Netterberg_2017_docetaxel.R(linear drug effect on the proliferation rate of the Friberg myelosuppression chain:(1 - SL * CP_MGL)withSL = 19.27 (mg/L)^-1after Kloft 2006’sTHETA(3)/808*1000MW-808 conversion; CP_MGL supplied per event row from an upstream docetaxel popPK simulation). -
Notes: Specific scope because the covariate’s
mechanistic meaning is bound to the modeled drug and the source paper’s
chosen PK input (e.g., a Bruno 1996/1998 docetaxel popPK trajectory for
Netterberg 2017). Distinct from
CAV(dosing-interval-averaged exposure used in Emax / EC50 PD models) –CP_MGLis the instantaneous concentration, sampled at every PD event time. When a future paper requires the same time-varying-PK-as-PD-input pattern for a different drug, register a drug-specific canonical (e.g.,CP_PACL_MGLfor paclitaxel) rather than overloading this name;CP_MGLretains the implicit “drug = the modeled drug under the PD analysis” semantics. When a paper supplies the time-varying PK as separate per-subject empirical-Bayes PK parameters (e.g.,CL_INDIV,VC_INDIV,VP_INDIV), use those columns in a coupled PK-PD ODE model (seeFriberg_2002_paclitaxel.R) rather than reducing toCP_MGL. The choice between PK-as-covariate (this canonical) and PK-as-EBE-parameters depends on whether the source paper’s NM-TRAN dataset shipped Cp directly or shipped the upstream individual PK parameters.
CP_OXY_NGML (canonical for instantaneous oxypurinol plasma concentration as a time-varying PD driver)
- Description: Instantaneous plasma concentration of oxypurinol (the active metabolite of allopurinol; xanthine oxidase inhibitor) supplied directly as a time-varying covariate column rather than computed from a coupled PK model. Used in semi-mechanistic uric-acid disposition models that take XOI exposure as input to the production-inhibition equation.
- Units: ng/mL
- Type: continuous
- Scope: specific
-
Reference category: n/a – enters as a Hill term in
the production-inhibition expression
1 - Rmax * CP_OXY_NGML / (CP_OXY_NGML + p50). Set to 0 outside the drug-exposure window or for non-XOI scenarios. Reference values observed: mean daily concentration on 300 mg/day allopurinol is approximately 10,000 ng/mL (Aksenov 2018, Eq. 13). -
Source aliases:
-
[P]_PIN(oxypurinol) – the symbol used in Aksenov 2018 Eq. 9 for the production-inhibitor concentration when the inhibitor is oxypurinol.
-
-
Example models:
Aksenov_2018_uricAcid.R(Hill-type production inhibition withrmax_oxy = 0.84andp50_oxy = 14000 ng/mLper Aksenov 2018 Table 1). -
Notes: Specific scope because the canonical name is
bound to oxypurinol; allopurinol’s PK is conventionally summarized via
the active metabolite oxypurinol (Day et al. 2007). Distinct from
CP_FBX_NGML(febuxostat) andCP_LSN_NGML(lesinurad). When the source paper supplies an upstream popPK for oxypurinol (e.g., Wright et al. 2013, Anzai & Endou 2012), the user simulates that PK to populate this column; otherwise a steady-state value can be used. Ratified canonically on 2026-05-08 alongside the Aksenov 2018 extraction.
CP_FBX_NGML (canonical for instantaneous febuxostat plasma concentration as a time-varying PD driver)
- Description: Instantaneous plasma concentration of febuxostat (xanthine oxidase inhibitor) supplied directly as a time-varying covariate column rather than computed from a coupled PK model. Used in semi-mechanistic uric-acid disposition models that take XOI exposure as input to the production-inhibition equation.
- Units: ng/mL
- Type: continuous
- Scope: specific
-
Reference category: n/a – enters as a Hill term in
the production-inhibition expression
1 - Rmax * CP_FBX_NGML / (CP_FBX_NGML + p50). Set to 0 outside the drug-exposure window or for non-XOI scenarios. Reference values observed: mean daily concentration on 40 mg/day febuxostat is approximately 1000-2000 ng/mL (Aksenov 2018, Bhattaram & Gobburu 2017 regulatory review). -
Source aliases:
-
[P]_PIN(febuxostat) – the symbol used in Aksenov 2018 Eq. 9 for the production-inhibitor concentration when the inhibitor is febuxostat.
-
-
Example models:
Aksenov_2018_uricAcid.R(Hill-type production inhibition withrmax_fbx = 1(fixed) andp50_fbx = 120 ng/mLfor hyperuricemic subjects (or 87 ng/mL for normouricemic subjects) per Aksenov 2018 Table 1). -
Notes: Specific scope; febuxostat-specific. The
p50parameter differs between hyperuricemic and normouricemic populations in Aksenov 2018;Rmaxis fixed at 1 per Bhattaram & Gobburu 2017. Distinct fromCP_OXY_NGML(oxypurinol) andCP_LSN_NGML(lesinurad). Ratified canonically on 2026-05-08 alongside the Aksenov 2018 extraction.
CP_LSN_NGML (canonical for instantaneous lesinurad plasma concentration as a time-varying PD driver)
- Description: Instantaneous plasma concentration of lesinurad (uricosuric URAT1 inhibitor) supplied directly as a time-varying covariate column rather than computed from a coupled PK model. Used in semi-mechanistic uric-acid disposition models that take uricosuric exposure as input to the fractional-excretion-increase equation.
- Units: ng/mL
- Type: continuous
- Scope: specific
-
Reference category: n/a – enters as a Hill term in
the fractional-excretion expression
FE = FE0 + Fmax * CP_LSN_NGML / (CP_LSN_NGML + p50). Set to 0 outside the drug-exposure window. Reference values observed: peak plasma concentration after single dose 200 mg lesinurad is approximately 6,000-9,000 ng/mL (Fleischmann et al. 2014; Shen et al. 2015). -
Source aliases:
-
[P]_RIN(lesinurad) – the symbol used in Aksenov 2018 Eq. 10 for the reabsorption-inhibitor concentration when the inhibitor is lesinurad.
-
-
Example models:
Aksenov_2018_uricAcid.R(Hill-type increase in fractional excretion withfmax_lsn = 0.56(fixed) andp50_lsn = 23000 ng/mLfor hyperuricemic subjects (or 11000 ng/mL for normouricemic subjects) per Aksenov 2018 Table 1). -
Notes: Specific scope; lesinurad-specific. The
p50parameter differs between hyperuricemic and normouricemic populations in Aksenov 2018;Fmaxwas fixed during estimation. Distinct fromCP_OXY_NGML(oxypurinol) andCP_FBX_NGML(febuxostat). Ratified canonically on 2026-05-08 alongside the Aksenov 2018 extraction.
CP_MORPH_NGML (canonical for instantaneous morphine plasma concentration as a time-varying PD driver)
- Description: Instantaneous plasma concentration of morphine supplied directly as a time-varying covariate column rather than computed from a coupled PK model. Used in PD-only IRT / latent-pain models that take morphine exposure as an external input to a concentration-effect equation.
- Units: ng/mL
- Type: continuous
- Scope: specific
-
Reference category: n/a – enters linearly into the
latent-pain equation
pain = pain_state - e_morph_pain * CP_MORPH_NGML + e_time_pain * time(Valitalo 2017). Reference values observed: most individual predicted concentrations in Valitalo 2017 were within 0-60 ng/mL (Figure 2a); the IRT linear morphine-effect slope is 0.0091 (ng/mL)^-1, so a 20 ng/mL morphine exposure reduces the latent pain by ~0.18 latent-variable units. -
Source aliases:
-
CP(Valitalo 2017 NM-TRAN $INPUT convention for “morphine plasma concentration”; values in ng/mL) – used inValitalo_2017_morphine.R.
-
-
Example models:
Valitalo_2017_morphine.R(linear morphine concentration-effect on the IRT latent pain variable; CP_MORPH_NGML supplied per event row from an upstream morphine popPK simulation, typicallyKnibbe_2009_morphine.R). -
Notes: Specific scope; morphine-specific. The
drug-specific naming follows the existing
CP_OXY_NGML/CP_FBX_NGML/CP_LSN_NGMLprecedent established with Aksenov 2018. Distinct from the broaderCP_MGL(mg/L PD-driver convention used in Netterberg 2017 docetaxel myelosuppression and similar) because the IRT PD models in this family use ng/mL natively. When a future morphine PD analysis uses mg/L, the conversion isCP_MORPH_NGML = CP_MORPH_MGL * 1000. Ratified canonically alongside the Valitalo 2017 morphine extraction (DDMODEL00000247).
STIM_QUININE_MM (canonical for applied quinine HCl dihydrate stimulus concentration in a brief-access taste aversion experiment)
- Description: Applied sipper-tube concentration of quinine HCl dihydrate (mM) directly presented to the rat during an 8-second brief-access taste aversion (BATA) trial. This is the experimental stimulus level that contacts the taste receptors in solution – not a systemic plasma concentration of an absorbed drug. Drives the sigmoid-Emax effect on the logistic mixing probability in Sheng 2016’s two-generalized-Poisson mixture model for bimodal lick-count data.
- Units: mM
- Type: continuous
- Scope: specific
-
Reference category: n/a – enters as a Hill term in
the logistic-Emax expression
E = E0 + Emax * STIM_QUININE_MM^c / (RIC50^c + STIM_QUININE_MM^c)(Sheng 2016, Methods, “Model for the drug effect”). Set to 0 for the water (control) presentation. Reference values observed: Sheng 2016 presented seven concentrations 0 / 0.01 / 0.03 / 0.1 / 0.3 / 1 / 3 mM (Table 1); the final-model RIC50 is 0.0423 mM. -
Source aliases:
-
QUININE(Sheng 2016 NM-TRAN $INPUT column name for the per-record applied stimulus concentration; values in mM).
-
-
Example models:
Sheng_2016_quinine_rat.R(sigmoid Emax on the logistic mixing probability between a low-count generalized-Poisson distribution and a right-truncated high-count generalized-Poisson distribution; STIM_QUININE_MM is the only model covariate). -
Notes: Specific scope because the canonical name is
bound to quinine HCl dihydrate as the bitter stimulus. Distinct from
CAV(systemic average drug plasma concentration over a dosing interval) and theCP_*family (instantaneous plasma concentration as time-varying PD driver) – STIM_QUININE_MM is the applied stimulus concentration in solution that contacts the taste receptors directly, with no PK absorption / distribution involved. Future BATA-style extractions with a different bitter compound (caffeine, denatonium, etc.) should register a parallel canonical (e.g.STIM_CAFFEINE_MM) rather than overload this name; theSTIM_<drug>_<units>pattern mirrors the establishedCP_<drug>_<units>precedent. Ratified canonically alongside the Sheng 2016 quinine BATA extraction.
FPG (canonical for baseline fasting plasma glucose)
-
Description: Fasting plasma glucose concentration
at baseline (or time-varying baseline-style observation; document
per-model). Distinct from
GLU(time-varying plasma glucose regressor input for mechanistic glucose-kinetics models). -
Units: mmol/L (or mg/dL – 1 mmol/L glucose is
approximately 18.02 mg/dL). Document per-model via
covariateData[[FPG]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with
linear-deviation form
1 + theta * (FPG - ref). Reference values observed: 8.90 mmol/L (Retlich 2015 popPK/PD linagliptin median fasting glucose at baseline). - Source aliases: none known.
-
Example models:
Retlich_2015_linagliptin.R(mmol/L, reference 8.90; linear-deviation effect on baseline DPP-4 activity BSL with coefficient 1.46 % per mmol/L deviation). -
Notes: Glycemic-control covariate (baseline FPG);
routinely reported alongside HbA1c in T2DM populations. Distinct from
GLUwhich is a time-varying within-subject glucose regressor for mechanistic glucose-kinetics models (Bizzotto 2016). Ratified canonically alongside the Retlich 2015 linagliptin extraction.
DPP4_BL_RFU (canonical for baseline plasma dipeptidyl peptidase-4 activity in relative fluorescence units)
- Description: Baseline plasma dipeptidyl peptidase-4 (DPP-4) enzymatic activity, measured by relative fluorescence units (RFU). DPP-4 is the pharmacological target of the gliptin (DPP-4-inhibitor) drug class; baseline activity correlates with circulating DPP-4 protein concentration in the central compartment and serves as a covariate on the central-compartment binding-site concentration in TMDD models for gliptins.
-
Units: RFU (assay-specific; document the assay in
covariateData[[DPP4_BL_RFU]]$notessince RFU values are not directly comparable across assays). - Type: continuous
- Scope: specific
-
Reference category: n/a – used with
linear-deviation form
1 + theta * (DPP4_BL_RFU - ref). Reference values observed: 12,497 RFU (Retlich 2015 popPK linagliptin median baseline), 11,600 RFU (Retlich 2015 popPK/PD linagliptin median baseline, applied to the individual-predicted BSL_i parameter on EC50). - Source aliases: none known.
-
Example models:
Retlich_2015_linagliptin.R(RFU, reference 12,497; linear-deviation effect on the central-compartment binding-site concentration Bmax,C with coefficient 0.00332 % per RFU deviation – captures the inter-individual correlation between baseline DPP-4 protein concentration and the apparent saturable-binding amplitude). -
Notes: Specific scope because DPP-4-activity values
in RFU are assay-specific and not directly transferable between studies
/ instruments. Future gliptin extractions reporting DPP-4 activity in
the same assay can reuse this canonical; extractions in absolute
enzymatic-rate units (pmol AMC per minute) or normalised units should
register a sibling canonical (e.g.,
DPP4_BL_PMOL_MIN). Ratified canonically alongside the Retlich 2015 linagliptin extraction.
GLU (canonical for plasma glucose time-course regressor)
-
Description: Plasma glucose concentration as a
time-varying regressor input that drives a mechanistic
glucose-kinetics model. Not a covariate that modifies a parameter; the
model integrates
GLUdirectly through a smoothing filter into a site-of-action glucose variable. - Units: mmol/L
- Type: continuous
- Scope: specific
-
Reference category: n/a – used as a time-varying
regressor. The model declares
linear(GLU)so rxode2 linearly interpolatesGLUbetween dataset rows. -
Source aliases:
-
iglu(glucose at the current row time) – used in the DDMORE bundle’sSimulated_glucoseKinetics.csvforDDMODEL00000227. Renameiglu->GLUbefore passing torxSolve.
-
-
Example models:
Bizzotto_2016_glucose.R(driving regressor for the glucose-at-site-of-action delay). -
Notes: Specific scope because
GLUis meaningful only for glucose-kinetics or glucose-PD models that take plasma glucose as an exogenous regressor. The DDMORE bundle’s hand-rolled piecewise-linear interpolation (GL = (t-T1)/(TOBS-T1)*(GLU-GLU1)+GLU1with bracketing columnsiglu / glun / td / tn) is replaced in nlmixr2 bylinear(GLU)declared inmodel(); the bracketing columns are not required.
INS (canonical for plasma insulin time-course regressor)
-
Description: Plasma insulin concentration as a
time-varying regressor input that drives a mechanistic
glucose-kinetics model. Not a covariate that modifies a parameter; the
model integrates
INSdirectly through a smoothing filter into a site-of-action insulin variable. - Units: pmol/L
- Type: continuous
- Scope: specific
-
Reference category: n/a – used as a time-varying
regressor. The model declares
linear(INS)so rxode2 linearly interpolatesINSbetween dataset rows. -
Source aliases:
-
iins(insulin at the current row time) – used in the DDMORE bundle’sSimulated_glucoseKinetics.csvforDDMODEL00000227. Renameiins->INSbefore passing torxSolve. -
INSU– used in the DDMORE bundle’sSimulated_ddmoremockdata2.txtforDDMODEL00000228. RenameINSU->INSbefore passing torxSolve.
-
-
Example models:
Bizzotto_2016_glucose.R(driving regressor for the insulin-at-site-of-action delay),NA_NA_paracetamol.R(DDMODEL00000228 OGTT model: drives the insulin-on-glucose-elimination first-order effect compartment viakie * (INS / 6.945 - effect_ins)). -
Notes: Specific scope because
INSis meaningful only for glucose-kinetics or insulin-PD models that take plasma insulin as an exogenous regressor. For drugs that modify circulating insulin as a downstream effect, use a different mechanism-specific name. The DDMORE bundle’s hand-rolled piecewise-linear interpolation (I = (t-T1)/(TOBS-T1)*(INS-INS1)+INS1with bracketing columnsiins / insn / td / tn) is replaced in nlmixr2 bylinear(INS)declared inmodel(); the bracketing columns are not required.
INS_BL (canonical for baseline (fasting) plasma insulin concentration)
- Description: Baseline (fasting) plasma insulin concentration, time-fixed per subject. Used as the per-subject anchor for steady-state insulin-driven processes (e.g., initial condition of an insulin-on-elimination effect compartment) and as the per-subject baseline-state insulin input to a baseline-glucose-production rate calculation.
-
Units: pmol/L (or uU/mL; document per-model via
covariateData[[INS_BL]]$units). The example model rescales viaINS_BL / 6.945to convert pmol/L to uU/mL. - Type: continuous
- Scope: specific
- Reference category: n/a.
-
Source aliases:
-
BASI(baseline insulin) – used in the DDMORE bundle’sSimulated_ddmoremockdata2.txtforDDMODEL00000228. RenameBASI->INS_BLbefore passing torxSolve.
-
-
Example models:
NA_NA_paracetamol.R(DDMODEL00000228 OGTT model: initialises the insulin-on-elimination effect compartmenteffect_ins(0) = INS_BL / 6.945and feeds the steady-state baseline-glucose-production rategpro = gss * (kg + kgi * INS_BL / 6.945) * vg * 180 / 1000). -
Notes: Distinct from
INS(time-varying regressor);INS_BLis a per-subject baseline-state anchor used in initial conditions and steady-state derived quantities, not the dynamic regressor itself. Specific scope because the conversion factor (1/6.945) and the rescaled-units interpretation are paper-specific; future extractions that report baseline insulin in mIU/L or pmol/L directly without rescaling can ratify the same canonical and document the per-model units / conversion incovariateData[[INS_BL]]$units/notes. Companion concept toFPG(baseline fasting plasma glucose).
IGE (canonical for serum total immunoglobulin E concentration)
- Description: Baseline serum total immunoglobulin E concentration (free IgE plus, in patients on anti-IgE therapy, omalizumab-IgE complex). For anti-IgE monoclonal antibodies (omalizumab, ligelizumab) IgE is the pharmacologic target; baseline IgE sets the magnitude of the target sink and modifies free-IgE clearance and the rate of IgE production in mechanism-based binding/turnover models.
-
Units: ng/mL (typical clinical-PK convention).
Pretreatment values reported in
IU/mLare converted via1 IU/mL = 2.42 ng/mL(Hayashi 2007 Methods). Document per-model viacovariateData[[IGE]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(IGE / ref)^exponent. Reference value observed: 482.4 ng/mL (Hayashi 2007 Japanese atopic-asthma cohort). -
Source aliases:
-
IgE0(baseline IgE concentration) – used inHayashi_2007_omalizumab.R.
-
-
Example models:
Hayashi_2007_omalizumab.R(ng/mL, reference 482.4; power exponents -0.281 on apparent CL of free IgE and +0.657 on apparent IgE production rate; also used as the initial value for the total-IgE state at t = 0). -
Notes: General scope because baseline serum total
IgE is a routine clinical-laboratory measurement, not a target tied to
one drug. In mechanism-based anti-IgE binding/turnover models the
in-model IgE state is a separate dynamic variable (
X_TE, in nmol or nmol/L) –IGEis the per-subject baseline column used for covariate scaling and (when applicable) state initialization, not the dynamic state itself. For models that use the alternative reporting unitIU/mL, multiply by 2.42 before applying the canonical-units (ng/mL) reference value, or document the per-model unit choice incovariateData[[IGE]]$unitsso downstream tooling can interpret the values correctly.
Count / Markov-feedback PD covariates
These columns are specific to count / Markov / time-to-event PD models that fit per-record event counts (e.g., daily or monthly seizure counts) with optional dependence on the previous-period count. Register names retain their source-paper conventions where those names are unambiguous and readable.
TRT_PHASE (canonical for double-blind active-treatment-phase indicator)
- Description: 1 = the record falls within the active double-blind treatment phase (drug + placebo effects are switched on); 0 = baseline / run-in / off-treatment (drug + placebo effects are zeroed).
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (baseline / off-treatment).
-
Source aliases:
-
Q2– used in the Schoemaker 2018 LEV / BRV pediatric extrapolation (DDMODEL00000239) as the treatment-phase gating multiplier on the combined placebo + drug-effect log-rate term (LE = LS0 + Q2 * LTRTE).
-
-
Example models:
Schoemaker_2018_levetiracetam.R(DDMODEL00000239). -
Notes: New canonical because the source name
Q2collides with the canonical PK parameterq2(inter-compartmental clearance to peripheral2) –q2could not be used as a covariate column without confusing source-trace lookups. Useful for any phase-gated PD model where placebo and drug effects are constrained to a specific study period; analogous to a “treatment-on” indicator.
PDV (canonical for previous-period observed seizure count)
- Description: Previous-period observed seizure count, supplied per-record as a covariate input to capture Markov dependence on the previous count. For daily-count datasets PDV is the observation on the immediately preceding day; for monthly (or other aggregate) records the previous-interval count would apply if used.
- Units: (count, integer)
- Type: count
- Scope: specific
-
Reference category: n/a – typically used as
<param> * PDV / (ES50 + PDV)(Markov-Hill) or analogous form. -
Source aliases:
PDV– used in the Schoemaker 2018 LEV/BRV pediatric extrapolation (DDMODEL00000239) and in the Ahn 2010 pregabalin Markov seizure-count model that the Schoemaker 2018 publication cites as the precursor structure. -
Example models:
Schoemaker_2018_levetiracetam.R(DDMODEL00000239). -
Notes: Specific scope because the column’s
interpretation (a Markov feedback state) is intrinsic to
count-likelihood Markov-feedback PD models. nlmixr2 / rxode2 cannot
natively express a Markov dependence of an observation on the
immediately preceding observation as a model state – supplying PDV as a
per-record data column is the operator-approved (sidecar response-001
Q2) way to preserve the published structure in this batch. For records
where the Markov term should not contribute (e.g., monthly counts in a
mixed daily / monthly cohort) the bundle convention is to set PDV = -99
as a sentinel; in
Schoemaker_2018_levetiracetam.Rthe model gates the Markov contribution onCHILD = 1so the sentinel is multiplied by 0 and is harmless.
NDAYS (canonical for number of days in the count-record interval)
-
Description: Number of days in the interval over
which the observed seizure count was tabulated. Multiplies the per-day
rate to give the expected count for the record (e.g.,
LAMB = exp(LE) * NDAYS). - Units: days
- Type: count
- Scope: general
- Reference category: n/a – appears as a multiplier on a per-day rate.
-
Source aliases:
NDAYS– used in the Schoemaker 2018 LEV/BRV pediatric extrapolation (DDMODEL00000239). -
Example models:
Schoemaker_2018_levetiracetam.R(DDMODEL00000239). - Notes: General scope because the count-interval-length concept is shared across any count or rate-based PD model that mixes record granularity (e.g., daily and monthly counts in the same dataset). For pure-daily or pure-monthly cohorts the column is constant; including it as a covariate keeps the model usable in mixed-granularity simulations.
MOMENT (canonical for endotracheal suctioning procedural state)
- Description: Procedural state at the time of the pain / distress assessment in invasive-ventilation studies, used to select between separate pre-procedure / intra-procedure / post-procedure baseline parameters. Coded 1 = before suctioning, 2 = during suctioning, 3 = after suctioning.
- Units: (categorical, 3 levels)
- Type: categorical
- Scope: specific
- Reference category: 1 (before suctioning).
-
Source aliases:
MOMENT– used in the Valitalo 2017 IRT morphine PD model (DDMODEL00000247). -
Example models:
Valitalo_2017_morphine.R(DDMODEL00000247; selects between the three baseline pain typical valuespresuct/suct/aftsuctand the matching 3x3 correlated etas). - Notes: Specific scope because the column’s meaning is tied to a particular study procedure (endotracheal suctioning during mechanical ventilation in neonates). The Simons 2003 cohort that Valitalo 2017 re-analysed scheduled pain assessments around suctioning events, so MOMENT changes within-subject at each scheduled assessment. Ratified canonically alongside the Valitalo 2017 morphine extraction.
ITEM (canonical for pain-assessment item identifier in IRT graded-response models)
- Description: Identifier of the specific pain-assessment item being scored at each observation row, used to dispatch between the IRT discrimination / difficulty parameter sets in a graded-response model. Valitalo 2017 coding: 1 = COMFORT-B alertness; 2 = COMFORT-B calmness/agitation; 3 = COMFORT-B respiratory response; 5 = COMFORT-B body movement; 7 = COMFORT-B facial tension; 12 = VAS (cm, range 0-10); 25 = PIPP brow bulge; 26 = PIPP eye squeeze; 27 = PIPP nasolabial furrow; 28 = NIPS total.
- Units: (categorical, 9-10 levels depending on cohort)
- Type: categorical
- Scope: specific
- Reference category: n/a – selects per-item parameter sets rather than acting as a reference contrast.
-
Source aliases:
ITEM– used in the Valitalo 2017 IRT morphine PD model (DDMODEL00000247). -
Example models:
Valitalo_2017_morphine.R(DDMODEL00000247; switches the IRT graded-response discrimination / difficulty parameters per row). -
Notes: Specific scope because the integer-to-item
mapping is tied to the Valitalo 2017 NM-TRAN dataset’s coding. Other IRT
graded-response models in the library (when they are added) may use
different integer codings and should register their own canonical (e.g.,
ITEM_<study>) when the codings collide. The COMFORT-B “muscle tension” item is omitted from the Valitalo 2017 coding because it could not be assessed from video recordings.
OBSTYPE (canonical for VAS observer type)
- Description: Observer type for visual-analogue-scale pain assessments: 1 = investigator (video-based), 2 = bedside nurse. Used to select between observer-specific VAS difficulty / discrimination THETAs and between observer-specific residual-error SDs.
- Units: (categorical, 2 levels)
- Type: categorical
- Scope: specific
- Reference category: 1 (video investigator).
-
Source aliases:
OBSTYPE– used in the Valitalo 2017 IRT morphine PD model (DDMODEL00000247). -
Example models:
Valitalo_2017_morphine.R(DDMODEL00000247; selects betweendiff_vas_videovsdiff_vas_bedside,discr_vas_videovsdiscr_vas_bedside, andaddSd_vas_videovsaddSd_vas_bedside). - Notes: Specific scope because the binary observer-coding is tied to the Valitalo 2017 NM-TRAN dataset. Future IRT models with a different observer split (e.g., parent / nurse / physician) should register their own canonical rather than overloading this 2-level coding.
Race / ethnicity
Canonical pattern: RACE_<GROUP>.
Use one indicator per race/ethnicity group the source models. Reference
category is the implicit 0 = all other groups; document explicitly which
groups are in the reference. When the source uses composite groups
(e.g., “Black or Other”), name them accordingly
(RACE_BLACK_OTHER) and list the components in
notes. The base RACE_<GROUP> indicators
are scope: general; composite groupings are scope: specific because the
grouping is tied to the study’s analysis plan.
RACE_BLACK (canonical for Black / African American race indicator)
- Description: 1 = Black / African American, 0 = other.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (document the actual reference groups used).
-
Source aliases:
-
BLACK– used inHu_2026_clesrovimab.R,Robbie_2012_palivizumab.R.
-
-
Example models:
Zhu_2017_lebrikizumab.R(canonical form),Robbie_2012_palivizumab.R.
RACE_WHITE (canonical for White race indicator)
- Description: 1 = White, 0 = non-White.
- Units: (binary)
- Type: binary
- Scope: general
-
Reference category: 0 (non-White; complement
composition depends on the source paper, typically pooling Black/African
American, Asian, American Indian/Alaska Native, Native Hawaiian/Pacific
Islander, Other, Not reported, Unknown). Some papers (e.g., Hu 2014)
instead use the Caucasian (RACE_WHITE = 1) subgroup as the typical-value
reference; the column encoding is unchanged but the model implements the
effect on
(1 - RACE_WHITE). -
Source aliases:
-
RACE(with values1 = White / 0 = non-White) – used inLin_2024_casirivimab.R. Source column nameRACEis generic; the canonical name is intentionally explicit because some other models useRACEfor a different dichotomy. -
RACE(Caucasian-vs-non-Caucasian dichotomy as named in Hu 2014 Table 2) – used inHu_2014_bapineuzumab.R. Same canonical column name and 1 = White / 0 = non-White encoding; the typical-value reference is the Caucasian subgroup, so the model implements the 15% non-Caucasian effect on(1 - RACE_WHITE).
-
-
Example models:
Lin_2024_casirivimab.R(multiplicative fractional change on CL relative to non-White reference),Hu_2014_bapineuzumab.R(multiplicative 15% increase in CL for non-Caucasian relative to Caucasian reference). -
Notes: Used by papers that dichotomize race as
White vs. non-White rather than decomposing into separate group
indicators. Sign and reference-category interpretation are inverted
relative to
RACE_BLACK/RACE_ASIAN/ etc.; do NOT combineRACE_WHITEwith the decomposed indicators in the same model. The model’s typical-value reference category (which subgroup gets the unmodifiedlcl/lvc) varies between papers – Lin 2024 uses non-White as the reference, Hu 2014 uses Caucasian (White) as the reference; both share the same canonical column encoding.
RACE_BLACK_OTH (canonical for composite Black/Other group)
- Description: 1 = Black/African American or Other race, 0 = other groups.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 = White or Native Hawaiian/Pacific Islander (Clegg 2024 grouping).
-
Source aliases:
BLACK_OTH– used inClegg_2024_nirsevimab.R. -
Example models:
Clegg_2024_nirsevimab.R. -
Notes: Kept distinct from
RACE_BLACKbecause the composite is not interchangeable.
RACE_ASIAN (canonical for Asian race indicator)
- Description: 1 = Asian, 0 = other.
- Units: (binary)
- Type: binary
- Scope: general
-
Source aliases:
ASIAN– used inHu_2026_clesrovimab.R,Robbie_2012_palivizumab.R,Fau_2020_isatuximab.R.RAAS(race-Asian-vs-other indicator as named in Bajaj 2017 Table 1) – used inBajaj_2017_nivolumab.R.RACEN(race-numeric indicator as named in Lu 2019 / Shi 2020 NONMEM control stream; ASIAN = 1 if RACEN == 1) – used inLu_2019_polatuzumab.R. -
Example models:
Zhu_2017_lebrikizumab.R(canonical form),Robbie_2012_palivizumab.R,Bajaj_2017_nivolumab.R,Fau_2020_isatuximab.R,Lu_2019_polatuzumab.R(multiplicative factore_asian_vc = 0.929on acMMAE Vc, i.e., 7.1% lower V1 in Asian patients; verbatim Shi 2020 (PMID 32770353) ethnicity-sensitivity re-quote of the Lu 2019 popPK Asian-race covariate).
RACE_ASIAN_AMIND_OTH (canonical for composite Asian / American Indian / Other group)
-
Description: 1 = Asian, American Indian / Alaska
Native, or Other race; 0 = White or Black. Composite indicator that
pools the smaller-N race groups in a population dominated by White and
Black subjects, with White + Black serving as the reference category.
Distinct from
RACE_ASIAN_AMIND_MULTI(Clegg 2024 grouping that includes Multiracial; pooled against a different reference), fromRACE_ASIAN_OTH(within-Asian-population sub-indicator), and fromRACE_BLACK_OTH(different composite). - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 = White or Black (the larger-N pooled group used as the reference in the source paper).
- Source aliases: none formally; Frey 2013’s NONMEM control stream uses the inline race classification rather than a separate named column.
-
Example models:
Frey_2013_tocilizumab.R(multiplicative fractional effect on the DAS28 first-order loss rate Kout:Kout * (1 - 0.25 * RACE_ASIAN_AMIND_OTH)– Kout is 25% lower in the Asian/AmInd/Other composite group relative to the White+Black reference). -
Notes: Specific scope because the composite
grouping is defined by the source paper’s analysis plan rather than by a
uniform external standard. Do not combine with the decomposed
RACE_ASIAN,RACE_OTHER, etc. indicators in the same model; the composite indicator is mutually exclusive with the decomposition. Ratified canonically on 2026-04-29 in support of the Frey 2013 tocilizumab DAS28 PKPD model. Frey 2013 uses TWO distinct race covariates: thisRACE_ASIAN_AMIND_OTHindicator on Kout (DAS28-PD-side; pools Asian + AmInd + Other vs White+Black) AND the within-AsianRACE_ASIAN_OTHindicator on CL (PK-side; isolates the “Other Asian” subgroup within the Asian-only cohort).
RACE_ASIAN_AMIND_MULTI (canonical for composite group)
- Description: 1 = Asian, American Indian / Alaskan Native, or Multiple races, 0 = other.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 = White, Black / African American, Native Hawaiian / Pacific Islander, or Other (Clegg 2024 grouping).
-
Source aliases:
ASIAN_AMIND_MULTI– used inClegg_2024_nirsevimab.R. -
Example models:
Clegg_2024_nirsevimab.R. - Notes: Clegg 2024 applies this covariate to both CL and V2 with different coefficients.
RACE_ASIAN_OTH (canonical for Asian-other composite race indicator)
- Description: 1 = subject self-identifies as Asian-other (Asian heritage outside the locally-dominant Asian subgroup, e.g. non-Chinese in a Chinese-dominant cohort, or “Other Asian” as a study-defined catch-all category). 0 = otherwise. Reference category is the cohort’s dominant race grouping (typically Chinese or White, depending on the study).
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0. Document the dominant
subgroup explicitly in
covariateData[[RACE_ASIAN_OTH]]$notesfor every model that uses this covariate. - Source aliases: none.
-
Example models:
Brown_2017_osimertinib.R(paper’s “Asian (not Japanese or Chinese)” composite indicator with linear effect on apparent clearance of the AZ5104 metabolite; reference category Caucasian). -
Notes: Distinct from
RACE_ASIAN_AMIND_MULTI(a 4-way composite of Asian + American Indian + Multiple Races),RACE_ASIAN_AMIND_OTH(a 3-way Asian + AmInd + Other composite against a White+Black reference, used in Frey 2013), andRACE_BLACK_OTH(different composite).RACE_ASIAN_OTHis a within-Asian-population sub-indicator, not a multi-race composite. Operator decision (2026-04-28): kept separate fromRACE_ASIANbecause the paper’s “Other Asian” category is its own grouping, not an alias of “Asian (any)”. Brown 2017 uses Caucasian (not Chinese) as the dominant reference.
RACE_NEAS (canonical for North East Asian composite race indicator)
-
Description: 1 = North East Asian heritage
(worldwide Chinese, Japanese, or Korean), 0 = non-North East Asian.
Composite indicator analogous to
RACE_ASIANbut specifically restricted to the East Asian subgroup most-relevant to ICH E5 ethnic-sensitivity / Asian-region bridging analyses. - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (any non-North East Asian race, including South / Southeast Asian, White, Black, etc.).
-
Source aliases:
-
RAC4– used inZhou_2021_belimumab.R(Zhou 2021 Table 2 footnote d).
-
-
Example models:
Zhou_2021_belimumab.R(multiplicative factor 1.07 on V1). -
Notes: Distinct from the broader
RACE_ASIAN(which can include South / Southeast Asian populations) because Zhou 2021 specifically tested whether Chinese/Japanese/Korean patients had different PK from the rest of the dataset; the analysis explicitly comparedRAC4(North East Asian) against alternative race definitions and choseRAC4by AIC.
RACE_MULTI (canonical for multiracial indicator)
- Description: 1 = multiracial, 0 = other.
- Units: (binary)
- Type: binary
- Scope: general
-
Source aliases:
MULTIRACIAL– used inHu_2026_clesrovimab.R. -
Example models:
Hu_2026_clesrovimab.R.
RACE_OTHER (canonical for race-category ‘Other’ indicator)
- Description: 1 = race category “Other,” 0 = not.
- Units: (binary)
- Type: binary
- Scope: general
-
Source aliases:
-
OTHER– used inRobbie_2012_palivizumab.R.
-
-
Example models:
Zhu_2017_lebrikizumab.R,Robbie_2012_palivizumab.R.
RACE_HISPANIC (canonical for Hispanic / Latino ethnicity indicator)
- Description: 1 = Hispanic / Latino, 0 = non-Hispanic. Used by papers that report Hispanic as a separate category alongside Black, Asian, and Other rather than as a distinct ethnicity dimension.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (non-Hispanic; document the paper-specific reference race composition per-model).
-
Source aliases:
-
HISPANIC– used inRobbie_2012_palivizumab.R.
-
-
Example models:
Robbie_2012_palivizumab.R(fractional effect on CL; additional effect on Vc). -
Notes: In the US Office-of-Management-and-Budget
(OMB) classification Hispanic is an ethnicity rather than a race, but
clinical PK analyses frequently treat it as one of the race indicators.
When a paper treats Hispanic as a race, use this column; otherwise
encode ethnicity separately. Register-wise, this follows the
RACE_<GROUP>indicator-decomposition pattern.
RACE_JAPANESE (canonical for Japanese-heritage race indicator)
- Description: 1 = Japanese heritage, 0 = non-Japanese. Used when Japanese subjects form a distinct subgroup in the study design (e.g., ICH E5 bridging analyses or studies with a dedicated Japanese healthy-volunteer cohort).
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (non-Japanese).
-
Source aliases:
-
JAPANESE_HV– used inWang_2017_benralizumab.R(Japanese healthy-volunteer cohort indicator; the healthy-volunteer vs. asthma-patient distinction is captured separately, not in this covariate).
-
-
Example models:
Wade_2015_certolizumab.R(multiplicative fractional effect on V/F; Wade 2015 breaks Japanese [RACE.EQ.8] out separately from RACE_ASIAN),Wang_2017_benralizumab.R(multiplicative factor 1.34 on Vc). -
Notes: Distinct from
RACE_NEAS(North East Asian composite, includes Chinese, Japanese, and Korean) and fromRACE_ASIAN. UseRACE_JAPANESEonly when the source paper breaks out Japanese heritage as its own indicator; do not aggregate with other Asian groups when the paper keeps them separate. Ratified canonically on 2026-04-26.
RACE_CHINESE (canonical for Chinese-heritage race indicator)
- Description: 1 = Chinese heritage, 0 = non-Chinese. Used when Chinese subjects form a distinct subgroup alongside Japanese, Asian-other, and other race categories (e.g., multiregional oncology trials enrolling Chinese, Japanese, and other Asian cohorts as separate strata).
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (non-Chinese; document the paper-specific reference race composition per-model).
-
Source aliases: none formally; source NONMEM
control streams typically use ad-hoc names (e.g.,
CHINESE,RACE.EQ.X). -
Example models:
Brown_2017_osimertinib.R(linear additive effect(1 + 0.17 * RACE_CHINESE)on apparent clearance of the AZ5104 metabolite; reference category Caucasian). -
Notes: Distinct from
RACE_NEAS(North East Asian composite, includes Chinese, Japanese, and Korean) and fromRACE_ASIAN. UseRACE_CHINESEonly when the source paper breaks out Chinese heritage as its own indicator alongsideRACE_JAPANESEandRACE_ASIAN_OTH; do not aggregate with other Asian groups when the paper keeps them separate. Parallels the establishedRACE_JAPANESEentry. Ratified canonically on 2026-05-09.
Geographic / enrollment-country indicators
Geographical study-site region indicators. Distinct from race /
ethnicity (RACE_*), which describe subject ancestry; these
describe the geographical location of the clinical trial site that
enrolled the subject. Used in multi-regional studies (typically those
including bridging analyses for Japan or East Asia) to capture
region-specific clinical-practice or unmeasured-environment effects on
PK that remain after accounting for body weight, race, and laboratory
covariates. Encoded as a set of mutually exclusive binary indicators
with US as the implicit reference category (all indicators = 0). When a
paper groups some non-US regions with US (e.g., Hong 2025 groups US and
Japan as the DXd CL reference), the model code uses only the indicators
that distinguish the non-reference groups; the data column for the
grouped region (e.g., REGION_JAPAN) is still recorded so
the same dataset can serve other parameters that do separate that
group.
REGION_JAPAN (canonical for Japan study-site / enrollment-country indicator)
- Description: 1 = study site in Japan (or patient enrolled in Japan, depending on source paper’s reporting), 0 = study site / enrollment country outside Japan. Geographical Japan indicator.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-Japan sites / enrollment; specific reference set varies per model – e.g., Hong 2025 Dato-DXd CL uses “any non-Japan region”, whereas Hong 2025 DXd CL groups Japan with US into the reference).
-
Source aliases:
-
COUNTRY_JPN– retired canonical; used inYin_2021_trastuzumabDeruxtecan.Ras an enrollment-country (not study-site-region) indicator. Some papers report country-of-enrollment rather than site region; both map toREGION_JAPANwhen the binary contrast is Japan vs. non-Japan.
-
-
Example models:
Hong_2025_datopotamab.R(multiplicative effect 1 + (-0.219) = 0.781 on Dato-DXd linear clearance),Yin_2021_trastuzumabDeruxtecan.R(multiplicative effect 0.903 on CL_intact and 0.738 on V2_intact when REGION_JAPAN = 1; Yin 2021 retained Japan enrollment-country over Japanese race because the two were highly confounded, correlation -0.81). -
Notes: Distinct from
RACE_JAPANESE(subject ancestry). A subject of Japanese ancestry enrolled at a US site hasRACE_JAPANESE = 1butREGION_JAPAN = 0. Some papers (e.g., Yin 2021) report enrollment country rather than study-site region; both are encoded asREGION_JAPANwhen the binary contrast is Japan vs. non-Japan. Paired withREGION_EUROPEandREGION_ROWin multi-regional studies (e.g., Hong 2025);REGION_JAPAN = 0for a US-only cohort.
REGION_EUROPE (canonical for Europe study-site indicator)
- Description: 1 = study site in Europe, 0 = study site outside Europe. Geographical study-site region.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-Europe study sites; specific reference set varies per model).
- Source aliases: none yet; canonical name preferred.
-
Example models:
Hong_2025_datopotamab.R(multiplicative effect 1 + 0.240 = 1.240 on DXd clearance versus US/Japan reference),Naik_2016_vortioxetine.R(additive intercept-shift form: TVCL_EU = 39 L/hr is the typical CL/F whenREGION_EUROPE = 1, versus USA reference TVCL = 51 L/hr). -
Notes: Pair with
REGION_JAPANandREGION_ROWto encode multi-regional study membership; subjects with all three indicators = 0 are in the “US” reference group.
REGION_ROW (canonical for Rest-of-World study-site indicator)
- Description: 1 = study site in Rest of World (i.e., not US, Japan, or Europe), 0 = study site in US / Japan / Europe. Residual-region indicator for multi-regional studies.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (US / Japan / Europe study sites; specific reference set varies per model).
-
Source aliases:
-
REGION_RoW– mixed-case variant in some source publications (e.g., Naik 2016).
-
-
Example models:
Hong_2025_datopotamab.R(multiplicative effect 1 + 0.196 = 1.196 on DXd clearance versus US/Japan reference),Naik_2016_vortioxetine.R(additive intercept-shift form: TVCL_RoW = 38 L/hr is the typical CL/F whenREGION_ROW = 1, versus USA reference TVCL = 51 L/hr; the RoW group for Naik 2016 spans study sites in Canada, Australia, and Asia). -
Notes: “Rest of the World” composition is
paper-specific (e.g., Hong 2025 = study sites outside US, Japan, and
Europe; Naik 2016 = Canada, Australia, and Asia). Document the subject
set in
covariateData[[REGION_ROW]]$notesper model.
REGION_MOZAMBIQUE (canonical for Mozambique study-site / enrollment-country indicator)
- Description: 1 = study site in Mozambique, 0 = otherwise. Country-level study-site indicator used in multi-country sub-Saharan African trials of intermittent preventive treatment of malaria in pregnancy (IPTp).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-Mozambique sites; specific reference set varies per model – in de Kock 2017 the reference is Mali).
- Source aliases: none yet; canonical name preferred.
-
Example models:
deKock_2017_sulfadoxinePyrimethamine.R(multiplicative -20.2% effect on apparent pyrimethamine clearance; +57.6% scaling on observed pyrimethamine concentrations; +21.2% scaling on observed sulfadoxine concentrations; the on-observation scalings capture residual site-specific differences in apparent bioavailability or dried-blood-spot sample handling). -
Notes: Specific scope because the canonical models
a single Mozambique vs non-Mozambique contrast within a specific
multi-site trial; the reference set is paper-specific (e.g., Mali in de
Kock 2017). Pairs with
REGION_SUDANandREGION_ZAMBIAfor a 4-country sub-Saharan African IPTp trial design with Mali as the implicit reference. Ratified canonically on 2026-05-18.
REGION_SUDAN (canonical for Sudan study-site / enrollment-country indicator)
- Description: 1 = study site in Sudan, 0 = otherwise. Country-level study-site indicator used in multi-country sub-Saharan African trials.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-Sudan sites; specific reference set varies per model – in de Kock 2017 the reference is Mali).
- Source aliases: none yet; canonical name preferred.
-
Example models:
deKock_2017_sulfadoxinePyrimethamine.R(+15.5% scaling on observed sulfadoxine concentrations; +33.2% scaling on observed pyrimethamine concentrations). -
Notes: Specific scope; pairs with
REGION_MOZAMBIQUEandREGION_ZAMBIA. Ratified canonically on 2026-05-18.
REGION_ZAMBIA (canonical for Zambia study-site / enrollment-country indicator)
- Description: 1 = study site in Zambia, 0 = otherwise. Country-level study-site indicator used in multi-country sub-Saharan African trials.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-Zambia sites; specific reference set varies per model – in de Kock 2017 the reference is Mali).
- Source aliases: none yet; canonical name preferred.
-
Example models:
deKock_2017_sulfadoxinePyrimethamine.R(-24.8% scaling on observed sulfadoxine concentrations; -5.4% scaling on observed pyrimethamine concentrations). -
Notes: Specific scope; pairs with
REGION_MOZAMBIQUEandREGION_SUDAN. Ratified canonically on 2026-05-18.
Pediatric comorbidities
CLD_PREM (canonical for chronic lung disease of prematurity)
- Description: 1 = chronic lung disease of prematurity (bronchopulmonary dysplasia, BPD), 0 = no CLD. Time-fixed per subject (diagnosis at study entry).
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no CLD of prematurity).
-
Source aliases:
-
CLD– used inRobbie_2012_palivizumab.R. -
BPD– bronchopulmonary-dysplasia shorthand.
-
-
Example models:
Robbie_2012_palivizumab.R(fractional +20% effect on CL). - Notes: Standard pediatric / neonatology comorbidity flag; ties to palivizumab’s label population (high-risk preterm infants) and may re-appear in future pediatric mAb PK analyses (RSV, parenteral nutrition, etc.).
Comorbidities
DIAB (canonical for diabetes-mellitus comorbidity indicator)
- Description: 1 = patient has diabetes mellitus comorbidity (Type 1 or Type 2 not distinguished), 0 = no diabetes comorbidity. Time-fixed at study entry per subject.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no diabetes comorbidity).
-
Source aliases:
-
DIAB– used inChen_2022_guselkumab.R. -
Diabetes– used inSherer_2012_AAA.R(Sherer 2012 Methods page 2 symbol “Diabetes”).
-
-
Example models:
Chen_2022_guselkumab.R(multiplicative effect on CL/F: 1.15^DIAB, +15% in patients with diabetes).-
Chen_2022_guselkumab.R(multiplicative effect on CL/F: 1.15^DIAB, +15% in patients with diabetes). -
Sherer_2012_AAA.R(additive shift on the first derivative of AAA growth rate with size beta2:e_diab_b2 = -0.32/yearfor diabetics; cohort prevalence 14%).
-
-
Notes: Captures pre-existing diabetes mellitus as a
comorbidity in non-diabetes-primary indications (e.g., psoriatic
arthritis, psoriasis, vascular disease). Distinct from a primary
disease-state indicator like
DIS_UC. Type 1 vs Type 2 mellitus is not separated unless the source paper distinguishes them; in pooled-population PK analyses, the covariate is typically a single binary flag derived from medical history. Diabetic patients tend to have higher inflammation and altered IgG turnover, which can manifest as modest changes in monoclonal-antibody clearance. In vascular populations (Sherer 2012) diabetes is associated with slower AAA growth, possibly via aberrant monocyte-matrix interactions (Golledge 2008 mechanism cited in Sherer 2012 Discussion).
T2DM (canonical for type-2-diabetes-mellitus-specific indicator)
- Description: 1 = patient has Type-2 diabetes mellitus specifically; 0 = normal-glucose-tolerance control (or other reference cohort pooled in the source analysis). Time-fixed at study entry per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (normal-glucose-tolerance control).
-
Source aliases:
-
T2DM– used inNA_NA_paracetamol.R(DDMODEL00000228). -
T2D– used inGuiastrennec_2016_gastric_emptying.R(matched-cohort flag, 1 = T2D patient vs 0 = matched nondiabetic control).
-
-
Example models:
NA_NA_paracetamol.R(DDMODEL00000228).-
NA_NA_paracetamol.R(DDMODEL00000228). -
Guiastrennec_2016_gastric_emptying.R(multiplicative -81.1% depression of POTcarbC, the carbohydrate potency on CCK release; all other parameters are common across cohorts).
-
-
Notes: Distinct from the existing
DIABcanonical (which deliberately does not distinguish Type 1 vs Type 2). Specific scope because the reference cohort is study-specific and the mechanism in the example model is a Type-2-versus-healthy stratification of OGTT response; a future T2DM-specific study (e.g., a popPK/PD analysis stratifying by HbA1c level) can ratify the same canonical and document the reference cohort incovariateData[[T2DM]]$notes.
HYPERT (canonical for hypertension comorbidity / medical-history indicator)
- Description: 1 = patient has a history of (or current) hypertension as a comorbidity; 0 = no hypertension. Time-fixed at study entry per subject (medical-history flag rather than time-varying blood-pressure measurement).
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no hypertension comorbidity).
-
Source aliases:
-
MHHY(medical history of hypertension) – used inGirard_2012_pimasertib.R.
-
-
Example models:
Girard_2012_pimasertib.R(additive shift on the cumulative-logit AE-score model:theta_mhhy * HYPERT; +0.539 logit units in patients with prior hypertension). -
Notes: Companion to
DIAB(diabetes-mellitus comorbidity); both are baseline binary medical-history flags collected from clinical-history forms. Captures any prior or current hypertension diagnosis, regardless of treatment status; if a future model needs to separate treated vs untreated hypertension, register a refinement (HYPERT_TREATED).
Surgical history / disease state
POD (canonical for post-operative day)
- Description: Days elapsed since the qualifying surgical event (e.g., solid-organ transplantation, major resection). Time-varying within subject; integer- or fractional-day valued; rises monotonically from 0 at the day of surgery. Captures time-since-surgery effects on PK that are not explained by other covariates – e.g., post-transplant clearance of immunosuppressants typically declines toward a steady value over the first weeks-to-months as graft function, fluid status, hematocrit, and corticosteroid taper stabilise.
- Units: days
- Type: continuous
- Scope: general
-
Reference category: n/a – typically used in
centred-deviation form
(1 + e_pod_param * (POD - ref_pod)), sometimes with an upper cap (e.g., values > 180 fixed to 180 days when the residual time-varying effect plateaus). -
Source aliases:
-
POD– used inBergmann_2014_tacrolimus.R(capped at 180 days; centred at 22.7 days, the dataset mean).
-
-
Example models:
Bergmann_2014_tacrolimus.R(linear deviation from POD = 22.7 days on tacrolimus CL/F; coefficient -0.0021 per day implies a 0.21% per-day decrease in apparent oral clearance with a 180-day plateau). -
Notes: Time-varying within subject; the per-row
value is the integer day count from the date of surgery to the
observation date. For solid-organ-transplant cohorts,
PODis the conventional NONMEM$INPUTcolumn name. When the source paper reports a different name (DPTfor “days post-transplant”,TX_DAY,T_POSTOP), record the alias here. Distinct fromTIME(rxode2 time clock) and fromOCC(integer-valued occasion / period indicator for IOV). When a paper usesPODjointly with an IOV occasion column, both can coexist in the dataset:PODenters the typical-value covariate equation,OCCmultiplexes the IOV etas. The 180-day cap in Bergmann 2014 is data-driven (most observations are within the first 90 days post-transplant, so the linear effect is identifiable only over that window) – document any per-model cap incovariateData[[POD]]$notes. Ratified canonically on 2026-05-08 alongside the Bergmann 2014 extraction.
TTD (canonical for time to death)
- Description: Days remaining from the observation time to the patient’s recorded time of death (TTD >= 0; falls to 0 on the day of death). Time-varying per subject. Available only retrospectively in observational palliative-care / end-of-life cohorts; not usable as a prospective predictor.
- Units: days
- Type: continuous
- Scope: specific
-
Reference category: n/a – used in a first-order
exponential decay form on a structural PK parameter. The Franken 2015
morphine model parameterises
CL(TTD) = CL_pop - theta_D * exp(-theta_rate * TTD), so the decay term vanishes as TTD -> infinity (asymptotic CL far from death) and reaches its peak droptheta_Das TTD -> 0 (day of death). -
Source aliases:
-
TTD– used inFranken_2015_morphine.R(Franken 2015 NONMEM column for time-to-death in days; paper Eq. 3 and Table 2).
-
-
Example models:
Franken_2015_morphine.R(Franken 2015 Clin Pharmacokinet; first-order exponential decay term on morphine clearance with theta_D = 17.6 L/h and theta_rate = 0.13 /day; CL drops from 47.5 to 29.9 L/h as TTD goes from infinity to 0). -
Notes: Specific scope because the covariate’s
semantics depend on a palliative-care / observational-cohort study
design where time of death is known by retrospective abstraction. The
Franken 2015 model interprets the TTD-dependent CL change as a composite
of physiological end-of-life processes (e.g., reduced hepatic blood
flow, cachexia) that are not captured by standard blood-chemistry
covariates. For prospective simulation of a virtual cohort without a
known time of death, set TTD to a large value (e.g., > 50 days) to
recover the asymptotic CL_pop. Distinct from the canonical
POD(post-operative day, monotonically increasing from a surgery date) – TTD counts down to death rather than up from a procedure. Ratified canonically on 2026-05-16 alongside the Franken 2015 morphine extraction.
POSTTX_DAY1 (canonical for first-24-hours-post-transplant indicator)
- Description: Binary indicator for the first 24 hours post-transplant. 1 = the observation falls within the first 24 hours (day 1) post-transplant; 0 = otherwise. Time-varying per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (any observation outside the first 24 hours post-transplant).
-
Source aliases:
-
first day post-transplant– used inStorset_2014_tacrolimus.R.
-
-
Example models:
Storset_2014_tacrolimus.R(multiplicative ~2.68-fold increase in oral bioavailability on day 1:fdepot *= 2.68^POSTTX_DAY1; Storset 2014 Table 2 final theory-based model retains the day-1 factor with subject-level IIV of 57% CV on the day-1 multiplier). -
Notes: Distinct from the continuous
POD(post-operative day) canonical above –POSTTX_DAY1is a derived binary indicator (operationallyPOSTTX_DAY1 = as.integer(POD < 1)when both columns are present in the dataset). The two coexist in the same model when the source paper uses POD-based continuous effects on some parameters and a separate binary day-1 effect on others (Storset 2014 retains the binary day-1 factor explicitly because the day-1 oral bioavailability is ~2.68-fold higher than the rest of the post-transplant period and is not well captured by a continuous POD effect). Storset 2014 Discussion attributes the day-1 oral-bioavailability spike to candidate mechanisms including methylprednisolone-bolus inhibition of intestinal CYP3A / P-glycoprotein, surgery-related inflammation, anaesthesia / opioid effects on gut motility, and reduced food intake – but no single mechanism was identifiable in the data. The 2.68-fold factor was retained because it produced a 209-point OFV decrease and was crucial for predicting concentrations measured on the first post-transplant day. In Storset 2014 the day-1 effect carries its own subject-level eta (BSV 57% CV on the day-1 factor); only subjects with day-1 observations contribute to that eta. Ratified canonically on 2026-05-08.
TX_LIVER (canonical for liver (hepatic) transplant indicator)
- Description: Binary indicator for liver / hepatic-graft transplant recipients in a pooled solid-organ-transplant cohort. 1 = liver transplant; 0 = non-liver solid-organ transplant (kidney, heart, or lung). Time-fixed per subject (assigned at transplantation date).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-liver solid-organ graft – typically kidney as the most common non-liver comparator in pooled solid-organ-transplant analyses).
-
Source aliases:
-
Hepatic trans_CL– used inNanga_2019_tacrolimus_metaanalysis.R(Table 3 covariate-effect label; encoded as a multiplicative power coefficient theta^TX_LIVER on CL/F).
-
-
Example models:
Nanga_2019_tacrolimus_metaanalysis.R(multiplicative effect on apparent oral clearance:cl_typ *= 0.38^TX_LIVER, so liver-graft recipients have CL/F 62% lower than non-liver recipients at the same body weight and post-transplant day; Nanga 2019 Table 3 ‘Hepatic trans_CL’ = 0.38). -
Notes: Specific scope because the reference
complement (non-liver solid-organ graft) is paper-defined – in Nanga
2019 the complement is exclusively kidney-transplant patients (Table 5:
201 liver, 80 kidney). Future pooled solid-organ-transplant popPK models
that distinguish liver from non-liver patients should reuse this
canonical; if a future model decomposes the non-liver class into
separate kidney / heart / lung indicators, register sibling canonicals
(
TX_KIDNEY,TX_HEART,TX_LUNG) rather than overloading this entry. Distinct fromPOSTTX_DAY1(first-24-hour-post-transplant indicator, time-varying) and fromPOD(continuous post-transplant day); all three can coexist when a model parameterises both organ type and time-after-transplantation effects. Ratified canonically on 2026-05-18 alongside the Nanga 2019 tacrolimus meta-analysis extraction.
PRIOR_GAST (canonical for prior gastrectomy)
- Description: Prior (partial or total) gastrectomy indicator, 1 = prior gastrectomy, 0 = no prior gastrectomy. Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no prior gastrectomy).
-
Source aliases:
-
GAST– used inYamada_2025_zolbetuximab.R.
-
-
Example models:
Yamada_2025_zolbetuximab.R(fractional effects on CLss, CLT, V1). -
Notes: Renamed from
GASTon 2026-04-20 to follow thePRIOR_TNF/PRICORTnaming pattern for prior-treatment and surgical-history indicators. Applicable to any PK model where gastrointestinal anatomy affects absorption, first-pass, or protein turnover; not inherently oncology-specific. No distinction between partial vs total gastrectomy unless the source paper separates them.
Disease state (cross-population indicators)
DIS_UC (canonical for ulcerative colitis disease-state indicator)
- Description: 1 = ulcerative colitis patient, 0 = non-UC (e.g., healthy volunteer or non-IBD indication). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-UC subject; the complement group is defined per-model – typically healthy volunteers and/or patients with another indication such as asthma).
-
Source aliases:
-
UC– used inHua_2015_anrukinzumab.R.
-
-
Example models:
Hua_2015_anrukinzumab.R(multiplicative fractional increase in CL, +72.8%, on top of weight and albumin effects). -
Notes: Used when a population PK model pools UC
patients with a non-UC reference population (e.g., Hua 2015: healthy
volunteers + asthma patients + UC patients) and UC disease status is
tested as a PK covariate. Distinct from
DISEXT_EP/DISEXT_OTHER, which operate within a UC-only cohort (disease extension). Start as scope: specific; promote to general if a second paper pools UC with a non-UC reference.
DIS_SASTHMA (canonical for moderate-to-severe asthma disease-state indicator)
- Description: 1 = moderate-to-severe asthma patient, 0 = not (e.g., healthy volunteer, mild-to-moderate asthma, or other indication). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-moderate-to-severe-asthma subject; the complement group is defined per-model).
-
Source aliases:
-
sAsthma– used inHua_2015_anrukinzumab.R.
-
-
Example models:
Hua_2015_anrukinzumab.R(multiplicative fractional change in SC bioavailability, -30.9%). - Notes: The moderate-to-severe vs. mild-to-moderate asthma cutoff is protocol-defined; Hua 2015 uses FEV1 55-80% and ACQ-5 >= 2 for “moderate to severe” (study 4) versus FEV1 > 70% and ACQ-5 <= 1 for “mild to moderate” (study 1). Scope: specific because the severity threshold is tied to a particular analysis plan; future asthma-severity indicators with different thresholds should register as separate canonicals.
DIS_PJIA (canonical for polyarticular juvenile idiopathic arthritis disease-state indicator)
- Description: 1 = polyarticular juvenile idiopathic arthritis (pJIA) patient, 0 = non-pJIA (e.g., adult rheumatoid arthritis or other indication). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-pJIA subject; the complement group is defined per-model – typically adult RA in pooled abatacept analyses).
-
Source aliases:
-
JIA– used inGandhi_2021_abatacept.RandZhong_2026_abatacept.R.
-
-
Example models:
Gandhi_2021_abatacept.R(additive coefficient on logit-F: pJIA patients have markedly higher SC bioavailability than RA reference),Zhong_2026_abatacept.R(additive coefficient +3.08 on logit-F transferred verbatim from a previous internal JIA PPK model that matches Gandhi 2021’s published value). -
Notes: Used when a population PK model pools pJIA
patients with a non-pJIA reference population (e.g., Gandhi 2021: pooled
adult RA + pediatric pJIA; Zhong 2026: pooled adult RA + pediatric pJIA
+ adult/pediatric HM) and pJIA disease/age status is tested as a PK
covariate (typically on bioavailability rather than CL). Distinct from
CHILDandADOLESCENT, which are pure age-band indicators independent of indication. Scope: specific; promote to general if a third paper pools pJIA with a non-pJIA reference and the reference category remains adult RA.
DIS_CANCER (canonical for advanced-solid-tumor / oncology cohort indicator)
- Description: 1 = patient with an advanced or metastatic solid tumor (the oncology cohort in a pooled multi-indication PK/PD analysis), 0 = non-oncology subject (healthy volunteer or non-oncology disease cohort pooled in the source analysis). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-oncology subject; the complement group is paper-defined – typically the union of healthy volunteers and a non-oncology disease cohort such as cGVHD pooled in the source analysis).
-
Source aliases: none; source NONMEM / Monolix
control streams typically derive the indicator from a
POPorSTUDYcategorical alongsideDIS_HEALTHY. -
Example models:
Yang_2024_axatilimab.R(multiplicative effect on baseline NCMC:BL_NCMC x exp(1.22 x DIS_CANCER + 0.618 x DIS_HEALTHY); reference category cGVHD when both indicators are 0). -
Notes: Used together with
DIS_HEALTHYto decompose a three-level “participant population” categorical (cGVHD reference, advanced solid tumor, healthy volunteer) into two orthogonal binary indicators. Scope: specific because the disease-pooling reference category is paper-defined (Yang 2024 reference is patients with cGVHD). Ratified canonically on 2026-04-28.
DIS_CANCER_PED (canonical for pediatric oncology cohort indicator)
-
Description: 1 = pediatric patient receiving
cancer-directed therapy (any malignancy, including hematologic cancers
such as leukemia and lymphoma as well as solid tumors / blastomas), 0 =
pediatric patient admitted for a non-oncology indication
(e.g. infection, surgery, transplant). Time-fixed per subject. Distinct
from
DIS_CANCER, which is restricted to advanced/metastatic solid tumors in adults;DIS_CANCER_PEDis the pediatric variant in theDIS_CANCER*family and explicitly covers leukemia-dominant pediatric cohorts. - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-oncology pediatric patient; the complement group is paper-defined – in Llanos-Paez 2020 the complement is pediatric patients admitted for various non-oncology indications, with appendicitis and kidney disease / urinary tract infection the most common).
-
Source aliases:
-
ONCOLOGY– Llanos-Paez 2020 NONMEM column with the same orientation (1 = oncology, 0 = nononcology); maps directly toDIS_CANCER_PED.
-
-
Example models:
Llanos-Paez_2020_gentamicin.R(multiplicative cohort shifts on V1 (-0.154) and Q (-0.321) relative to the nononcology baseline; CL has no oncology effect). -
Notes: Use
DIS_CANCER_PEDrather thanDIS_CANCERwhenever the source paper’s “oncology” cohort includes hematologic malignancies (leukemia / lymphoma) or pediatric blastomas, becauseDIS_CANCERis canonically restricted to advanced/metastatic solid tumors. Reference-category complement is paper-defined (Llanos-Paez 2020 complement is the pooled pediatric non-oncology admissions cohort). Scope: specific because the complement is paper-defined. Covariate-effect parameters drop theDIS_prefix per the existingDIS_CANCER->e_cancer_*convention (Yang 2024); usee_cancer_ped_<param>.
DIS_HEALTHY (canonical for healthy-participant cohort indicator)
- Description: 1 = healthy participant (no diagnosis), 0 = patient (any diagnosis represented in the pooled cohort). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (patient subject; the complement group is the union of disease cohorts pooled in the source analysis).
-
Source aliases: none known; healthy-participant
indicators in source NONMEM control streams typically use ad-hoc names
(e.g.,
HV,HEALTHY,DIS_HV). -
Example models:
Nikanjam_2019_siltuximab.R(multiplicative effects: 0.77 on CL, 0.83 on Vss; reference category is the pooled non-healthy oncology cohort),Okada_2025_rocatinlimab.R(multiplicative shift1 - 0.532on Vmax when 1; reference complement is the pooled atopic-dermatitis + ulcerative-colitis + plaque-psoriasis patient cohort),Yang_2024_axatilimab.R(multiplicative effect on baseline NCMC:BL_NCMC x exp(1.22 x DIS_CANCER + 0.618 x DIS_HEALTHY); reference category cGVHD),Goel_2016_Sonidegib.R(multiplicative power-form effect on CL/F:2.96^DIS_HEALTHY; reference category is the pooled cancer-patient cohort across X2101 / X1101 / A2201),Brown_2017_osimertinib.R(linear factor(1 + 0.44 x DIS_HEALTHY)on apparent osimertinib clearance and(1 + 1.25 x DIS_HEALTHY)on apparent AZ5104 clearance; reference category is the pooled NSCLC cohort across AURA / AURA2),Lu_2015_vismodegib.R(additive-on-log-scale shift on ka viaexp(0.671 * DIS_HEALTHY)and on F viaexp(0.881 * DIS_HEALTHY)gated on the Phase I formulation indicator; reference category is the pooled cancer-patient cohort across SHH3925g / SHH4610g / SHH4476g),Gupta_2016_lenvatinib.R(multiplicative power-form effect on CL/F:1.15^DIS_HEALTHY; reference category is the pooled solid-tumor / thyroid-cancer patient cohort across 15 phase 1-3 studies; healthy subjects show +15 percent CL/F vs cancer patients),Bienczak_2025_ligelizumab.R(log-additiveexp(-0.087 * DIS_HEALTHY)on apparent ligelizumab CL/F; reference category is the pooled chronic-spontaneous-urticaria patient cohort across C2201 / C2202 / C2302 / C2303). -
Notes: Used when a population PK model pools
healthy participants with patients across heterogeneous indications and
the healthy-vs-patient contrast is retained as a covariate. Scope:
specific because the complement reference category is paper-defined
(Nikanjam 2019 reference is “all non-healthy, non-Castleman, non-SMM
tumor types”; Okada 2025 reference is the pooled AD+UC+psoriasis patient
cohort; Yang 2024 reference is patients with cGVHD; Goel 2016 reference
is the pooled cancer-patient cohort with advanced solid tumors or BCC;
Brown 2017 reference is the pooled advanced NSCLC cohort; Lu 2015
reference is the pooled cancer-patient cohort with advanced solid tumors
/ metastatic or locally-advanced BCC). The retired canonical name
DIS_HV(healthy-volunteer) was renamed on 2026-05-11 because “volunteer” terminology is discouraged for clinical-trial participants. Ratified canonically on 2026-04-24.
DIS_CASTLEMAN (canonical for Castleman’s disease indicator)
- Description: 1 = Castleman’s disease (multicentric or unicentric), 0 = not Castleman’s disease. Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-Castleman subject; the complement group is the union of healthy volunteers and other indications pooled in the source analysis).
-
Source aliases: none known; source NONMEM control
streams typically use ad-hoc names (e.g.,
CD,CASTLEMAN). -
Example models:
Nikanjam_2019_siltuximab.R(multiplicative +24% effect on CL; no Vss effect). - Notes: Castleman’s disease is a lymphoproliferative disorder strongly associated with elevated IL-6 levels; it is the only FDA-approved indication for siltuximab. Scope: specific because the disease-pooling reference category is paper-defined. Ratified canonically on 2026-04-24.
DIS_HOFH (canonical for homozygous familial hypercholesterolemia patient indicator)
- Description: 1 = patient with homozygous familial hypercholesterolemia (HoFH), 0 = non-HoFH subject (typically healthy volunteer or another reference cohort pooled in the source analysis). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-HoFH subject; the complement group is paper-defined – for Pu 2021 the reference is the pooled healthy-volunteer cohort).
-
Source aliases:
-
DISTYPN– used inPu_2021_evinacumab.R(Pu 2021 NM-TRAN $INPUT column for HoFH-vs-HV disease type, 1 = HoFH).
-
-
Example models:
Pu_2021_evinacumab.R(multiplicativeexp(theta * DIS_HOFH)factor on Vmax with theta = -0.289, i.e. HoFH patients show ~25% lower target-mediated Vmax than the HV reference; biologically consistent with the LDLR-pathway disruption in HoFH altering ANGPTL3 catabolic kinetics). - Notes: Used when a population PK model pools HoFH patients with healthy volunteers (or another non-HoFH cohort) and HoFH disease status is retained as a covariate. Distinct from a heterozygous-FH (HeFH) indicator because HoFH patients have markedly higher baseline LDL-C (untreated levels often > 500 mg/dL) and a more pronounced response to LDLR-independent therapies. Scope: specific because the reference category is paper-defined.
DIS_HAE (canonical for hereditary angioedema patient indicator)
- Description: 1 = patient with hereditary angioedema (HAE-C1INH-Type1, HAE-C1INH-Type2, or HAE-nC1INH), 0 = healthy volunteer (or other non-HAE reference cohort pooled in the source analysis). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-HAE subject; the complement group is paper-defined – for Diep 2026 the reference is the pooled healthy-volunteer cohort from NCT03263507 and ISIS 721744-CS9).
- Source aliases: paper narrative “patient with HAE” / “healthy volunteer” subgroup labels driving the Diep 2026 disease-status covariate effects on Vc/F, Q/F, baseline PKK, and IC50.
-
Example models:
Diep_2026_donidalorsen.R(linear(1 + theta * DIS_HAE)multiplicative effects on apparent central volume Vc/F (theta = +0.426, +42.6%), apparent intercompartmental clearance Q/F (theta = -0.261, -26.1%), baseline plasma prekallikrein BL (theta = -0.132, -13.2%), and donidalorsen IC50 on PKK production (theta = +0.770, +77.0%) for patients with HAE vs healthy volunteers). -
Notes: Used when a population PK/PD model pools HAE
patients with healthy volunteers and HAE disease status is retained as a
covariate. The three molecular HAE subtypes (HAE-C1INH-Type1,
HAE-C1INH-Type2, HAE-nC1INH) are pooled in this indicator following the
Diep 2026 analysis; if a future paper resolves subtype-specific
covariate effects, separate canonical indicators (e.g.,
DIS_HAE_C1INH_T1) can be added without conflicting with this pooled indicator. Scope: specific because the complement reference category is paper-defined.
DIS_PBC (canonical for primary biliary cirrhosis disease-state indicator)
- Description: 1 = patient with primary biliary cirrhosis (PBC), an autoimmune destruction of intrahepatic bile ducts causing chronic cholestasis; 0 = non-PBC subject (healthy reference cohort or other non-PBC reference). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-PBC subject; complement reference is paper-defined – for Zuo 2016 the reference is the pooled healthy-adult cohort from Xiang 2011 / Dilger 2012 / Hess 2004).
- Source aliases: paper narrative “patient with PBC” / “healthy” subgroup labels in Zuo 2016.
-
Example models:
Zuo_2016_UDCA.R(multiplicative scaling on liver-to-biliary rate constants when DIS_PBC = 1: K_LB,0 scaled by 0.10 – 90% reduction; K_LB,1 scaled by 0.30 – 70% reduction; K_LB,2 scaled by 0.10 – 90% reduction; reproduces the Zuo 2016 Figure 3 PBC simulation). -
Notes: Used when a systems / popPK model adapts a
healthy-state structural model to a PBC population via fixed
disease-state scaling on hepatic-excretion rate constants. Scope:
specific because the structural adaptation form (which K parameters are
scaled, by how much) is paper-defined; future PBC extractions that
re-estimate or alter the scaling pattern can extend the example-models
list. Distinct from
DIS_HEPATIMP(hepatic-impairment severity categorical),DBIL(direct bilirubin biomarker), andALP(cholestasis biomarker), which describe pathophysiology rather than the disease label itself.
DIS_DMD (canonical for Duchenne muscular dystrophy patient indicator)
- Description: 1 = patient with Duchenne muscular dystrophy (DMD), 0 = non-DMD subject (healthy volunteer or other reference cohort). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-DMD subject; the complement group is the reference cohort the source analysis pools alongside the DMD population – typically healthy adult volunteers).
-
Source aliases:
SPOP(Wojciechowski 2022 study-population indicator with the same orientation: 1 = DMD pediatric patient, 0 = healthy adult volunteer). -
Example models:
Wojciechowski_2022_domagrozumab.R(additive1 + thetashift on baseline myostatin and on the joint kdeg/kint axis; theta_BASE = -0.641, theta_kdegkint = -0.900). - Notes: Used when a population PK/PD model pools DMD patients with a non-DMD reference population and DMD disease status is retained as a covariate. Scope: specific because the reference category is paper-defined. Ratified canonically on 2026-04-26.
DIS_SMM (canonical for smoldering multiple myeloma indicator)
- Description: 1 = smoldering (asymptomatic) multiple myeloma, 0 = not smoldering MM. Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-SMM subject; the complement group is the union of healthy volunteers and other indications pooled in the source analysis).
-
Source aliases: none known; source NONMEM control
streams typically use ad-hoc names (e.g.,
SMM,SMOLDMM). -
Example models:
Nikanjam_2019_siltuximab.R(multiplicative -23% effect on Vss; no CL effect). - Notes: Smoldering multiple myeloma is an asymptomatic plasma-cell disorder distinct from active multiple myeloma; pooled with the Nikanjam 2019 cohort that also included MGUS, multiple myeloma, RCC, ovarian, and other tumor types. Scope: specific because the disease-pooling reference category is paper-defined. Ratified canonically on 2026-04-24.
MM (canonical for active multiple myeloma disease indicator)
- Description: 1 = active (non-smoldering) multiple myeloma, 0 = other hematologic malignancy or reference group. Time-fixed per subject.
DIS_PNH (canonical for paroxysmal nocturnal hemoglobinuria indicator)
- Description: 1 = paroxysmal nocturnal hemoglobinuria (PNH) patient, 0 = non-PNH subject (healthy volunteer or another indication pooled in the source analysis). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-PNH subject; the complement group is paper-defined – for Lin 2024 it pools healthy adult volunteers and CHAPLE disease patients).
-
Source aliases: none known; source NONMEM control
streams typically use ad-hoc names (e.g.,
PNH,DPNH). -
Example models:
Lin_2024_pozelimab.R(additive-fractional +34.07% effect on Vc; no CL or Vp effect; reference category pools healthy volunteers and CHAPLE patients). - Notes: Paroxysmal nocturnal hemoglobinuria is a rare hematological disease characterized by uncontrolled complement activation on red blood cells; treated with C5-targeted complement inhibitors (eculizumab, ravulizumab, pozelimab). Scope: specific because the disease-pooling reference category is paper-defined. Ratified canonically on 2026-04-27.
MDSAML (canonical for MDS or AML disease-type indicator)
- Description: 1 = patient with myelodysplastic syndrome (MDS) or acute myeloid leukemia (AML), 0 = other hematologic malignancy or reference group. Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-MDS/AML subjects; the complement group is defined per-model – typically multiple myeloma and non-Hodgkin lymphoma in Ogasawara 2020).
-
Source aliases: none;
MDSAMLis the combined indicator used directly in source analyses. -
Example models:
Ogasawara_2020_durvalumab.R(multiplicative factor 1.26 on CL; reference group is the union of MM and NHL subjects). -
Notes: Use
MDSAMLas a combined MDS+AML indicator when the source paper collapses the two diagnoses into one covariate. If a future paper separates MDS and AML as distinct indicators, registerDIS_MDSandDIS_AMLseparately. Scope: specific because the reference category is paper-defined. Ratified canonically on 2026-04-26.
DIS_AML (canonical for acute myeloid leukemia disease-state indicator)
- Description: 1 = patient with acute myeloid leukemia (AML), 0 = non-AML subject (the complement group in a pooled multi-indication PK analysis). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (non-AML subject; the
complement group is paper-defined – for Xu 2023 the reference is
patients with advanced solid tumors, alongside the parallel
DIS_MDSandDIS_CMMLindicators that decompose the hematologic-malignancy cohort). -
Source aliases:
-
DISEASE_abb == "AML"– used inXu_2023_MBG453.R(the Monolix supplement Appendix S2 encodes disease as the categorical columnDISEASE_abbwith categories{AML, CMML, MDS, Solid_Tumor}and referenceSolid_Tumor; the canonical column carries the binaryas.integer(DISEASE_abb == "AML")).
-
-
Example models:
Xu_2023_MBG453.R(exponential effect on CL:exp(-0.0146 * DIS_AML); not statistically significant in the full covariate model but retained because Xu 2023 used the full-covariate-model approach). -
Notes: Use
DIS_AML(rather than the combinedMDSAML) when the source paper separates AML from MDS as distinct indicators. Scope: specific because the disease-pooling reference category is paper-defined.
DIS_MDS (canonical for myelodysplastic syndrome disease-state indicator)
- Description: 1 = patient with myelodysplastic syndrome (MDS), 0 = non-MDS subject (the complement group in a pooled multi-indication PK analysis). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (non-MDS subject; the
complement group is paper-defined – for Xu 2023 the reference is
patients with advanced solid tumors, alongside the parallel
DIS_AMLandDIS_CMMLindicators). -
Source aliases:
-
DISEASE_abb == "MDS"– used inXu_2023_MBG453.R(Monolix supplement Appendix S2 categorical column; reference categorySolid_Tumor).
-
-
Example models:
Xu_2023_MBG453.R(exponential effect on CL:exp(-0.149 * DIS_MDS); statistically significant, p = 0.021 – patients with MDS have ~14% lower CL than the solid-tumor reference). -
Notes: Use
DIS_MDS(rather than the combinedMDSAML) when the source paper separates MDS from AML as distinct indicators. Scope: specific because the disease-pooling reference category is paper-defined.
DIS_CMML (canonical for chronic myelomonocytic leukemia disease-state indicator)
- Description: 1 = patient with chronic myelomonocytic leukemia (CMML), 0 = non-CMML subject (the complement group in a pooled multi-indication PK analysis). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (non-CMML subject; the
complement group is paper-defined – for Xu 2023 the reference is
patients with advanced solid tumors, alongside the parallel
DIS_AMLandDIS_MDSindicators that decompose the hematologic-malignancy cohort). -
Source aliases:
-
DISEASE_abb == "CMML"– used inXu_2023_MBG453.R(Monolix supplement Appendix S2 categorical column; reference categorySolid_Tumor).
-
-
Example models:
Xu_2023_MBG453.R(exponential effect on CL:exp(-0.0411 * DIS_CMML); not statistically significant in the full covariate model but retained because Xu 2023 used the full-covariate-model approach). - Notes: CMML is a clonal myeloid malignancy with overlapping features of MDS and myeloproliferative neoplasms. Scope: specific because the disease-pooling reference category is paper-defined.
DIS_BCPALL (canonical for B-cell precursor acute lymphoblastic leukemia disease-state indicator)
- Description: 1 = B-cell precursor acute lymphoblastic leukemia (BCP-ALL), 0 = B-cell non-Hodgkin’s lymphoma (NHL) or other non-BCP-ALL indication pooled in the source analysis. Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-BCP-ALL; in the Wu 2024 cohort this is the adult B-cell NHL stratum).
-
Source aliases:
-
ALL– used inWu_2024_inotuzumab.R(Wu 2024 calls it the “ALL effect” and notes it bundles disease type with the corresponding bioanalytical assay difference).
-
-
Example models:
Wu_2024_inotuzumab.R(additive fractional-change effects on CL1 (-0.767) and CL2 (-0.362), and gates the BLSTABL and AGE effects on kdes; for kdes itself a -0.924 fractional change for BCP-ALL). - Notes: Used when a population PK model pools BCP-ALL patients with a non-BCP-ALL reference (e.g., Wu 2024: pooled adult B-cell NHL + adult BCP-ALL + pediatric BCP-ALL). Scope: specific because the complement reference category is paper-defined (Wu 2024 reference is pooled adult B-cell NHL). The “ALL effect” theta in Wu 2024 conflates two physiologically distinct sources of variation – B-cell tumor type (NHL vs ALL surface CD22 burden) and bioanalytical method (ELISA for adult NHL vs HPLC-MS for ALL) – and cannot be split with the available data; document this confounding when comparing across populations. Ratified canonically on 2026-04-26.
DIS_SAD (canonical for secondary antibody deficiency indicator)
- Description: 1 = secondary antibody deficiency (SAD) patient (hypogammaglobulinaemia from external causes such as B-cell-depleting therapy, haematological malignancy, or other immunosuppression), 0 = primary immunodeficiency (PID) patient. Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (PID patient; the complement category is the genetic / inborn-error-of-immunity primary immunodeficiency cohort pooled with SAD in the source analysis).
-
Source aliases: none known; source NONMEM control
streams typically use ad-hoc names (e.g.,
SAD,IMD,DIS). -
Example models:
Cheng_2026_immunoglobulin.R(multiplicativetheta^DIS_SADfactors on CL (0.542) and on baseline IgG (CBAS, 0.541); reference category PID). -
Notes: Used when a population PK model pools PID
and SAD pediatric or adult patients receiving immunoglobulin replacement
therapy (IgRT) and tests SAD-vs-PID as a covariate. Distinct from the
disease-state indicators that pool oncology / autoimmune indications:
DIS_SADspecifically partitions hypogammaglobulinaemia by its underlying mechanism (genetic vs. acquired). Scope: specific because the SAD cohort composition is paper-defined (in Cheng 2026, 75% post-rituximab and 25% post-CAR-T cell therapy). Ratified canonically on 2026-04-28.
DIS_AD (canonical for Alzheimer’s disease patient indicator)
- Description: 1 = participant with Alzheimer’s disease (clinical AD diagnosis), 0 = non-AD subject (typically healthy volunteer pooled in the source analysis). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-AD subject; the complement group is paper-defined – for Perez-Ruixo 2025 the reference is the pooled healthy-volunteer cohort).
-
Source aliases: none known; source NONMEM control
streams typically use ad-hoc names (e.g.,
AD,STATUS,DISGRP). -
Example models:
PerezRuixo_2025_posdinemab.R(acts on baseline free p217+tau in CSF, R0; healthy R0 = 0.793 pmol/L vs AD R0 = 5.995 pmol/L, a 656% relative increase, no PK-parameter effects). - Notes: Used when a population PK/PD model pools healthy volunteers with Alzheimer’s disease patients and the AD-vs-HV contrast is retained as a covariate on a target-related parameter (e.g., baseline p-tau, baseline p217+tau). Scope: specific because the complement reference category is paper-defined. Ratified canonically on 2026-04-28.
DIS_COPD (canonical for chronic obstructive pulmonary disease patient indicator)
- Description: 1 = patient with chronic obstructive pulmonary disease (clinical COPD diagnosis, typically moderate-to-severe per GOLD criteria), 0 = non-COPD subject (typically healthy volunteer pooled in the source analysis). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-COPD subject; the complement group is paper-defined – for Lahu 2010 the reference is the pooled phase I healthy-volunteer cohort).
-
Source aliases:
-
COPD– used inLahu_2010_roflumilast.R(paper text covariate symbol in equation 6 and 7).
-
-
Example models:
Lahu_2010_roflumilast.R(linear additive effects on roflumilast parent CL (-39.4%) and V1 (+184%) and on roflumilast N-oxide CL (-7.9%) and Vd (-21.4%); reference category 0 = pooled phase I healthy volunteers, 1 = pooled phase II/III moderate-to-severe COPD patient). - Notes: Used when a population PK/PD model pools healthy volunteers with COPD patients and the COPD-vs-HV contrast is retained as a covariate on PK parameters. Scope: specific because the complement reference category and the COPD-severity inclusion criteria are paper-defined.
DIS_OBESE_MORBID (canonical for morbidly obese cohort indicator)
- Description: 1 = morbidly obese patient (BMI > 40 kg/m^2 in the canonical definition; typical pooled-analysis enrollment criterion is bariatric-surgery patients), 0 = non-obese subject (typically healthy volunteer pooled in the source analysis). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-obese subject; the complement group is paper-defined – for de Hoogd 2017 the reference is the pooled healthy-volunteer cohort from Sarton 2000 and Romberg 2004).
-
Source aliases: none known; source NONMEM control
streams typically use ad-hoc names (e.g.,
OBESE,MO,COHORT). -
Example models:
deHoogd_2017_morphine.R(selects per-cohort proportional residual error magnitudes for each of three observed species – morphine, M3G, M6G – after a pooled-cohort fit of 20 morbidly obese surgical patients and 20 healthy volunteers). -
Notes: Used when a population PK or PK/PD model
pools morbidly obese patients with a non-obese reference population
(typically healthy volunteers) and the cohort indicator selects
per-cohort parameter values (residual error magnitudes, study-specific
bioavailability, or similar). Distinct from
BMI(which is the continuous body-mass-index covariate used for parameter scaling) –DIS_OBESE_MORBIDis the binary cohort-membership flag and does not encode a specific BMI threshold for general use; the threshold is paper-defined. Scope: specific because the complement reference category is paper-defined. Ratified canonically on 2026-05-11.
HSCT_URD_7OF8 (canonical for hematopoietic stem cell transplant from a 7-of-8 HLA-matched unrelated donor)
- Description: 1 = patient received an allogeneic hematopoietic stem cell transplant (HSCT) from an unrelated donor (URD) HLA-matched at 7 of 8 alleles (single-allele mismatch), 0 = otherwise (the union of patients not in this transplant cohort, including non-HSCT patients and HSCT recipients matched at all 8 alleles). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (not in the 7-of-8-matched
HSCT cohort; the complement group is paper-defined – for Zhong 2026 it
pools the RA + pJIA studies and the 8-of-8-matched HSCT cohort, with the
latter encoded by the parallel
HSCT_URD_8OF8indicator). -
Source aliases:
-
COHORT7– used inZhong_2026_abatacept.R(Zhong 2026 NM-TRAN indicator for ABA2 study Cohort 7/8).
-
-
Example models:
Zhong_2026_abatacept.R(exponential coefficient -0.326 on CL; the single-allele-mismatch HSCT cohort exhibits ~28% lower abatacept clearance than the reference complement). -
Notes: Used together with
HSCT_URD_8OF8to decompose a three-level “transplant cohort” categorical (non-HSCT-cohort / 7-of-8 / 8-of-8) into two orthogonal binary indicators. The 7-of-8 cohort represents a higher GvHD-risk population because of the single-allele HLA mismatch. Scope: specific because the reference complement (the union of non-transplant disease cohorts pooled in the source analysis) is paper-defined. Ratified canonically on 2026-04-29.
HSCT_URD_8OF8 (canonical for hematopoietic stem cell transplant from an 8-of-8 HLA-matched unrelated donor)
- Description: 1 = patient received an allogeneic hematopoietic stem cell transplant (HSCT) from an unrelated donor (URD) HLA-matched at all 8 alleles (full match), 0 = otherwise (the union of patients not in this transplant cohort, including non-HSCT patients and HSCT recipients matched at 7 of 8 alleles). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (not in the 8-of-8-matched
HSCT cohort; the complement group is paper-defined – for Zhong 2026 it
pools the RA + pJIA studies and the 7-of-8-matched HSCT cohort, with the
latter encoded by the parallel
HSCT_URD_7OF8indicator). -
Source aliases:
-
COHORT8– used inZhong_2026_abatacept.R(Zhong 2026 NM-TRAN indicator for ABA2 study Cohort 8/8).
-
-
Example models:
Zhong_2026_abatacept.R(exponential coefficient -0.0934 on CL and +0.257 on VC; the fully-HLA-matched HSCT cohort exhibits a small CL decrease and a larger VC increase relative to the reference complement). -
Notes: Used together with
HSCT_URD_7OF8to decompose a three-level “transplant cohort” categorical (non-HSCT-cohort / 7-of-8 / 8-of-8) into two orthogonal binary indicators. The 8-of-8 cohort is the lower-risk HLA-matching configuration. Scope: specific because the reference complement (the union of non-transplant disease cohorts pooled in the source analysis) is paper-defined. Ratified canonically on 2026-04-29.
DIS_PSORIASIS (canonical for plaque psoriasis disease-state indicator)
- Description: 1 = plaque psoriasis patient, 0 = non-psoriasis subject (e.g., atopic dermatitis, ulcerative colitis, or healthy volunteer). Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-psoriasis subject; the complement group is paper-defined – the union of other disease cohorts pooled in the source analysis).
-
Source aliases: none known; source NONMEM control
streams typically use a categorical
DISindicator (e.g., Okada 2025:DIS=1for psoriasis,DIS=0for healthy,DIS=2for UC,DIS=3for AD), decomposed into a binaryDIS_PSORIASISindicator at ingestion. -
Example models:
Okada_2025_rocatinlimab.R(multiplicative shift1 - 0.372on linear CL when 1; reference complement is the pooled atopic dermatitis + ulcerative colitis + healthy-volunteer cohort). - Notes: Used when a population PK model pools plaque-psoriasis patients with a non-psoriasis reference population and psoriasis disease status is retained as a covariate. Scope: specific because the disease-pooling reference category is paper-defined. Ratified canonically on 2026-04-27.
CARRAGEENAN (canonical for intraplantar-carrageenan inflammatory-challenge indicator)
- Description: Binary indicator for intraplantar injection of carrageenan suspension as an experimental inflammatory / hyperalgesic challenge. 1 = subject received an intraplantar carrageenan injection at the start of the experiment (the carrageenan-induced peripheral inflammation / thermal-hyperalgesia paradigm); 0 = subject received an intraplantar saline injection (sham control). Time-fixed per subject within an experiment.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (saline-injected sham animal; no induced inflammation).
-
Source aliases:
-
CARRAGEENAN– used inVasquezBahena_2009_lumiracoxib_rat.R(1 = groups II-IX, 100 uL of 1% carrageenan suspension into the right hind paw; 0 = group I, 100 uL of 0.9% saline solution).
-
-
Example models:
VasquezBahena_2009_lumiracoxib_rat.R(switches the COX-2 synthesis-rate model: CARRAGEENAN = 0 selects the constant saline synthesis rateks_cox2_salineand CARRAGEENAN = 1 selects the time-variant gamma functionks_cox2(t) = A * t^alpha * exp(-beta * t)driving the carrageenan-induced inflammation profile). -
Notes: The carrageenan-induced peripheral
inflammation / hyperalgesia model (Winter 1962; Hargreaves 1988
thermal-hyperalgesia variant) is one of the most widely used preclinical
assays for screening anti-inflammatory and analgesic drugs in rodents,
so the canonical name is reusable for future preclinical extractions.
Scope: specific until a second model ratifies the binary-switch
semantics on a different PD framework. The Hargreaves-test
thermal-hyperalgesia readout (
LTpaw withdrawal latency, seconds) is the typical observable when this indicator is in use, but other readouts (paw oedema, mechanical-allodynia von Frey threshold) are equally valid. Distinct from disease-state indicators (DIS_*) because the inflammatory state is experimentally induced at a defined time, not a chronic patient condition.
Pulmonary / lung-disease biomarkers
FEV1 (canonical for forced expiratory volume in 1 second)
- Description: Baseline forced expiratory volume in 1 second (FEV1) reported as an absolute volume in litres. Pulmonary-function spirometry endpoint; reflects large-airway airflow obstruction and is a standard covariate in chronic obstructive pulmonary disease, alpha-1 antitrypsin deficiency, asthma, and cystic-fibrosis disease-progression analyses. Distinct from FEV1 percent-predicted (the % predicted value standardises absolute FEV1 by reference equations from sex / age / height / ethnicity); use this canonical only for the absolute-litre value supplied as a covariate column.
-
Units: L (absolute volume; document the
exhalation-effort standard the source paper cites in
covariateData[[FEV1]]$notesif non-default – ATS / ERS post-bronchodilator is the typical convention). - Type: continuous
- Scope: specific
-
Reference category: n/a – used with
linear-deviation forms
(theta + e_fev1_param * (FEV1 - ref))or power forms(FEV1 / ref)^exponent. Reference values observed: 1.6 L (Tortorici 2017, population median across the RAPID-RCT/RAPID-OLE A1-PI augmentation cohort). -
Source aliases: none yet; canonical name preferred.
Source papers typically use the abbreviation
FEV1directly. -
Example models:
Tortorici_2017_a1pi.R(linear-deviation effect on the lung-density decline rate:theta5 * (FEV1 - 1.6)withtheta5 = +0.56 (g/L/year per L FEV1); lower-FEV1 patients have steeper natural decline rates independent of A1-PI exposure). -
Notes: Distinct from FEV1 percent-predicted (which
is a derived ratio with the reference-equation denominator built in;
FEV1% is the outcome variable in
Harun_2019_cysticFibrosis.Rrather than a covariate column). UseFEV1only when the source paper supplies the absolute-volume value; if the source supplies a percent-predicted value as a covariate, the canonical for that surface isFEV1_PCTPRED. Scope: specific until a second model ratifies the absolute-litre semantics; promote to general at that point. Ratified canonically on 2026-05-09 alongside the Tortorici 2017 extraction.
FEV1_PCTPRED (canonical for baseline FEV1 as percent of the predicted value)
- Description: Baseline forced expiratory volume in 1 second expressed as a percent of the sex / age / height / ethnicity reference-equation predicted value (FEV1% predicted). The percent-predicted scaling normalises the absolute-litre FEV1 measurement against a healthy-reference standard so a single covariate value is comparable across patient ages and body sizes. Standard pulmonary-function covariate in cystic-fibrosis disease-severity and inhaled-therapy popPK analyses. Time-fixed at study entry / baseline.
- Units: % predicted (numeric percentage, e.g. 62.1 for the Ting 2014 population median; not a fraction 0.621).
- Type: continuous
- Scope: specific
-
Reference category: n/a – used with power-form
effects
(FEV1_PCTPRED / ref)^exponentor linear-deviation forms(1 + e * (FEV1_PCTPRED - ref)). Reference values observed: 62.1 % (Ting 2014, population median across the combined three-study cystic-fibrosis cohort). -
Source aliases:
-
FEV1% predicted– used inTing_2014_tobramycin_inhaled.R(paper Table 1 / Table 2). Free-text label, not a typical NMTRAN column header; downstream data sets should use the canonical column nameFEV1_PCTPRED.
-
-
Example models:
Ting_2014_tobramycin_inhaled.R(power-form effect on apparent central volume of distribution:(FEV1_PCTPRED / 62.1)^-0.303; lower lung-function patients have larger apparent Vd/F, consistent with the paper’s hypothesis that worsening lung disease increases central-airway aerosol deposition). -
Notes: Distinct from absolute-litre
FEV1(canonical for the unstandardised volume) and fromFEV1as an outcome variable in disease-progression models (e.g.Harun_2019_cysticFibrosis.R, where percent-predicted FEV1 is the dependent variable rather than a covariate). The reference equation used to derive the percent-predicted value is paper-specific; document any non-default reference standard (Hankinson 1999 NHANES III, GLI 2012, Wang 1993, etc.) incovariateData[[FEV1_PCTPRED]]$notesper model. Scope: specific until a second model ratifies the percent-predicted semantics; promote to general at that point. Ratified canonically on 2026-05-12 alongside the Ting 2014 extraction.
A1PI (canonical for serum alpha-1 proteinase inhibitor concentration)
-
Description: Baseline serum alpha-1 proteinase
inhibitor (A1-PI; also known as alpha-1 antitrypsin, AAT) concentration.
Used in alpha-1 antitrypsin deficiency (AATD) augmentation-therapy
modelling as a per-subject pre-treatment exposure covariate (the
subject’s endogenous A1-PI level at study entry, before any augmentation
infusions). Time-fixed per subject. Distinct from a time-course of A1-PI
used as a state variable – when the source paper carries A1-PI as the
dynamic dependent variable (the augmentation model’s PD output), use
Ccrather thanA1PI. -
Units: umol/L (typical SI-convention reporting;
also reported as mg/dL in US-convention papers – document the unit used
in each model via
covariateData[[A1PI]]$units). Conversion: 1 umol/L A1-PI ~= 5.2 mg/dL (using MW ~52 kDa). The 11 umol/L “putative protective threshold” used clinically corresponds to ~57 mg/dL. - Type: continuous
- Scope: specific
-
Reference category: n/a – used with power-form
effects
(A1PI / ref)^exponenton the post-treatment exposure intercept and slope. Reference values observed: 5.5 umol/L (Tortorici 2017, approximate median pre-treatment A1-PI among RAPID-RCT placebo-randomised patients, used as the normalisation denominator for power-form covariate effects). -
Source aliases:
Cbase– used in Tortorici 2017’s published equation 6 to denote the baseline pre-treatment A1-PI value; the column name in the modelled dataset would beA1PI. -
Example models:
Tortorici_2017_a1pi.R(two power-form effects:(A1PI/5.5)^theta5withtheta5 = +0.73on the placebo-arm post-treatment exposure intercept, and(A1PI/5.5)^theta4withtheta4 = -0.12on the dose-rate slope; together they encode the modest dose-exposure dependence on each subject’s endogenous A1-PI level). -
Notes: AATD enrolment criteria typically restrict
A1PI to <= 11 umol/L (severe deficiency); reference / heterozygous
PI*MZ phenotypes have higher levels. Document the source paper’s
AATD-genotype enrolment criteria in
covariateData[[A1PI]]$notesper model. Specific scope because the canonical is tied to AATD augmentation-therapy modelling; future PK / PD analyses of A1-PI (or AAT) in non-AATD contexts (acute-phase response, smoking-induced inflammation) should ratify general scope at that time. Ratified canonically on 2026-05-09 alongside the Tortorici 2017 extraction.
Cystic fibrosis lung-disease indicators
AIR_TRAP_5Y (canonical for severe air trapping on chest HRCT scan at age 5 years)
- Description: 1 = subject had a non-zero “air trapping” component score on the validated Brody-II chest high-resolution computed tomography (HRCT) scan performed at age 5 years; 0 = air trapping component score of 0 (absent). Time-fixed per subject (the indicator captures the single end-of-study HRCT performed at age 5 in the Australasian Cystic Fibrosis Bronchoalveolar Lavage (ACFBAL) study). The Brody-II scoring system reports air trapping as a percentage of maximum possible HRCT score; the binary “present vs absent” dichotomy follows the same convention as Rosenow 2015 (Am J Respir Crit Care Med 191:1158-65).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (no air trapping detected on the age-5 HRCT scan).
-
Source aliases:
-
ATS5C– used inHarun_2019_cysticFibrosis.R(Harun 2019 NMTRAN$INPUTcolumn, “presence of air trapping at age 5”; values 0 = absent, 1 = present).
-
-
Example models:
Harun_2019_cysticFibrosis.R(linear-deviation effect on baseline FEV1% predicted at age 5:(1 + e_at_baseline * AIR_TRAP_5Y)with coefficient -0.0417, i.e., subjects with severe air trapping at age 5 have a baseline FEV1% predicted approximately 4.17% lower than those without). -
Notes: Specific scope because the indicator is tied
to the ACFBAL study’s standardised HRCT-at-age-5 protocol and the
Brody-II scoring rubric; future paediatric CF lung-disease studies that
score HRCT at a different age or use a different scoring system should
register a separate canonical (
AIR_TRAP_8Y,AIR_TRAP_PRAGMA, etc.) rather than overload this name. Ratified canonically on 2026-05-08 alongside the Harun 2019 extraction.
HOSPRA (canonical for hospitalisation due to a pulmonary exacerbation)
- Description: Time-varying binary indicator of inpatient hospitalisation for management of a pulmonary exacerbation at the time of the FEV1% predicted measurement: 1 = subject is hospitalised because of a pulmonary exacerbation when the spirometry value is recorded, 0 = not hospitalised at that visit. Used as a per-visit covariate in disease-progression models of FEV1% decline in cystic fibrosis where pulmonary exacerbations are tracked from the Australian Cystic Fibrosis Data Registry (ACFDR) inpatient records.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (not hospitalised for a pulmonary exacerbation at the time of the FEV1% measurement).
-
Source aliases:
-
HOSPRA– used inHarun_2019_cysticFibrosis.R(Harun 2019 NMTRAN$INPUTcolumn, “hospitalisation at the time of FEV1% predicted measurement; 0=no, 1=yes”).
-
-
Example models:
Harun_2019_cysticFibrosis.R(linear-deviation effects on the disease-progression maximum drop and on the half-effect age:(1 + e_hpe_dmax * HOSPRA)with coefficient -0.22 on the maximum FEV1% drop and(1 + e_hpe_t50max * HOSPRA)with coefficient -0.235 on the age at which 50% of the maximum drop occurs; hospitalised visits accelerate both the magnitude and the onset of FEV1% decline). - Notes: Specific scope because the canonical encoding pools all pulmonary exacerbation-driven hospitalisations into a single binary regardless of severity, duration, or treatment intensity; future CF / chronic-respiratory-disease studies that need to distinguish exacerbation severity (e.g., requiring intravenous antibiotics vs oral) should register a finer-grained canonical. Ratified canonically on 2026-05-08 alongside the Harun 2019 extraction.
Infectious disease
LNPC (canonical for log-transformed admission Plasmodium parasitaemia)
- Description: Natural logarithm of the asexual Plasmodium parasite count (parasites per microlitre of blood) at study admission. Time-fixed per subject (one value per subject, captured at enrolment).
- Units: log(parasites/uL)
- Type: continuous
- Scope: specific
-
Reference category: n/a – used with
linear-deviation forms
(1 + e * (LNPC - ref)). Reference values observed: 5.88 log(parasites/uL) (Birgersson 2019, population median in the pooled pregnant + non-pregnant Burkina Faso cohort). -
Source aliases: none formally; companion column
PARA(raw asexual parasite count per microlitre) is provided alongsideLNPCin the Birgersson 2019 NONMEM dataset but the model usesLNPC = log(PARA)as the active covariate. -
Example models:
Birgersson_2019_artesunate.R(linear-deviation effect on relative bioavailabilityF1:F1LNPC = 1 + e_lnpc_f * (LNPC - 5.88); positive coefficiente_lnpc_f = +0.138per unit increase in log-parasite-count, reflecting increased oral artesunate bioavailability with higher parasite burden). -
Notes: Disease-severity covariate specific to
malaria PK models. Higher parasitaemia is a marker of more severe acute
malaria infection and has been associated in the source publication with
altered oral bioavailability of artesunate (presumably via gut-mucosal /
first-pass effects of the febrile parasitised state). Scope: specific
because the canonical reference value (5.88) is the Birgersson 2019
cohort median; future malaria-in-pregnancy or malaria-in-children PK
models may legitimately reuse
LNPCbut should document their own cohort-specific reference value incovariateData[[LNPC]]$notes. Ratified canonically on 2026-05-07.
SARS_VLOAD (canonical for SARS-CoV-2 baseline viral load)
- Description: Baseline (pre-treatment) SARS-CoV-2 viral load measured from nasopharyngeal swab by RT-qPCR, reported as log10 RNA copies/mL.
- Units: log10 copies/mL
- Type: continuous
- Scope: specific
-
Reference category: n/a – used with power scaling
(SARS_VLOAD / ref)^exponent. Reference values observed: 6.4 log10 copies/mL (Lin 2024, median in pooled COVID-19 cohort). -
Source aliases:
-
VIRAL– used inLin_2024_casirivimab.R.
-
-
Example models:
Lin_2024_casirivimab.R(small negative exponent -0.0075 on CL). - Notes: SARS-CoV-2-specific. For non-infected subjects, the value is encoded as 0 in the source dataset (below assay detection); the population-PK exponent is small enough that this 0 is absorbed by the reference shift. Register a parallel canonical for any future paper that uses a different infection (e.g., RSV, influenza).
SARS_SEROPOS (canonical for SARS-CoV-2 baseline serostatus positive)
- Description: 1 = SARS-CoV-2 spike or nucleocapsid antibody positive at baseline (prior infection or prior vaccination), 0 = seronegative or other / unknown.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (seronegative; “Other” / unknown serostatus is typically pooled into the reference per the source paper’s analysis plan).
-
Source aliases:
-
SERPOS– used inLin_2024_casirivimab.R.
-
-
Example models:
Lin_2024_casirivimab.R(multiplicative fractional change on CL). -
Notes: SARS-CoV-2-specific. The exact assay
(anti-spike vs anti-nucleocapsid; vendor) varies by study; document
per-model in
covariateData[[SARS_SEROPOS]]$notes.
OXYSUP_LOW (canonical for low-flow supplemental oxygen indicator)
- Description: 1 = subject is receiving low-flow supplemental oxygen at baseline (e.g., nasal cannula, simple face mask), 0 = no supplemental oxygen at baseline.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (no supplemental oxygen at
baseline; the high-flow / mechanical-ventilation categories are encoded
by the parallel
OXYSUP_HIGHindicator). -
Source aliases:
-
OXYSTAT1– used inLin_2024_casirivimab.R.
-
-
Example models:
Lin_2024_casirivimab.R(multiplicative fractional change on CL; +10.6%). -
Notes: Decomposed indicator from a 4-level ordered
categorical (no oxygen / low-flow / high-flow / mechanical ventilation).
Use with the parallel
OXYSUP_HIGHindicator. Register a separateOXYSUP_VENTcanonical if a future analysis splits mechanical ventilation from high-flow oxygen.
OXYSUP_HIGH (canonical for high-flow supplemental oxygen indicator)
- Description: 1 = subject is receiving high-flow supplemental oxygen at baseline (high-flow nasal cannula, non-rebreather mask, non-invasive positive-pressure ventilation, OR mechanical ventilation pooled into the high-flow category), 0 = otherwise (no supplemental oxygen or low-flow).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (no supplemental oxygen at baseline; document whether the source paper pooled mechanical ventilation into this indicator or treated it separately).
-
Source aliases:
-
OXYSTAT2– used inLin_2024_casirivimab.R.
-
-
Example models:
Lin_2024_casirivimab.R(multiplicative fractional change on CL; +38.0%). -
Notes: Companion indicator to
OXYSUP_LOW. In Lin 2024 the rare mechanical-ventilation cases were pooled into the high-flow indicator (n = 24 across the 7598-subject dataset).
HIV_POS (canonical for HIV-positive comorbidity indicator)
- Description: 1 = HIV-1 antibody positive at study entry, 0 = HIV-negative. Time-fixed per subject. Used as a binary comorbidity indicator on PK parameters (typically bioavailability or clearance) when a study population pools HIV-positive and HIV-negative subjects on a non-HIV primary indication (tuberculosis treatment, hepatitis treatment, etc.).
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (HIV-negative).
-
Source aliases:
-
HIV– used inJonsson_2011_ethambutol.R(DDMODEL00000220 NMTRAN$INPUTcolumn with values 0 = HIV negative, 1 = HIV positive; same orientation as the canonical).
-
-
Example models:
Jonsson_2011_ethambutol.R(multiplicative1 + e_hiv_pos_f * HIV_POSshift on bioavailability; HIV-positive patients exhibit a 15.5% reduction in ethambutol bioavailability versus HIV-negative reference). -
Notes: Parallels the
_POSsuffix convention used byADA_POS,SARS_SEROPOS, and other serostatus / antibody-positivity indicators. Distinct from a primary disease-state indicator likeDIS_HIV(not yet registered) –HIV_POSis a comorbidity flag in non-HIV-primary indications where HIV-vs-non-HIV is tested as a PK covariate. Ratified canonically on 2026-05-06.
EARLY_ART (canonical for early-vs-delayed antiretroviral-treatment-initiation arm indicator)
- Description: Trial randomization-arm indicator: 1 = subject was randomized to initiate antiretroviral treatment (ART) early (within the first 14 days of admission, before nutritional recovery), 0 = subject was randomized to delayed ART initiation (after nutritional recovery, > 14 days from admission). Time-fixed per subject within the trial. The indicator captures the early-vs-delayed-ART contrast tested in the Archary 2019 / MATCH (Malnutrition and ART Timing in Children with HIV) trial in severely malnourished HIV-infected children; the early-ART arm exhibits ~31% higher abacavir bioavailability than the delayed-ART arm.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (delayed ART initiation; F = 1 typical-value reference).
-
Source aliases:
-
EARLY– Archary 2019 source description (“randomized to early ART”); same orientation as the canonical (1 = early arm).
-
-
Example models:
Archary_2019_abacavir.R(multiplicative additive shift on bioavailability F:f(depot) <- (1 + e_earlyart_f * EARLY_ART) * exp(etalfdepot)withe_earlyart_f = 0.31; the early-arm IIV on F is 21.4%, the delayed-arm IIV on F is fixed at 0 in the source – see model-file vignette Errata). -
Notes: Specific scope because the early/delayed
cutoff (14 days from admission) and the underlying
nutritional-rehabilitation context are tied to the MATCH-trial design;
future studies that test a similar early-vs-delayed contrast with a
different time cutoff or non-nutritional context should register a new
canonical (e.g.,
EARLY_ART_28D). Distinct fromTRT_PHASE(which gates active-vs-baseline study-phase contributions on a per-record basis) and fromDAY14(which is a within-subject time-since-treatment-initiation landmark, not an enrolment arm). Ratified canonically on 2026-05-08.
Oncology
RCFB1MAX (canonical for week-1 maximum across-lesion relative change from baseline in SUVmax)
-
Description: Per-subject scalar predictor entering
the overall-survival hazard in the Schindler 2016 sunitinib joint SUVmax
/ SLD / OS-TTE / dropout model. Defined as the maximum (across the
up-to-five tracked target lesions) of
(SUVmax(t = 168 h) - SUVmax(0)) / SUVmax(0), i.e., the most negative (largest reduction) relative change in[18F]FDG-PETstandardized uptake value at one week of sunitinib therapy. The OS Weibull hazard islambh * alphh * t^(alphh - 1) * exp(theta_pred * RCFB1MAX)withalphh = 1(degenerates to constant baseline hazard). - Units: unitless (relative change; typically negative for responders).
- Type: continuous
- Scope: specific
-
Reference category: n/a – entered directly into the
OS hazard exponent. Sign convention: more negative
RCFB1MAX(greater week-1 SUVmax suppression) reduces the OS hazard (Schindler 2016 reports a positive theta_pred = 5.36, soexp(5.36 * RCFB1MAX)is < 1 for negative RCFB1MAX). -
Source aliases:
-
RCFB1MAX(Schindler 2016 NONMEM$ERRORblock intermediate; “max relative change in SUVmax from baseline at week 1 across lesions”). Computed inline in the source.modfrom the on-the-fly SUVmax states at TIME = 168 h and reused in subsequent records.
-
-
Example models:
Schindler_2016_sunitinib.R(DDMODEL00000221). -
Notes: Specific scope because the metric is tied to
the Schindler 2016 GIST-on-sunitinib joint biomarker / OS analysis. In
the source NONMEM
.modRCFB1MAXis a record-loop state, not a true subject-level covariate – it is captured at FLAG = 1 / TIME = 168 h from the running SUVmax compartment values and reused on every subsequent record. nlmixr2 / rxode2 do not have an idiomatic equivalent of NONMEM’s record-loop persistent state, so the model file consumesRCFB1MAXas a per-subject input covariate; reproducing the source’s behavior requires a two-stage simulation (run the SUVmax + SLD ODEs first, computeRCFB1MAXper subject from the t = 168 h SUVmax values, then run the OS / dropout TTE arms withRCFB1MAXbound). The vignette virtual cohort follows this pattern.
TUMSZ (canonical for baseline tumor size)
- Description: Baseline tumor size. For solid tumors, the sum of diameters of target lesions per RECIST; for classical Hodgkin lymphoma and lymphoma generally, the sum of products of perpendicular diameters (SPPD) or the sum of linear diameters of target lesions, depending on the source paper.
-
Units: mm (for linear-diameter constructs); mm^2
(for SPPD constructs; record the per-model convention in
covariateData[[TUMSZ]]$unitsandnotes). - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(TUMSZ / ref)^exponentfor continuous effects, or with a paper-specific threshold for categorical-stratum indicators (e.g., Gibiansky 2014 splits BSIZ at 1750 mm^2 into low- vs high-burden strata). Reference values observed: 41 mm (Zhou 2025); 63 mm (Budha 2023); 90 mm (Lu 2014, source reference 9 cm converted to mm); 1750 mm^2 threshold (Gibiansky 2014, SPPD). -
Source aliases:
-
LDIAM(Zhou 2025; pediatric lymphoma “linear diameter” of target lesions in mm). -
TMBD(originally in cm;TUMSZ_mm = TMBD_cm * 10) – used inLu_2014_trastuzumabemtansine.R. -
BSIZ(Gibiansky 2014; baseline tumor size as the sum of products of perpendicular diameters of target lesions, mm^2; used as the categorical indicator(BSIZ <= 1750)in the obinutuzumab popPK model rather than as a continuous power covariate).
-
-
Example models:
Budha_2023_tislelizumab.R(reference 63 mm),Lu_2014_trastuzumabemtansine.R(reference 90 mm; source column TMBD in cm, values converted to mm on ingestion),Zhou_2025_brentuximab.R(reference 41 mm; source column LDIAM is the sum of linear diameters of target lesions; effect on ADC clearance only),Gibiansky_2014_obinutuzumab.R(SPPD in mm^2; used as a categorical indicator(TUMSZ <= 1750)on the time-dependent clearance decay rate kdes, not as a continuous power covariate),Hansson_2013b_sunitinib.R(DDMODEL00000198; observed baseline tumor SLD used as the per-subject IC of the tumor-size ODE via the IPP-style proportional baseline-residual constructiontumor(0) = TUMSZ * (1 + etaibase * propSd); the source .mod reads OBASE from DV at TIME=0/FLAG=4, but nlmixr2 / rxode2 cannot replicate the in-record assignment idiom and consumes the observed baseline as a covariate instead). -
Notes: Promoted to scope: general on 2026-04-20 as
a conventional oncology baseline-tumor-size measure (RECIST for solid
tumors, SPPD or sum-of-linear-diameters for lymphomas). The SPPD vs
sum-of-diameters vs sum-of-linear-diameters convention is pooled onto a
single column; document the per-model mixture where relevant. When the
source paper reports tumor size in cm, convert to mm (the canonical
unit) on data ingestion and scale the per-model reference accordingly so
(TUMSZ / ref)^expis numerically invariant. For SPPD constructs the natural unit is mm^2 (a product of two perpendicular diameters in mm); record that in the per-modelcovariateData[[TUMSZ]]$unitsfield and do NOT cross-mix mm and mm^2 within a single ingest. When a source paper specifically reports the RECIST 1.1 “sum of longest diameters” of target lesions, use the more specificTUM_SLDcanonical instead –TUMSZremains the pooled-tumor-burden register.
TUM_SLD (canonical for sum of longest diameters of target lesions)
-
Description: Baseline sum of longest diameters of
target lesions per RECIST 1.1. More specific than the pooled
TUMSZcanonical; useTUM_SLDwhen the source paper explicitly reports “sum of longest diameters” (or “sum of lesions”) as the tumor-burden metric, distinct from the pooled “sum of diameters / SPPD / sum of linear diameters” mixture covered byTUMSZ. - Units: mm
- Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(TUM_SLD / ref)^exponent. Reference values observed: 70.0 mm (de Vries Schultink 2020 zenocutuzumab population median). -
Source aliases:
-
SoL/ “sum of lesions” (de Vries Schultink 2020 zenocutuzumab) – same construct, mm.
-
-
Example models:
deVriesSchultink_2020_zenocutuzumab.R(reference 70.0 mm; power exponent 0.447 on Vmax of the parallel non-linear / Michaelis-Menten clearance). -
Notes: Distinct from
TUMSZ(pooled tumor-size canonical covering RECIST sum-of-diameters / SPPD / sum-of-linear-diameters);TUM_SLDis the precise RECIST 1.1 sum-of-longest-diameters metric. Ratified canonically on 2026-04-29 alongside the pilot bispecific extraction (de Vries Schultink 2020 zenocutuzumab). When the source paper reports tumor burden in cm, convert to mm (the canonical unit) on data ingestion and scale the per-model reference accordingly so(TUM_SLD / ref)^expis numerically invariant. Also used as the per-subject initial-condition input for tumour-growth / angiogenesis-inhibition (TGI) ODE models where the source paper sets the SLD state at time zero from the observed baseline SLD rather than estimating a typical-value baseline (e.g.,Ouerdani_2015_pazopanib.RusestumorSize(0) <- TUM_SLD).
TUM_VOL (canonical for caliper-measured baseline tumor volume in xenograft / preclinical studies)
-
Description: Baseline tumor volume measured by
handheld caliper in subcutaneous-xenograft preclinical studies and
similar small-animal models, computed from longest length and orthogonal
width via
volume = (length * width^2) / 2(the standard ellipsoid-approximation formula used in xenograft pharmacology). Per-subject (per-animal) baseline measurement at randomisation into a dosing group. Used both as a covariate stratifier (size at randomisation) and as the per-subject initial-condition input for TGI ODE models where the source paper sets the tumor-volume state at time zero from the observed measurement rather than estimating a typical-value baseline. -
Units:
mm^3 - Type: continuous
- Scope: general
-
Reference category: n/a – typically used directly
as the ODE initial state (
tumorSize(0) <- TUM_VOL); not a covariate effect coefficient. -
Source aliases:
-
P0(Ouerdani 2015 mouse model symbol for the observed initial tumour volume; mm^3 in the CAKI-2 xenograft cohort; range 100-250 mm^3 at randomisation per the paper’s preclinical Methods).
-
-
Example models:
Ouerdani_2015_pazopanib_mouse.R(Ouerdani 2015 preclinical TGI in CAKI-2 xenograft mice; per-subjectTUM_VOLinitialises thetumorSizestate and is held constant per individual across the 24-day dosing window). -
Notes: General scope because xenograft tumor-volume
baselines have a shared meaning across preclinical TGI papers regardless
of the drug or cell line. Use
TUM_VOLwhenever a preclinical paper supplies per-animal caliper tumor volumes as the per-subject initial state of a tumor-volume ODE; useTUM_SLDfor clinical RECIST sum-of-longest-diameters (a length, not a volume) andTUMSZfor the pooled “baseline tumor burden as a covariate on PK” use case in clinical models. Ratified canonically on 2026-05-12 alongside the Ouerdani 2015 pazopanib mouse extraction.
TUMTP_CHL (canonical for classical Hodgkin lymphoma tumor-type indicator)
- Description: 1 = classical Hodgkin lymphoma (cHL) or Hodgkin lymphoma generally, 0 = other tumor types.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 = all other tumor types (e.g., NSCLC, EC, HCC, UC, GC, CRC, NPC, OC, “Other” solid tumors in the Budha 2023 cohort; systemic anaplastic large-cell lymphoma in the Zhou 2025 pediatric cohort).
-
Source aliases:
-
TUMTP(categorical column with levels likecHL,GC, …) – decompose intoTUMTP_CHL = as.integer(TUMTP == "cHL"). -
DIS(Zhou 2025; integer code withDIS == 1flagging HL) – decompose intoTUMTP_CHL = as.integer(DIS == 1). Zhou 2025 calls the complement “non-HL”; in the Zhou 2025 cohort the non-HL group is exclusively sALCL.
-
-
Example models:
Budha_2023_tislelizumab.R,Zhou_2025_brentuximab.R(effects on ADC Q2, MMAE central volume VM, and the ADC->MMAE conversion-decay rate ALFM; the Zhou 2025 paper anchors typical-value parameters to HL patients so the model uses(1 - TUMTP_CHL)as the on-effect indicator with reference category 1 = HL). -
Notes: Paired with
TUMTP_GCin Budha 2023; a patient can have at most one of the indicators set to 1 (the remaining tumor types collapse into the reference 0 group). The reference category is the off-encoded value (0) by definition; when a source paper anchors typical-value parameters to the HL group rather than the non-HL group (as Zhou 2025 does), encode the effect ascoef^(1 - TUMTP_CHL)so the canonical column meaning (1 = cHL/HL) is preserved while the paper’s reference (HL) still receives multiplier 1.
HER2_ECD (canonical for HER2 shed extracellular domain concentration)
- Description: Baseline serum concentration of the shed extracellular domain of human epidermal growth factor receptor 2 (HER2). Serves as a soluble-antigen biomarker of HER2-mediated target-mediated drug disposition for HER2-directed mAbs / ADCs.
- Units: ng/mL
- Type: continuous
- Scope: specific
-
Reference category: n/a – used with power scaling
(HER2_ECD / ref)^exponent. Reference 25 ng/mL used in Lu 2014; reference 8.23 ng/mL (population median) used in Bruno 2005, with a 200 ng/mL plateau cap. -
Source aliases:
-
ECD– used inLu_2014_trastuzumabemtansine.RandBruno_2005_trastuzumab.R.
-
-
Example models:
Lu_2014_trastuzumabemtansine.R(reference 25 ng/mL; exponent 0.035 on CL),Bruno_2005_trastuzumab.R(reference 8.23 ng/mL; power exponents 0.041 on CL and 0.105 on V; HER2_ECD capped at 200 ng/mL inside model() to reflect the Bruno 2005 plateau observation). -
Notes: Scoped specific because the covariate is
meaningful only for HER2-targeted agents; if a non-HER2 paper uses a
shed-antigen analog for a different target, register a target-specific
canonical (e.g.,
EGFR_ECD) rather than reusing this one. Disambiguated from the covariate-columns register by the explicitHER2_prefix.
TRAST_BL (canonical for baseline trastuzumab concentration from prior therapy)
- Description: Baseline serum concentration of residual unconjugated trastuzumab remaining from prior trastuzumab-containing therapy, measured at the start of a subsequent anti-HER2 treatment (e.g., trastuzumab emtansine). Encodes the magnitude of residual HER2-site competition from a previous trastuzumab exposure.
- Units: ug/mL
- Type: continuous
- Scope: specific
-
Reference category: n/a – used as a linear
covariate on log(CL) via
exp(coef * TRAST_BL). TRAST_BL = 0 corresponds to no detectable residual trastuzumab (reference condition used in Lu 2014). -
Source aliases:
-
TBL– used inLu_2014_trastuzumabemtansine.R.
-
-
Example models:
Lu_2014_trastuzumabemtansine.R(linear-on-log coefficient -0.002 per ug/mL on CL). - Notes: Scoped specific because the covariate is meaningful only for drugs that compete with trastuzumab at the HER2 binding site. Expect values clustered at 0 for trastuzumab-naive patients; Lu 2014 observed 0 at the 5th percentile and 54 ug/mL at the 95th percentile.
TUMTP_GC (canonical for gastric-cancer tumor-type indicator)
- Description: 1 = gastric cancer (GC) or adenocarcinoma of the gastroesophageal junction (GEJ), 0 = other tumor types.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 = all other tumor types (same
reference group as
TUMTP_CHL). -
Source aliases:
-
TUMTP(categorical column) – decompose intoTUMTP_GC = as.integer(TUMTP == "GC"). -
TTYPE(Quartino 2019; categorical column with levelsMBC,EBC,HV,AGC,Others) – decompose intoTUMTP_GC = as.integer(TTYPE == "AGC"). -
TTYPE4(Wang 2024; level 4 of a five-level tumor-type factor labelled “GCGEJ” in the source) – decompose intoTUMTP_GC = as.integer(TTYPE4 == 1).
-
-
Example models:
Budha_2023_tislelizumab.R,Quartino_2019_trastuzumab.R(advanced gastric cancer; per-group typical-value switch on linear CL and Vc rather than an exponential multiplier),Wang_2024_sugemalimab.R(gastric + GEJ adenocarcinoma pooled; exponential coefficient log(1.13) on CL and log(1.14) on Vc). -
Notes: Follows the
RACE_<GROUP>indicator-decomposition pattern. New oncology tumor types should be added as additionalTUMTP_<GROUP>entries so the reference set stays explicit. “Advanced gastric cancer” (AGC), “gastric cancer” (GC), and “GC or adenocarcinoma of the gastroesophageal junction” (GCGEJ) are pooled onto a singleTUMTP_GCindicator; document the per-paper stage-of-disease and GEJ-inclusion detail incovariateData[[TUMTP_GC]]$notes. ESCC (squamous histology) is captured by the separateTUMTP_ESCCindicator and is not pooled here.
TUMTP_OTH (canonical for ‘other tumor types’ residual indicator)
- Description: 1 = heterogeneous “other” tumor-type pool (typically NSCLC plus miscellaneous solid tumors such as prostate, ovarian, and colorectal), 0 = one of the named tumor-type groups in the same analysis.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 = named tumor-type groups
defined per-paper (e.g., MBC, EBC, HV, AGC in Quartino 2019). The
complement of all
TUMTP_<GROUP>indicators defined in the same model. -
Source aliases:
-
TTYPE(Quartino 2019) – decompose intoTUMTP_OTH = as.integer(TTYPE == "Others"). -
PAT2(Sathe 2024) – integer-coded tumor type column with levels 1 (mTNBC), 2 (mUC or HR+/HER2- mBC), 4 (Other epithelial); the source NONMEM control stream collapses PAT2 = 1 and PAT2 = 2 into the reference (no effect) and applies the deviation only when PAT2 = 4. Decompose intoTUMTP_OTH = as.integer(PAT2 == 4).
-
-
Example models:
Quartino_2019_trastuzumab.R(per-group typical-value switch on linear CL; NSCLC plus a small residual group of prostate, ovarian, and other histologies),Sathe_2024_sacituzumab.R(multiplicative effect on tAB CL: -13.4% when TUMTP_OTH = 1; “Other” pool = small-cell and non-small-cell lung cancer, colorectal cancer, esophageal cancer, pancreatic ductal adenocarcinoma, etc., n = 184; reference = pooled mTNBC + mUC + HR+/HER2- mBC, n = 345). -
Notes: Scope: specific because the set of
histologies collapsed into “Others” is defined by the analysis plan of
the source paper; two papers’
TUMTP_OTHcolumns are not interchangeable. Document the exact per-paper composition (e.g., “NSCLC + prostate + ovarian + other, n = 107 in Quartino 2019”; “small-cell + non-small-cell lung + CRC + esophageal + pancreatic ductal adenocarcinoma, n = 184 in Sathe 2024”) incovariateData[[TUMTP_OTH]]$notes. A given subject can have at most one of theTUMTP_<GROUP>indicators (includingTUMTP_OTH) set to 1; all-zero means the reference group.
SPDL1 (canonical for soluble PD-L1 concentration)
- Description: Baseline (or time-varying) serum concentration of soluble programmed death-ligand 1 (sPD-L1). Serves as a circulating biomarker of target burden and immune activation for anti-PD-1/PD-L1 antibodies.
- Units: pg/mL
- Type: continuous
- Scope: specific
-
Reference category: n/a – used with power scaling
(SPDL1 / ref)^exponent. Reference value observed: 173.8 pg/mL (study-population median in Ogasawara 2020). -
Source aliases: none;
SPDL1is the standard abbreviation used directly in source analyses. -
Example models:
Ogasawara_2020_durvalumab.R(power effect on CL, exponent 0.0617, reference 173.8 pg/mL; time-varying; values below LOD imputed as LOD/2 = 33.55 pg/mL),Quartino_2019_trastuzumab.R(per-group typical-value switch on linear CL; NSCLC plus a small residual group of prostate, ovarian, and other histologies),Wang_2024_sugemalimab.R(heterogeneous solid-tumor residual group of n = 174; exponential coefficient log(0.885) on CL and log(0.926) on Vc; NSCLC is the reference group, not part ofTUMTP_OTH). -
Notes: Scope: specific because sPD-L1 is meaningful
only for drugs targeting the PD-1/PD-L1 pathway. For other checkpoint
biomarkers (e.g., soluble CTLA-4, soluble LAG-3) register new dedicated
canonicals rather than reusing this one. Ratified canonically on
2026-04-26.
-
TTYPE3(Wang 2024; level 3 of a five-level tumor-type factor labelled “Other” in the source) – decompose intoTUMTP_OTH = as.integer(TTYPE3 == 1).
-
-
Example models:
Quartino_2019_trastuzumab.R(per-group typical-value switch on linear CL; NSCLC plus a small residual group of prostate, ovarian, and other histologies),Wang_2024_sugemalimab.R(heterogeneous solid-tumor residual group of n = 174; exponential coefficient log(0.885) on CL and log(0.926) on Vc; NSCLC is the reference group, not part ofTUMTP_OTH),Ogasawara_2020_durvalumab.R(power effect on CL, exponent 0.0617, reference 173.8 pg/mL; time-varying; values below LOD imputed as LOD/2 = 33.55 pg/mL). -
Notes: Scope: specific because the set of
histologies collapsed into “Others” is defined by the analysis plan of
the source paper; two papers’
TUMTP_OTHcolumns are not interchangeable. Document the exact per-paper composition (e.g., “NSCLC + prostate + ovarian + other, n = 107 in Quartino 2019”; “miscellaneous solid tumors excluding NSCLC, lymphoma, GCGEJ, and ESCC, n = 174 in Wang 2024”) incovariateData[[TUMTP_OTH]]$notes. A given subject can have at most one of theTUMTP_<GROUP>indicators (includingTUMTP_OTH) set to 1; all-zero means the reference group.
MCPROT (canonical for serum monoclonal (M) protein concentration)
- Description: Serum monoclonal (M) protein concentration. Multiple-myeloma plasma-cell-burden marker secreted by the tumor clone; elevated MCPROT reflects higher tumor burden and (for IgG-secreting MM) competes with therapeutic IgG mAbs for FcRn-mediated salvage and target-mediated elimination. Typically time-varying – measured at multiple visits over the treatment course and supplied at every PK observation time via linear interpolation between measurements.
-
Units: g/dL (US-convention; equivalent to 10 g/L
SI). Document the unit used in each model via
covariateData[[MCPROT]]$units. - Type: continuous
- Scope: specific
-
Reference category: n/a – used as a continuous,
log-linear effect on the Vmax of target-mediated elimination via
exp(theta * MCPROT)(i.e., MCPROT enters un-log-transformed). Reference values observed: 0 g/dL (Ide 2020 Vmax,REF reference) and 2.0 g/dL (Ide 2020 figure-1 reference patient). -
Source aliases:
-
TMCPROT(time-varying serum M-protein concentration) – used inIde_2020_elotuzumab.R. NONMEM column with imputation sentinel-99for missing observations, replaced by population median 2.1 g/dL viaIF(TMCPROT.EQ.-99) TMCPROT = 2.1.
-
-
Example models:
Ide_2020_elotuzumab.R(g/dL, time-varying; entered un-log-transformed asexp(0.277 * MCPROT)on Vmax of the Michaelis-Menten target-mediated elimination from the central compartment). -
Notes: Specific scope because the column is
mechanistically meaningful only for plasma-cell-targeting therapies in
multiple myeloma (e.g., elotuzumab anti-SLAMF7, daratumumab anti-CD38,
isatuximab anti-CD38, belantamab anti-BCMA, and BCMA-bispecifics /
CAR-T). MCPROT decreases with treatment response; the time-varying form
is the only correct way to capture the diminishing
target-mediated-elimination component as the tumor regresses. In NONMEM
datasets MCPROT is supplied at each event-row time, with linear
interpolation between observations and last-observation-carried-forward
beyond the last sample (Ide 2020 Methods). Distinct from
MM_NIGG(which is the immunoglobulin subtype, an MM-disease stratifier that is time-fixed),SBCMA(soluble BCMA, a different MM tumor-burden biomarker for BCMA-targeting drugs), andB2M(beta-2-microglobulin, a renal-function-and-MM-disease-burden marker). The 1 g/dL = 10 g/L conversion lets future SI-convention papers register the same canonical with their own unit string.
LMET (canonical for baseline presence of liver metastases)
- Description: Binary indicator of radiologically documented liver metastases at baseline, 1 = liver metastases present, 0 = no liver metastases.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no liver metastases at baseline).
-
Source aliases: none;
LMETis the common NONMEM / clinical-PK abbreviation used directly by the source papers. -
Example models:
Quartino_2019_trastuzumab.R(exponential effect on linear CL; +16.4% typical CL when LMET = 1, per Quartino 2019 Table 1 theta12 = 0.152). -
Notes: Liver metastases are associated with hepatic
protein-synthesis impairment and altered IgG catabolism, making
LMETa commonly tested covariate in oncology mAb population PK analyses. Scope: general so future oncology papers can reuse the canonical column. Time-fixed baseline indicator; if a source paper treats it as time-varying (progression during treatment), document incovariateData[[LMET]]$notes.
MET_GE4 (canonical for baseline number of metastatic sites >= 4 indicator)
- Description: Binary indicator dichotomising the count of baseline metastatic sites at 4, 1 = patient has four or more documented metastatic sites at baseline, 0 = patient has zero to three metastatic sites at baseline. Time-fixed per subject. Treated as a surrogate for tumor burden in oncology mAb popPK analyses.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (fewer than four metastatic sites at baseline).
-
Source aliases:
-
MET– used inBruno_2005_trastuzumab.R(Bruno 2005 Methods: “MET=1 if number of metastatic sites=4 or greater; otherwise MET=0”). When a source paper supplies the raw integer count column (NMET,N_METS, etc.) rather than the pre-binarised indicator, deriveMET_GE4 = as.integer(NMET >= 4).
-
-
Example models:
Bruno_2005_trastuzumab.R(multiplicative effect on linear CL: typical CL multiplied by(1 + 0.221 * MET_GE4), i.e. +22.1% CL in MET_GE4 = 1 patients; reference MET_GE4 = 0 per Bruno 2005 Table 3). -
Notes: The >= 4 split is the dichotomisation
used by Bruno 2005 to capture “patients with four or more metastatic
sites” as a high-tumor-burden subgroup. Scope: general so future
oncology popPK papers using the same dichotomisation can reuse this
canonical column. If a future paper uses a different split (e.g. >= 2
or >= 5), register a separate
MET_GENcanonical rather than overloading this entry. AMET_GE4 = 1patient may also haveLMET = 1; the two columns are not mutually exclusive (liver-only metastases would haveLMET = 1,MET_GE4 = 0).
ECOG_GE1 (canonical for Eastern Cooperative Oncology Group performance-status indicator, >= 1)
- Description: 1 if baseline Eastern Cooperative Oncology Group (ECOG) performance status is greater than or equal to 1, 0 if ECOG = 0. Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (ECOG performance status = 0, i.e., fully active / asymptomatic).
-
Source aliases:
-
PS/BPS– used inBajaj_2017_nivolumab.R(BPS = “baseline performance status”; the one study using Karnofsky Performance Status was mapped to ECOG via Oken 1982 before binarization) andZhang_2019_nivolumab.R(paper’s binary collapse PS=0 vs. PS>0). -
ECOG_1– alternative explicit form; equivalent toECOG_GE1when ECOG only takes values 0, 1, 2 in the analysis dataset (the typical oncology case). -
ECOG_PS_GT0– retired name used in earlier register drafts; semantically identical (>= 1equals> 0for integer ECOG scores). -
ECOG101(categorical 0/1/2 score with thresholdingIF(ECOG101.GT.0.5)) – used inIde_2020_elotuzumab.R. Decompose:ECOG_GE1 = as.integer(ECOG101 >= 1).
-
-
Example models:
Bajaj_2017_nivolumab.R(exponential effect on CL with coefficient 0.172),Zhang_2019_nivolumab.R(exponential effect exp(0.181) on baseline CL; additive effect -0.138 on the time-varying-CL Emax parameter),Ide_2020_elotuzumab.R(multiplicative effect on CL = 1.03; paired withECOG_GE2for separate ECOG=1 vs ECOG>=2 effects),Netterberg_2017_docetaxel.R(multiplicative effect on baseline ANC of the Friberg myelosuppression chain:BACOV *= (1 + theta * ECOG_GE1)with theta = 0.130; source columnPERFwith ordinal ECOG 0/1/2 values, binarized viaECOG_GE1 = as.integer(PERF >= 1)per Kloft 2006). -
Notes: Oncology papers conventionally report ECOG
as an integer (0-5) but binarize at >= 1 because ECOG >= 2 is rare
in trial cohorts. When a source paper provides the ordinal ECOG score
separately, derive
ECOG_GE1 = as.integer(ECOG >= 1). Zhang 2019 usesECOG_GE1on both baseline CL and the time-varying Emax parameter (unlike Bajaj 2017, which uses it on CL only); document the structural role in each model’scovariateData[[ECOG_GE1]]$notes. When a paper retains separate effects for ECOG = 1 vs ECOG >= 2 (Ide 2020), pair this column withECOG_GE2and supply both indicators in the event dataset.
ECOG_GE2 (canonical for Eastern Cooperative Oncology Group performance-status indicator, >= 2)
-
Description: 1 if baseline Eastern Cooperative
Oncology Group (ECOG) performance status is greater than or equal to 2,
0 if ECOG <= 1. Time-fixed per subject. Used in models that retain
separate effects for ECOG = 1 vs ECOG >= 2 by pairing this column
with
ECOG_GE1. - Units: (binary)
- Type: binary
- Scope: general
-
Reference category: 0 (ECOG performance status
<= 1; in models that pair
ECOG_GE1andECOG_GE2, both indicators = 0 corresponds to ECOG = 0 and (ECOG_GE1= 1,ECOG_GE2= 0) corresponds to ECOG = 1). -
Source aliases:
-
ECOG101(categorical 0/1/2 score with thresholdingIF(ECOG101.GT.1.5)) – used inIde_2020_elotuzumab.R. Decompose:ECOG_GE2 = as.integer(ECOG101 >= 2).
-
-
Example models:
Ide_2020_elotuzumab.R(multiplicative effect on CL = 1.15; paired withECOG_GE1to retain separate ECOG = 1 vs ECOG >= 2 effects). -
Notes: Parallels
ECOG_GE1. Use only when the source paper reports a separate effect for ECOG >= 2 in addition to ECOG_GE1; otherwiseECOG_GE1alone is sufficient. The paired (ECOG_GE1,ECOG_GE2) decomposition reproduces a three-level (ECOG = 0,ECOG = 1,ECOG >= 2) ordinal effect with two binaries.
TUMTP_SCLC (canonical for small-cell-lung-cancer tumor-type indicator)
- Description: 1 = small cell lung cancer (SCLC), 0 = other tumor types.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 = all other tumor types (e.g., melanoma, NSCLC, RCC, HCC, CRC in the Sanghavi 2020 cohort; reference category is melanoma).
-
Source aliases:
-
TUMTP(categorical column with levels includingmelanoma,NSCLC,SCLC,CRC,HCC,RCC) – decompose intoTUMTP_SCLC = as.integer(TUMTP == "SCLC").
-
-
Example models:
Sanghavi_2020_ipilimumab.R(exponential coefficient -0.124 on CL). -
Notes: Follows the
TUMTP_CHL/TUMTP_GCdecomposition pattern. SCLC is the only retained tumor-type indicator in the Sanghavi 2020 final model after backward elimination; the other tumor types collapse into the reference (melanoma) group.
TUMTP_NSCLC (canonical for non-small-cell-lung-cancer tumor-type indicator)
- Description: 1 = non-small cell lung cancer (NSCLC), 0 = other tumor types.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 = all other tumor types (in Ahamadi 2017 the implicit reference is melanoma, with the rare “other” cancer type pooled into the reference).
-
Source aliases:
-
TUMTP(categorical column with levels includingmelanoma,NSCLC,other) – decompose intoTUMTP_NSCLC = as.integer(TUMTP == "NSCLC").
-
-
Example models:
Ahamadi_2017_pembrolizumab.R(proportional change of +14.5% on CL for NSCLC patients relative to melanoma; the “other” cancer type cohort – 1.01% of the population – is pooled into the melanoma reference per the paper’s model description). -
Notes: Follows the
TUMTP_CHL/TUMTP_GC/TUMTP_SCLCdecomposition pattern. Scope: general because NSCLC is a high-frequency tumor-type contrast (with melanoma or “other” reference) across PD-1 / PD-L1 / chemotherapy popPK analyses, and is likely to recur in future extractions. Ratified canonically on 2026-05-17 alongside the Ahamadi 2017 pembrolizumab extraction.
TUMTP_LYMPH (canonical for lymphoma (pooled) tumor-type indicator)
- Description: 1 = lymphoma (heterogeneous lymphoma pool spanning multiple lymphoma histologies – e.g., classical Hodgkin lymphoma combined with extranodal NK/T-cell lymphoma; or any-histology lymphoma pooled with solid-tumor and leukemia cohorts), 0 = solid tumor or other tumor type.
- Units: (binary)
- Type: binary
- Scope: general
-
Reference category: 0 = non-lymphoma tumor type
(per source paper; e.g., NSCLC, GC/GEJ, ESCC, “Other” solid tumors in
the Wang 2024 cohort, with NSCLC as the implicit reference when paired
with the other Wang 2024
TUMTP_*indicators; or leukemia as the implicit reference in the Akbar 2025 cohort). -
Source aliases:
-
TTYPE1(Wang 2024) – decompose intoTUMTP_LYMPH = as.integer(TTYPE1 == 1). The Wang 2024 source paper uses a multi-levelTTYPEfactor with levels 1 = lymphoma, 2 = lung cancer (reference), 3 = other, 4 = GCGEJ, 5 = ESCC. - Categorical column “type of cancer” with level “Lymphoma” (Akbar
2025) – decompose into
TUMTP_LYMPH = as.integer(cancer_type == "Lymphoma").
-
-
Example models:
Wang_2024_sugemalimab.R(exponential coefficient log(0.877) on baseline CL and log(0.879) on Vc),Akbar_2025_voriconazole.R(additive-fractional +1.91% effect on CL relative to leukemia reference; 95% CI spans zero). -
Notes: Distinct from
TUMTP_CHL(which is specifically classical Hodgkin lymphoma). Wang 2024 pools two lymphoma histologies (extranodal NK/T-cell lymphoma from CS1001-201 / NCT03595657 and classical Hodgkin lymphoma from CS1001-202 / NCT03505996) into a single lymphoma indicator; the indicator therefore captures a generic “hematologic-vs-solid-tumor” contrast rather than a histology-specific effect. Akbar 2025 uses a single “Lymphoma” category alongside leukemia, sarcoma, breast cancer, myeloma, and glioma in a heterogeneous-cancer TDM cohort. When a future paper studies a single lymphoma histology distinct from cHL, register a more specific canonical (e.g.,TUMTP_ENKTL,TUMTP_NHL) rather than overloading this one. Document the per-paper histology composition incovariateData[[TUMTP_LYMPH]]$notes. Promoted fromScope: specifictoScope: generalon 2026-05-09 alongside the Akbar 2025 voriconazole extraction so that any heterogeneous-cancer-cohort PK analysis can use this canonical name without scope-violation.
TUMTP_BC (canonical for breast-cancer tumor-type indicator)
- Description: 1 = breast cancer (any histology / receptor status), 0 = other tumor types.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 = all other tumor types (per source paper; in Lu 2022 the implicit reference is NSCLC, with colorectal cancer pooled into the reference because its CL effect was found insignificant relative to NSCLC).
-
Source aliases:
-
TUMTP(categorical column with levels includingBC,NSCLC,CRC) – decompose intoTUMTP_BC = as.integer(TUMTP == "BC").
-
-
Example models:
Lu_2022_patritumab.R(multiplicative fractional effect 0.811 on CLlin of DXd-conjugated antibody for breast-cancer patients vs the NSCLC reference; CRC effect was tested and found insignificant so CRC is pooled into the reference). -
Notes: Follows the
TUMTP_CHL/TUMTP_GC/TUMTP_SCLCdecomposition pattern. Registers the breast-cancer arm of an oncology-cohort tumor-type contrast; pair with sisterTUMTP_<GROUP>indicators (e.g.,TUMTP_NSCLC,TUMTP_CRC) when a future paper retains separate effects for additional tumor types beyond the implicit reference. Ratified canonically on 2026-04-28.
TUMTP_ESCC (canonical for oesophageal-squamous-cell-carcinoma tumor-type indicator)
- Description: 1 = oesophageal squamous cell carcinoma (ESCC), 0 = other tumor types.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 = all other tumor types (per
source paper; in Wang 2024, the implicit reference is NSCLC when all the
other
TUMTP_*indicators are also 0). -
Source aliases:
-
TTYPE5(Wang 2024) – decompose intoTUMTP_ESCC = as.integer(TTYPE5 == 1).
-
-
Example models:
Wang_2024_sugemalimab.R(exponential coefficient log(0.99) on baseline CL and log(1.08) on Vc). -
Notes: Follows the
TUMTP_CHL/TUMTP_GC/TUMTP_SCLCdecomposition pattern. Distinct from gastroesophageal-junction adenocarcinoma (which is captured by the broaderTUMTP_GCindicator that pools GC and GEJ adenocarcinomas) – ESCC is a squamous-cell histology, not adenocarcinoma. Document the per-paper histology composition incovariateData[[TUMTP_ESCC]]$notes.
TUMTP_PCALCL (canonical for primary cutaneous anaplastic large-cell lymphoma indicator)
- Description: 1 = primary cutaneous anaplastic large-cell lymphoma (pcALCL), 0 = other tumor types.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 = all other tumor types (in Suri 2018, the non-pcALCL reference comprises Hodgkin lymphoma, systemic ALCL, mycosis fungoides, and other CD30+ hematologic malignancies pooled together).
-
Source aliases:
-
PCALCL– used inSuri_2018_brentuximab.R. Suri 2018 reports the effect as a power-form multipliercl_adc *= 0.728^TUMTP_PCALCL(pcALCL CL ~27% lower than non-pcALCL).
-
-
Example models:
Suri_2018_brentuximab.R(effect on ADC clearance only). -
Notes: Follows the
TUMTP_CHL/TUMTP_GC/TUMTP_SCLCdecomposition pattern. Distinct fromTUMTP_LYMPH(heterogeneous lymphoma pool) andTUMTP_CHL(classical Hodgkin lymphoma). pcALCL is one of two histologies pooled into the broader CTCL category in Suri 2018 (alongside mycosis fungoides); the model singles out pcALCL because Suri 2018 backward elimination retained pcALCL as a separate effect on ADC clearance after exploring the broader CTCL contrast. Ratified canonically on 2026-04-28.
TUMTP_SARC (canonical for sarcoma tumor-type indicator)
- Description: 1 = sarcoma (any histology – soft-tissue or bone sarcoma pooled), 0 = other tumor types.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 = all other tumor types (per
source paper; in Akbar 2025 the implicit reference is leukemia when
paired with the other Akbar
TUMTP_*indicators all = 0). -
Source aliases:
- Categorical column “type of cancer” with level “Sarcoma” – decompose
into
TUMTP_SARC = as.integer(cancer_type == "Sarcoma"). Used inAkbar_2025_voriconazole.R.
- Categorical column “type of cancer” with level “Sarcoma” – decompose
into
-
Example models:
Akbar_2025_voriconazole.R(additive-fractional +18.5% effect on CL relative to leukemia reference). -
Notes: Follows the
TUMTP_CHL/TUMTP_GC/TUMTP_SCLCdecomposition pattern. Akbar 2025 pools soft-tissue and bone sarcoma histologies into a single sarcoma category. Scope: specific because the reference category (leukemia in Akbar 2025) is paper-defined. Ratified canonically on 2026-05-09.
TUMTP_MYELO (canonical for multiple myeloma tumor-type indicator (used in heterogeneous-cancer pooled cohorts))
- Description: 1 = multiple myeloma, 0 = other tumor types.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 = all other tumor types (per
source paper; in Akbar 2025 the implicit reference is leukemia when
paired with the other Akbar
TUMTP_*indicators all = 0). -
Source aliases:
- Categorical column “type of cancer” with level “Myeloma” – decompose
into
TUMTP_MYELO = as.integer(cancer_type == "Myeloma"). Used inAkbar_2025_voriconazole.R.
- Categorical column “type of cancer” with level “Myeloma” – decompose
into
-
Example models:
Akbar_2025_voriconazole.R(additive-fractional -2.33% effect on CL relative to leukemia reference; the 95% CI spans zero). -
Notes: Distinct from the stub
MMentry (which is reserved for multiple-myeloma-as-primary-indication PK studies; theMMdefinition lacks a complete schema and predates the TUMTP_* convention) and fromDIS_SMM(smoldering multiple myeloma, an asymptomatic plasma-cell disorder). UseTUMTP_MYELOwhen the source paper pools multiple myeloma alongside other tumor types in a heterogeneous oncology cohort and treatscancer typeas a many-level categorical covariate. Scope: specific because the reference category (leukemia in Akbar 2025) is paper-defined. Ratified canonically on 2026-05-09.
TUMTP_GLIO (canonical for glioma tumor-type indicator)
- Description: 1 = glioma (any grade / histology – e.g., glioblastoma, anaplastic astrocytoma, oligodendroglioma pooled), 0 = other tumor types.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 = all other tumor types (per
source paper; in Akbar 2025 the implicit reference is leukemia when
paired with the other Akbar
TUMTP_*indicators all = 0). -
Source aliases:
- Categorical column “type of cancer” with level “Glioma” – decompose
into
TUMTP_GLIO = as.integer(cancer_type == "Glioma"). Used inAkbar_2025_voriconazole.R.
- Categorical column “type of cancer” with level “Glioma” – decompose
into
-
Example models:
Akbar_2025_voriconazole.R(additive-fractional +8.81% effect on CL relative to leukemia reference; the 95% CI spans zero). -
Notes: Follows the
TUMTP_CHL/TUMTP_GC/TUMTP_SCLCdecomposition pattern. Akbar 2025 reports a single “glioma” category without further subdivision by histology or grade. Scope: specific because the reference category (leukemia in Akbar 2025) is paper-defined. Ratified canonically on 2026-05-09.
TUMTP_LEUK (canonical for leukemia tumor-type indicator)
- Description: 1 = leukemia (any subtype – AML / ALL / CLL / CML pooled), 0 = other tumor types. Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 = all other tumor types (per source paper). In Akbar 2025 leukemia is the implicit reference (column not used directly), but the canonical name is registered so that future papers that retain leukemia as a non-reference contrast can use it.
-
Source aliases:
- Categorical column “type of cancer” with level “Leukemia” –
decompose into
TUMTP_LEUK = as.integer(cancer_type == "Leukemia"). Implicit reference category inAkbar_2025_voriconazole.R(so the model file does not consume this column directly; it is registered for future heterogeneous-cancer-cohort analyses).
- Categorical column “type of cancer” with level “Leukemia” –
decompose into
-
Example models:
Akbar_2025_voriconazole.R. -
Notes: Distinct from the more specific
DIS_AML,DIS_BCPALL,DIS_CMML,MDSAMLentries – those are for leukemia-only or leukemia-vs-leukemia contrasts;TUMTP_LEUKis for heterogeneous-cancer pooled cohorts where leukemia is one of several tumor types and the analysis treatscancer typeas a many-level categorical. Akbar 2025 had leukemia as 56.8% of the cohort and used it as the reference category. Scope: specific because the reference category in any source paper is paper-defined. Ratified canonically on 2026-05-09.
TUMTP_BCL (canonical for B-cell lymphoma (pooled residual) tumor-type indicator)
- Description: 1 = B-cell lymphoma (BCL), 0 = other tumor types. Time-fixed per subject. In Gibiansky 2014 the BCL category is a pooled residual indolent-B-cell-lymphoma group that includes follicular lymphoma (FL was the primary indication in GAUDI; the four-level DIS column in the NONMEM control stream splits B-cell histologies into CLL = 1, BCL = 2 (residual indolent B-cell-lymphoma pool including FL), DLBCL = 3, MCL = 4).
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 = all other tumor types (in
Gibiansky 2014 the implicit reference is CLL when paired with
TUMTP_DLBCLandTUMTP_MCLall = 0; the residual indolent-B-cell-lymphoma pool in this paper is dominated by follicular lymphoma). -
Source aliases:
-
DIS(Gibiansky 2014; integer code withDIS == 2flagging BCL) – decompose intoTUMTP_BCL = as.integer(DIS == 2).
-
-
Example models:
Gibiansky_2014_obinutuzumab.R(effect on time-dependent clearance decay rate kdes via the composite(TUMTP_BCL + TUMTP_DLBCL + TUMTP_MCL)(any-NHL effect; ratio 2.08) and on both time-dependent CL_T and steady-state CL_inf via the composite(TUMTP_BCL + TUMTP_DLBCL)(shared BCL/DLBCL effect; ratio 0.834 in the reverse direction, i.e., 16.6% lower CL than CLL)). -
Notes: Follows the
TUMTP_CHL/TUMTP_GC/TUMTP_SCLCdecomposition pattern. Distinct fromTUMTP_LYMPH(a broader heterogeneous lymphoma pool that lumps cHL with NHL histologies) –TUMTP_BCLis specifically B-cell lymphoma and pairs with siblingTUMTP_DLBCLandTUMTP_MCLfor histology-specific contrasts within the NHL family. When a future paper studies follicular lymphoma in isolation (rather than pooled into BCL), register a more specific canonical (e.g.,TUMTP_FL) rather than overloading this one. Ratified canonically on 2026-05-11.
TUMTP_DLBCL (canonical for diffuse large B-cell lymphoma indicator)
- Description: 1 = diffuse large B-cell lymphoma (DLBCL), 0 = other tumor types. Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 = all other tumor types (in
Gibiansky 2014 the implicit reference is CLL when paired with
TUMTP_BCLandTUMTP_MCLall = 0). -
Source aliases:
-
DIS(Gibiansky 2014; integer code withDIS == 3flagging DLBCL) – decompose intoTUMTP_DLBCL = as.integer(DIS == 3).
-
-
Example models:
Gibiansky_2014_obinutuzumab.R(effect on kdes via the any-NHL composite indicator; effect on CL_T and CL_inf via the shared BCL/DLBCL composite indicator). -
Notes: Follows the
TUMTP_CHL/TUMTP_GC/TUMTP_SCLCdecomposition pattern. Distinct fromTUMTP_LYMPH(broader lymphoma pool) andTUMTP_PCALCL(primary cutaneous anaplastic large-cell lymphoma; a CD30+ T-cell-lineage entity unrelated to DLBCL). DLBCL is the most common high-grade B-cell-NHL subtype; the Gibiansky 2014 cohort had only 30 DLBCL patients (4.4%), so a single estimated effect on CL is shared with BCL (a much larger pooled group; seeTUMTP_BCLnotes). Ratified canonically on 2026-05-11.
TUMTP_MCL (canonical for mantle cell lymphoma indicator)
- Description: 1 = mantle cell lymphoma (MCL), 0 = other tumor types. Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 = all other tumor types (in
Gibiansky 2014 the implicit reference is CLL when paired with
TUMTP_BCLandTUMTP_DLBCLall = 0). -
Source aliases:
-
DIS(Gibiansky 2014; integer code withDIS == 4flagging MCL) – decompose intoTUMTP_MCL = as.integer(DIS == 4).
-
-
Example models:
Gibiansky_2014_obinutuzumab.R(effect on kdes via the any-NHL composite indicator; separate effect on CL_T and CL_inf via the standalone MCL indicator (ratio 1.75, i.e., 75% higher CL than CLL)). -
Notes: Follows the
TUMTP_CHL/TUMTP_GC/TUMTP_SCLCdecomposition pattern. Distinct fromTUMTP_LYMPH(broader lymphoma pool). The Gibiansky 2014 cohort had only 20 MCL patients (2.9%); the paper reports the highest obinutuzumab CL among the four B-cell-malignancy histologies for MCL, consistent with the highest CD20 expression density on MCL B-cells relative to the other histologies. Ratified canonically on 2026-05-11.
LINE_1L (canonical for first-line-therapy indicator)
- Description: 1 = first-line therapy (1L) / treatment-naive, 0 = second-line or greater (2L+) / relapsed-or-refractory.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (2L+, second-line or greater / relapsed-refractory).
-
Source aliases:
-
LINE(categorical column with levels1L,2L,3L+, …) – decompose intoLINE_1L = as.integer(LINE == "1L"). -
RRFN(relapsed/refractory flag; treatment-naive corresponds to RRFN == 0) – used inLu_2019_polatuzumab.R. Decompose:LINE_1L = as.integer(RRFN == 0).
-
-
Example models:
Sanghavi_2020_ipilimumab.R(exponential coefficient -0.0949 on CL),Lu_2019_polatuzumab.R(multiplicative effects on V1 = 1.20, kdes = 3.38, CL_T = 3.53, FRAC_NS = 0.756; the same pooled-trial NHL cohort mixes 415 R/R and 45 first-line patients). -
Notes: Promoted to scope: general on 2026-04-26
after Lu 2019 polatuzumab vedotin ratified the same 1L vs 2L+
binarization that Sanghavi 2020 ipilimumab introduced. The two papers
use different indicator semantics (Sanghavi reports the effect as
exp(-0.0949 * LINE_1L)and Lu reportstheta^LINE_1Lwith theta < or > 1 depending on the parameter); both reduce to the same canonical 0/1 column. If a future paper requires finer resolution (separate effects for 2L vs 3L+), add a parallelLINE_2Lcanonical rather than overloading this one.
NIVO_1Q3W (canonical for nivolumab 1 mg/kg every 3 weeks co-administration indicator)
- Description: 1 = ipilimumab co-administered with nivolumab 1 mg/kg every 3 weeks; 0 = otherwise (monotherapy or any other nivolumab regimen).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (no nivolumab or any non-1Q3W nivolumab regimen).
-
Source aliases:
-
NIVO_REGIMEN(categorical column with levelsnone,0.3 mg/kg Q3W,1 mg/kg Q2W,1 mg/kg Q3W,3 mg/kg Q2W,3 mg/kg Q3W) – decompose intoNIVO_1Q3W = as.integer(NIVO_REGIMEN == "1 mg/kg Q3W").
-
-
Example models:
Sanghavi_2020_ipilimumab.R(exponential coefficient 0.0950 on ipilimumab CL). -
Notes: Paired with
NIVO_3Q2Win the Sanghavi 2020 final model; both decomposed indicators are 0 for ipilimumab monotherapy. Other nivolumab regimens (0.3 mg/kg Q3W, 1 mg/kg Q2W, 3 mg/kg Q3W) were tested but not retained in the final model and collapse into the reference 0 group.
NIVO_3Q2W (canonical for nivolumab 3 mg/kg every 2 weeks co-administration indicator)
- Description: 1 = ipilimumab co-administered with nivolumab 3 mg/kg every 2 weeks; 0 = otherwise (monotherapy or any other nivolumab regimen).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (no nivolumab or any non-3Q2W nivolumab regimen).
-
Source aliases:
-
NIVO_REGIMEN(categorical column) – decompose intoNIVO_3Q2W = as.integer(NIVO_REGIMEN == "3 mg/kg Q2W").
-
-
Example models:
Sanghavi_2020_ipilimumab.R(exponential coefficient 0.191 on ipilimumab CL). -
Notes: Paired with
NIVO_1Q3W; same reference grouping convention.
COMBO_NIVO (canonical for any-regimen nivolumab combination-therapy indicator)
- Description: 1 = ipilimumab co-administered with any nivolumab regimen, 0 = ipilimumab monotherapy.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (ipilimumab monotherapy).
-
Source aliases:
-
COMBO– used inSanghavi_2020_ipilimumab.R. Equivalently derivable fromNIVO_REGIMENasCOMBO_NIVO = as.integer(NIVO_REGIMEN != "none").
-
-
Example models:
Sanghavi_2020_ipilimumab.R(additive effect -0.202 on the Emax parameter of the time-varying CL function). -
Notes: Distinct from the per-regimen
NIVO_1Q3W/NIVO_3Q2Windicators on baseline CL:COMBO_NIVOaggregates across all nivolumab regimens and acts on the time-varying-CL Emax parameter, whereas the per-regimen indicators act on baseline (time-zero) CL.
BLSTABL (canonical for baseline absolute blast counts in peripheral blood)
- Description: Baseline absolute count of blasts (immature lymphoid/myeloid precursor cells) circulating in peripheral blood. Time-fixed baseline value.
- Units: 10^9 counts/L (equivalently 10^9 counts; reported as “x 10^9 counts” in Wu 2024 Table 2).
- Type: continuous
- Scope: specific
-
Reference category: n/a – used with power scaling
(BLSTABL / <ref>)^exponent. Reference value observed: 0.352 x 10^9 counts (Wu 2024 Table 3, BCP-ALL median). -
Source aliases:
-
BLSTABL– used inWu_2024_inotuzumab.R.
-
-
Example models:
Wu_2024_inotuzumab.R(power exponent -0.0484 on kdes for BCP-ALL patients only; the effect is gated off for B-cell NHL by multiplying the exponent byDIS_BCPALL). -
Notes: Distinct from blasts in bone marrow
(different specimen) and from
BLSTPB(percentage of blasts in peripheral blood, used by the predecessor Garrett 2019 adult model). Not applicable for B-cell NHL patients in pooled BCP-ALL + NHL analyses (Wu 2024 retains the effect only in BCP-ALL patients via the DIS_BCPALL gate). When supplying BLSTABL for an NHL subject, set the value to the BCP-ALL reference (0.352) so the gated power term evaluates to 1 numerically. Scope: specific because the covariate is most meaningful in B-cell-leukemia population PK analyses; promote to general if a second paper retains it.
COMBO_RG (canonical for anti-CD20 (rituximab or obinutuzumab) combination-therapy indicator)
- Description: 1 = polatuzumab vedotin co-administered with rituximab OR obinutuzumab, 0 = single-agent polatuzumab vedotin (or any other regimen lacking an anti-CD20 partner). Time-fixed per subject in the source paper’s analysis cohort.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (single-agent polatuzumab vedotin or no anti-CD20 partner).
-
Source aliases:
-
COMBO(categorical: 0 = single agent, 1 = + rituximab, 2 = + obinutuzumab) – used inLu_2019_polatuzumab.R. Decompose:COMBO_RG = as.integer(COMBO == 1 | COMBO == 2). The Lu 2019 NONMEM separately definesRTX = as.integer(COMBO == 1)andGA101 = as.integer(COMBO == 2)and applies effects astheta^(RTX + GA101); because RTX and GA101 are mutually exclusive, RTX + GA101 takes values {0, 1} and the effect collapses totheta^COMBO_RG.
-
-
Example models:
Lu_2019_polatuzumab.R(multiplicative effects on CL_INF = 0.844, kdes = 0.932, FRAC_NS = 0.709). -
Notes: Rituximab and obinutuzumab both bind CD20 on
B cells (rituximab is a Type I anti-CD20 mAb, obinutuzumab a
glycoengineered Type II), so co-administration is hypothesized to alter
polatuzumab vedotin disposition through depletion of CD79b+ target B
cells. The Lu 2019 final model fits a single combined effect rather than
separate rituximab- and obinutuzumab-specific effects. Scope: specific
because the relevant combination partners (CD20-directed mAbs) are tied
to NHL pathway; if a future paper distinguishes rituximab from
obinutuzumab combinations, register
COMBO_RandCOMBO_Gseparately rather than overloading this canonical.
COMBO_DURVA (canonical for durvalumab combination-therapy indicator)
- Description: 1 = the analyzed therapeutic mAb is co-administered with durvalumab (anti-PD-L1 IgG1), 0 = monotherapy.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (monotherapy).
-
Source aliases:
-
COMB– used inHwang_2022_tremelimumab.R($INPUT NM-TRAN data item; control-stream switchIF(COMB.EQ.0)selects monotherapy parameters andIF(COMB.EQ.1)selects combination-therapy parameters).
-
-
Example models:
Hwang_2022_tremelimumab.R(selects between monotherapy and combination-with-durvalumab values of the time-varying-CL Tmax and lambda parameters). -
Notes: Parallels
COMBO_NIVObut for durvalumab rather than nivolumab co-administration. Acts on the time-varying-CL component (Tmax and lambda); baseline CL is shared between monotherapy and combination groups in Hwang 2022.
COMBO_LEN_DEX (canonical for lenalidomide plus dexamethasone combination-therapy indicator)
- Description: 1 = the analyzed therapeutic mAb (or other agent under PK study) is co-administered with the lenalidomide + low-dose-dexamethasone (Ld) backbone, 0 = monotherapy or any non-Ld regimen. Lenalidomide is an immunomodulatory imide (IMiD) that activates natural killer cells; dexamethasone is an immunosuppressant glucocorticoid. The Ld backbone is a standard combination partner in multiple-myeloma and other hematologic-malignancy regimens.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (monotherapy or any non-Ld
regimen). When a paper reports the reference patient as “with Ld
coadministration” (as Ide 2020 does), the model still stores the
canonical 0/1 column and applies the effect as
exp(theta * (COMBO_LEN_DEX - 1))so that COMBO_LEN_DEX = 1 yields factor 1 (paper’s reference) and COMBO_LEN_DEX = 0 activates the effect. -
Source aliases:
-
LENDEX(1 = with Ld, 0 = without Ld; in Ide 2020 derived fromSTUDY != 204011because study 204011 was the Ld-free elotuzumab-monotherapy cohort) – used inIde_2020_elotuzumab.R. -
COMBO_LD(retired canonical name; renamed toCOMBO_LEN_DEXon 2026-04-27 for clarity).
-
-
Example models:
Ide_2020_elotuzumab.R(multiplicative effects: CLLd = 0.74 on nonspecific CL, encoded asexp(log(0.74) * (COMBO_LEN_DEX - 1)); KINTLd = 10.1 on the second-order target-mediated elimination rate from the peripheral compartment, encoded asexp(log(10.1) * (COMBO_LEN_DEX - 1))). -
Notes: Specific scope because the canonical’s
mechanistic relevance is hematologic-malignancy-domain-bound (multiple
myeloma and related plasma-cell or B-cell disorders). Distinct from
COMBO_BELAMAF(which pools Ld with bortezomib-dex and pomalidomide-dex into a single broader “any-combination” belantamab indicator);COMBO_LEN_DEXis the per-backbone Ld-only flag. If a future paper distinguishes “Ld-only” from a broader “any-IMiD-plus-dex” backbone with separate effects, register a parallel canonical (e.g.,COMBO_PDfor pomalidomide-dex,COMBO_VDfor bortezomib-dex). Sign of the exponential coefficient is paper-dependent: Ide 2020 reportsCLLd = 0.74so the Ld-coadministration arm has 26% lower nonspecific CL than the Ld-free arm, butKINTLd = 10.1so the Ld arm has 10x higher second-order target-mediated elimination – both are mechanistically interpretable (dexamethasone suppresses non-specific catabolic clearance; lenalidomide-activated NK cells increase target-cell-binding-mediated elimination).
COMBO_BELAMAF (canonical for any-combination belantamab mafodotin therapy indicator)
- Description: 1 = belantamab mafodotin administered as part of a combination regimen (with bortezomib + dexamethasone, lenalidomide + dexamethasone, or pomalidomide + dexamethasone) in the relapsed/refractory multiple myeloma setting; 0 = belantamab mafodotin monotherapy.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (belantamab mafodotin monotherapy).
-
Source aliases:
-
COMBO(when the source dataset uses a generic combination flag for belantamab mafodotin pooled regimens) – used inPapathanasiou_2025_belantamab.R.
-
-
Example models:
Papathanasiou_2025_belantamab.R(multiplicative factor theta = 1.44 on the Imax parameter of the time-varying CL function – combination therapy increases the steady-state CL reduction from 33.2 % to 44.0 %). -
Notes: Pools the three combination backbones tested
in DREAMM-6 / DREAMM-7 / DREAMM-8 (Bor-Dex, Len-Dex, Pom-Dex) into a
single binary because Papathanasiou 2025 reports no meaningful
per-backbone difference in cycle-1 ADC exposure. If a future paper tests
per-backbone combination effects, register dedicated indicators
(
COMBO_BELAMAF_BORDEX, etc.) rather than overloading this aggregate.
KG (canonical for subject-specific tumour-growth rate constant from a prior IPP fit)
- Description: Empirical-Bayes posterior estimate of the subject-specific tumour-size first-order growth rate constant carried over from an upstream tumour-size-dynamics population model. Supplied per subject in the dataset and used directly inside a downstream model (e.g., overall-survival hazard) that integrates the tumour-size ODE inline conditional on each subject’s growth/death rates.
-
Units: internal scaled rate; the Zecchin 2016 OS
model carries the source convention
KG / 1000 * tumorSizein the SLD ODE so the column units are(1/day) * 1000as published in the source NONMEM run. Document per-model viacovariateData[[KG]]$units. - Type: continuous
- Scope: specific
-
Reference category: n/a – used directly inside the
SLD ODE (see
Zecchin_2016_survival.R). -
Source aliases:
-
KG(NONMEM$INPUTcolumn in DDMODEL00000218; identical column shipped in the bundle’s Simulated_OS.csv).
-
-
Example models:
Zecchin_2016_survival.R(Zecchin 2016 OS model, DDMODEL00000218). -
Notes: Specific scope because the column is the
empirical-Bayes output of a particular upstream fit
(
modellib('Zecchin_2016_tumorovarian')/ DDMODEL00000217). When this OS model is used standalone, the user must supplyKGper subject – typically by first fitting the SLD model and extracting the per-subject empirical-Bayes posterior. The internal/1000scaling is preserved verbatim from the source$DESblock to maintain numerical equivalence with the published estimates.
KD0 (canonical for subject-specific carboplatin-related tumour-death rate constant from a prior IPP fit)
-
Description: Empirical-Bayes posterior estimate of
the subject-specific carboplatin-driven tumour-size death rate constant
carried over from an upstream tumour-size-dynamics population model.
Pairs with the time-varying
AUC_CARBOcovariate inside the SLD ODE termKD0 * AUC_CARBO * tumorSize. -
Units: internal scaled rate; the Zecchin 2016 OS
model carries the source convention
KD0 / 1000 * AUC_CARBO * tumorSizeso the column units are(1/day per AUC unit) * 1000as published in the source NONMEM run. Document per-model viacovariateData[[KD0]]$units. - Type: continuous
- Scope: specific
-
Reference category: n/a – used directly inside the
SLD ODE (see
Zecchin_2016_survival.R). -
Source aliases:
-
KD0(NONMEM$INPUTcolumn in DDMODEL00000218).
-
-
Example models:
Zecchin_2016_survival.R(Zecchin 2016 OS model, DDMODEL00000218). -
Notes: Specific scope because the column is tied to
a specific drug (carboplatin) and a specific upstream IPP fit. The
internal
/1000scaling is preserved verbatim from the source$DESblock.
KD1 (canonical for subject-specific gemcitabine-related tumour-death rate constant from a prior IPP fit)
-
Description: Empirical-Bayes posterior estimate of
the subject-specific gemcitabine-driven tumour-size death rate constant
carried over from an upstream tumour-size-dynamics population model.
Pairs with the time-varying
AUC_GEMcovariate inside the SLD ODE termKD1 * AUC_GEM * tumorSize. -
Units: internal scaled rate; the Zecchin 2016 OS
model carries the source convention
KD1 / 100 * AUC_GEM * tumorSizeso the column units are(1/day per AUC unit) * 100as published in the source NONMEM run. Document per-model viacovariateData[[KD1]]$units. - Type: continuous
- Scope: specific
-
Reference category: n/a – used directly inside the
SLD ODE (see
Zecchin_2016_survival.R). -
Source aliases:
-
KD1(NONMEM$INPUTcolumn in DDMODEL00000218).
-
-
Example models:
Zecchin_2016_survival.R(Zecchin 2016 OS model, DDMODEL00000218). -
Notes: Specific scope because the column is tied to
a specific drug (gemcitabine) and a specific upstream IPP fit. The
internal
/100scaling is preserved verbatim from the source$DESblock.
IBASE (canonical for subject-specific baseline tumour-size estimate from a prior IPP fit)
-
Description: Empirical-Bayes posterior estimate of
the subject-specific baseline sum-of-longest-diameters (SLD) tumour size
carried over from an upstream tumour-size-dynamics population model.
Used both to set the SLD ODE initial state
(
tumorSize(0) <- IBASE * 1000in the Zecchin 2016 model: source convention multiplies by 1000 to convert the internal value to mm) and to scale the time-varying tumour-size ratio (mmbas <- IBASE * 1000) inside the OS hazard. -
Units: internal scaled length; the Zecchin 2016 OS
model carries the source convention
IBASE * 1000 = mmso the column itself is in metres (1 m = 1000 mm). Document per-model viacovariateData[[IBASE]]$units. - Type: continuous
- Scope: specific
-
Reference category: n/a – used as the SLD ODE
initial condition and as the denominator in
(tumorSize - mmbas) / mmbas. -
Source aliases:
-
IBASE(NONMEM$INPUTcolumn in DDMODEL00000218; identical column shipped in the bundle’s Simulated_OS.csv with values typically in the 0.04-0.50 m range).
-
-
Example models:
Zecchin_2016_survival.R(Zecchin 2016 OS model, DDMODEL00000218). -
Notes: Distinct from the canonical
TUM_SLDcolumn.TUM_SLDcarries the measured baseline tumour size in mm (used to compute the time-fixedNSLD0 = TUM_SLD / 70covariate term in the Zecchin 2016 OS hazard), whereasIBASEcarries the empirical-Bayes fitted baseline from the upstream SLD model (used to initialise the integrated SLD trajectory and to define the time-varying TSR(t) reference). The two are correlated but not equal because the upstream IPP fit smooths measurement noise away from the observed SLD0. Specific scope because the column is the empirical-Bayes output of a specific upstream model fit and the internal*1000unit-conversion is tied to the source NONMEM coding convention.
NWLS (canonical for time-varying new-lesion appearance indicator)
-
Description: Time-varying binary indicator of
whether a new (non-target) RECIST lesion has appeared since enrolment. 1
= new lesion present at the current observation time; 0 = no new lesion
as of the current observation time. Once
NWLSflips to 1 it stays 1 for subsequent observation times in that subject (a step-function flag, not a transient pulse). - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (no new lesion appeared as of the current time).
-
Source aliases:
-
NWLS(NONMEM$INPUTcolumn in DDMODEL00000218); the bundle’s Simulated_OS.csv re-labels the same columnNWLSCOV. Downstream consumers should mapNWLSCOV->NWLS.
-
-
Example models:
Zecchin_2016_survival.R(Zecchin 2016 OS model, DDMODEL00000218; multiplicative effect on the Weibull hazard viaexp(e_nwls_haz * NWLS)withe_nwls_haz = 1.23per Output_real_OS.lst FINAL TH5 / Table 2 of Zecchin 2016). -
Notes: Specific scope because the column encodes a
per-paper RECIST-style binary that is supplied by the dataset; it is not
a generic “any new lesion” indicator the user can populate from routine
clinical data without an explicit lesion-appearance imaging schedule.
The Zecchin 2016 OS model uses
NWLSdirectly (no time-gating), which is faithful to the simulated dataset shipped in the DDMORE bundle. The sourceOutput_real_OS.lst(the listing on the original real dataset) gates the indicator with an additionalTNWLS(lesion-appearance-time) column not shipped in the bundle’s simulated dataset; the two encodings are functionally equivalent when the dataset’sNWLScolumn is constructed as a 0/1 step that flips at the lesion-appearance time. The bundle’s simulated dataset uses the simpler step-function form, and that is the form the nlmixr2lib model expects.
MM_NIGG (canonical for non-IgG multiple myeloma immunoglobulin-type indicator)
- Description: 1 = patient with non-IgG-secreting multiple myeloma (e.g., IgA, IgD, IgE, IgM, light-chain-only / Bence Jones, or non-secretory MM), 0 = patient with IgG-secreting multiple myeloma.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (IgG MM).
-
Source aliases:
-
Ig_type– used inFau_2020_isatuximab.R. Values 0 / 1 with the same orientation as the canonical (1 = non-IgG MM).
-
-
Example models:
Fau_2020_isatuximab.R(exponential effect on the steady-state linear CL CLinf with coefficient -0.751, and on the time-varying-CL half-time KCL with coefficient -0.931),Xu_2020_daratumumab.R(additive shift(1 + 0.806 * (1 - MM_NIGG))on linear CL — Xu 2020 parameterises with non-IgG MM as reference, so an IgG MM patient receives an 80.6% higher linear CL than a non-IgG MM patient; canonical column semantics 1 = non-IgG / 0 = IgG are preserved). -
Notes: Within-disease (multiple-myeloma)
immunoglobulin-subtype stratifier. The mechanistic rationale (Fau 2020)
is that endogenous IgG monoclonal protein in IgG-MM patients competes
with the therapeutic IgG mAb for FcRn-mediated salvage, raising the
therapeutic mAb’s catabolic clearance; non-IgG-MM patients lack that
competition and exhibit lower therapeutic-mAb clearance. Distinct from
the disease-state indicators (
DIS_SMM= smoldering MM); applies only after a multiple-myeloma diagnosis is established. Scope: specific because the comparison is a within-MM stratifier rather than a cross-population indicator. Reference category at the model level (which value of MM_NIGG corresponds to TVCL = base) varies between papers: Fau 2020 anchors to 0 (IgG MM) and Xu 2020 anchors to 1 (non-IgG MM); the canonical column orientation (1 = non-IgG) is fixed across papers and the per-modelcovariateData[[MM_NIGG]]$reference_categoryfield records which anchor each model uses.
TUM_TP53_MUT (canonical for tumour TP53 / p53 mutation indicator)
- Description: Binary indicator of tumour-cell TP53 mutational status as assessed in the source paper. 1 = TP53 mutant tumour (typically a missense mutation detected by sequencing, or p53 protein overexpression by immunohistochemistry used as a surrogate for TP53 missense mutation per Gillet et al. J Neurooncol 2014); 0 = TP53 wild-type tumour. Time-fixed per subject (a somatic tumour-genotype call made at diagnosis, not a germline genotype).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (TP53 wild-type).
-
Source aliases:
- “p53 mutation” / “p53 mutated” (Mazzocco 2015, paper text; p53 overexpression by IHC used as a surrogate for TP53 missense mutation).
-
Example models:
Mazzocco_2015_temozolomide.R(exponential effect on the temozolomide tumour-cell-death rate constantgamma:gamma = gamma0 * exp(beta_p53 * TUM_TP53_MUT)withbeta_p53 = log(0.143 / 0.254) = -0.574; TP53-mutant LGG tumours are 44% less sensitive to TMZ than TP53-wild-type tumours). -
Notes: Specific scope because the column encodes a
somatic tumour-genotype call (mechanism = altered DNA-damage response in
tumour cells) rather than a germline pharmacogenomic variant. Distinct
from the
SNP_<GENE>_<RSID>family, which encodes inherited host germline genotypes affecting drug PK;TUM_TP53_MUTencodes a tumour-cell mutation and only makes mechanistic sense for drugs whose effect depends on a functional p53 pathway in the tumour. The Mazzocco 2015 cohort assayed p53 status by IHC overexpression; future extractions that use direct TP53 sequencing should still record their values under this canonical and document the assay method incovariateData[[TUM_TP53_MUT]]$notes. Ratified canonically on 2026-05-17 alongside the Mazzocco 2015 temozolomide extraction.
TUM_1P19Q_CODEL (canonical for tumour 1p/19q chromosomal codeletion indicator)
- Description: Binary indicator of tumour-cell 1p/19q chromosomal codeletion status. 1 = tumour carries the combined loss of the short arm of chromosome 1 (1p) and the long arm of chromosome 19 (19q); 0 = non-codeleted tumour (intact 1p and/or 19q). Time-fixed per subject (a somatic tumour-genotype call made at diagnosis, typically by fluorescence in-situ hybridisation, comparative genomic hybridisation, or loss-of-heterozygosity assay).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-codeleted; intact 1p and/or 19q).
-
Source aliases:
- “1p/19q codeletion” / “1p/19q codeleted” (Mazzocco 2015, paper text).
-
Example models:
Mazzocco_2015_temozolomide.R(exponential effect on the damaged-quiescent-to-proliferative repair rate constantkQpP:kQpP = kQpP0 * exp(beta_1p19q * TUM_1P19Q_CODEL)withbeta_1p19q = log(0.00807 / 0.00947) = -0.160; 1p/19q-codeleted LGG tumours have 15% lower kQpP than non-codeleted tumours, consistent with longer reported duration of response in codeleted patients). - Notes: Specific scope because the column encodes a brain-tumour-specific somatic chromosomal alteration with mechanistic relevance to DNA-repair capacity in glioma cells; it is not a generic “any tumour-chromosomal-alteration” indicator. The 1p/19q codeletion is a defining molecular feature of oligodendrogliomas (per the 2016 WHO classification of CNS tumours) and is mutually exclusive with TP53 missense mutation in the Mazzocco 2015 cohort (Ricard 2007, ref 12 of Mazzocco 2015). Ratified canonically on 2026-05-17 alongside the Mazzocco 2015 temozolomide extraction.
Laboratory / disease-activity
ALBR (canonical for serum albumin normalized to the laboratory’s upper limit of normal)
-
Description: Serum albumin normalized to the
laboratory’s upper limit of normal
(
albumin_observed / ULN_albumin). - Units: (unitless ratio)
- Type: continuous
- Scope: specific
-
Reference category: n/a – used as a power term
(ALBR / <ref>)^exponent. Reference 0.78 used in Xu 2019 (corresponds to a median serum albumin of 38 g/L at a typical ULN of ~48.7 g/L). - Source aliases: none.
-
Example models:
Xu_2019_sarilumab.R. - Notes: Xu 2019 normalizes to each site’s ULN so that values across multiple labs with different reference ranges can be pooled. Scoped specific because the ULN-normalization convention is tied to the Xu 2019 analysis plan; future papers using the same ratio should either add themselves to the example_models list or promote this entry to general.
Inflammatory-bowel-disease disease-activity covariates
CALPRO (canonical for fecal calprotectin)
- Description: Fecal calprotectin, a gut-inflammation biomarker (baseline or time-fixed per subject unless a paper explicitly uses a time-varying value).
-
Units: mg/kg stool (equivalent to ug/g). Document
per-model via
covariateData[[CALPRO]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(CALPRO / ref)^exponent. Reference 700 mg/kg used in Rosario 2015 (overall population median). - Source aliases: none.
-
Example models:
Rosario_2015_vedolizumab.R(reference 700 mg/kg; exponent +0.0310 on linear clearance CLL). -
Notes: Common IBD severity biomarker (inflammation
of the gut epithelium). Assays typically report in ug/g stool; 1 ug/g =
1 mg/kg. Document per-model whether baseline-only or time-varying in
covariateData[[CALPRO]]$notes.
CDAI (canonical for Crohn’s Disease Activity Index)
- Description: Crohn’s Disease Activity Index composite score. Higher values indicate more active disease; <150 remission, 150-219 mild, 220-450 moderate, >450 severe. Defined only for patients with a CD diagnosis; set to the reference value (or gate via the indicator) for UC patients.
- Units: (score, 0-600)
- Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(CDAI / ref)^exponent. Reference 300 used in Rosario 2015 (typical moderate-CD score). - Source aliases: none.
-
Example models:
Rosario_2015_vedolizumab.R(reference 300; exponent -0.0515 on CLL gated byIBD_CDso the effect applies only to CD patients). -
Notes: Mutually exclusive with
PMAYOin pooled UC+CD populations: each patient has exactly one disease-activity score (CDAI for CD, partial Mayo for UC). Gate via theIBD_CDindicator when pooling.
PMAYO (canonical for partial Mayo score)
-
Description: Partial Mayo score for ulcerative
colitis (sum of stool-frequency, rectal-bleeding, and physician-global
subscores, range 0-9). Higher values indicate more active disease.
Defined only for patients with a UC diagnosis; gate via the
IBD_CDindicator when pooling UC+CD. - Units: (score, 0-9)
- Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(PMAYO / ref)^exponent. Reference 6 used in Rosario 2015 (typical moderate-UC score). - Source aliases: none.
-
Example models:
Rosario_2015_vedolizumab.R(reference 6; exponent +0.0408 on CLL gated by(1 - IBD_CD)so the effect applies only to UC patients). -
Notes: The partial Mayo score excludes the
endoscopy subscore (the full Mayo score is 0-12). Mutually exclusive
with
CDAIin pooled UC+CD populations.
MAYO_E (canonical for baseline Mayo endoscopic subscore)
-
Description: Mayo endoscopic subscore at baseline
for ulcerative colitis, integer 0-3 (0 = normal / inactive disease, 1 =
mild, 2 = moderate, 3 = severe). The endoscopic subscore is one of the
four components of the full Mayo score (0-12); the partial Mayo score
(
PMAYO) excludes it. Time-fixed per subject. - Units: (score, 0-3)
- Type: categorical
- Scope: general
-
Reference category: depends on per-model encoding –
papers that include the full 0-3 range typically reference Mayo 0 or 1
(mild), while papers restricted to moderate-to-severe UC (the typical
biologic-induction-therapy population) reference Mayo 2 or 3. Document
the per-model reference category in
covariateData[[MAYO_E]]$reference_category. -
Source aliases:
-
MPRE– used inFaelens_2021_infliximab.R(NONMEM column for “Mayo endoscopic score pre-induction”). The Faelens 2021 dataset additionally codes a “missing” sentinelMPRE = -99; treat this as out-of-domain when applying the model and document per-model.
-
-
Example models:
Faelens_2021_infliximab.R(categorical effect on KE: separate typical KE for Mayo 1 / Mayo 2 / Mayo 3; reference category Mayo 2). -
Notes: Distinct from the full Mayo score (0-12) and
the partial Mayo score (
PMAYO, 0-9). The endoscopic subscore alone is the core inclusion criterion in many UC induction-therapy popPK datasets (typically Mayo 2 or 3 = moderate-to-severe disease). Mutually compatible withPMAYOin datasets that report both.
ENDO_ULCER (canonical for endoscopically active luminal disease at baseline)
- Description: 1 = mucosal ulcerations confirmed at baseline ileocolonoscopy / endoscopy (endoscopically active luminal disease), 0 = no mucosal ulcerations at baseline. Time-fixed (assessed at study entry).
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no mucosal ulcerations at baseline).
- Source aliases: none known.
-
Example models:
Aguiar_2021_ustekinumab.R(Aguiar 2021 Table 3; covariate on baseline fecal calprotectin FC0: 213 mg/kg with ulcers vs 102 mg/kg without). - Notes: IBD-specific structural-disease-activity indicator, distinct from the symptom-driven CDAI / PMAYO and the biomarker-driven CALPRO / CRP. Useful as a covariate on baseline biomarkers (FC, CRP) and as a stratifier for simulations of biochemical remission.
Inflammatory-bowel-disease diagnosis
IBD_CD (canonical for Crohn’s disease indicator)
- Description: 1 = Crohn’s disease (CD) diagnosis, 0 = ulcerative colitis (UC) diagnosis. Used to gate pooled UC+CD population models where some parameters or covariate effects differ by diagnosis.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (ulcerative colitis).
-
Source aliases:
-
DIAGNOSIS,DX(categorical"UC"/"CD") – deriveIBD_CD = as.integer(DX == "CD").
-
-
Example models:
Rosario_2015_vedolizumab.R(two typical-CLL switch between UC and CD; gatesPMAYOandCDAIeffects; multiplicative +1% effect on Vc). - Notes: Rosario 2015 models separate typical CLL for UC vs CD and gates the partial-Mayo (UC-only) and CDAI (CD-only) disease-activity covariates via this indicator.
Concomitant / prior medication
CONMED_AD (canonical for concomitant Alzheimer’s-symptomatic medication)
-
Description: 1 = subject is on concomitant
Alzheimer’s-symptomatic medication (typically a cholinesterase inhibitor
and/or memantine) at the observation, 0 = not on such medication.
Time-fixed in the Conrado 2014 source dataset (one indicator per
subject) and treated as such here; future models that need a
time-varying form may register a successor canonical or document
time-varying use in
covariateData[[CONMED_AD]]$notes. - Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 1 (on concomitant
Alzheimer’s-symptomatic medication) in the Conrado 2014 model – the
source paper labels CONMED_AD = 1 as the “most common” category and
centres the slope-effect coefficient on this group, so the
multiplicative factor on disease-progression slope is
1forCONMED_AD = 1and1 + e_sl_conmed_ad_offforCONMED_AD = 0. The non-standard “most-common-as-reference” convention is preserved from the source for traceability; future Alzheimer’s models that adopt the more common “off-treatment as reference” convention should register a successor canonical (CONMED_AD_TREATED) rather than overloading this one. -
Source aliases:
-
COMED2– used inConrado_2014_alzheimer.R(DDMORE Foundation Model Repository entry DDMODEL00000290). The2suffix in the source distinguishes this binary from upstreamCOMEDandPRIMCOMEDcolumns (free-text concomitant-medication entries) in the same NONMEM input dataset; the binaryCOMED2flag is what enters the model.
-
-
Example models:
Conrado_2014_alzheimer.R(multiplicative factor on the typical-value disease-progression slope:slope_factor = 1 + (1 - CONMED_AD) * e_sl_conmed_ad_offwithe_sl_conmed_ad_off = -0.302, i.e. ~30% smaller progression slope for the off-treatment reference cohort). -
Notes: The source
.moddoes not specify the symptomatic-medication class beyond the binary flag; the Conrado 2014 publication context (CAMD ADAS-Cog disease-progression dataset, 2014) makes cholinesterase-inhibitor / memantine the dominant interpretation. TreatingCONMED_AD = 1as the reference category is unusual relative to the rest of theCONMED_*family (CONMED_PARA,CONMED_NSAID,CONMED_AZA, etc.) which all use 0 = not-on as reference; the inversion is preserved here only because the source paper’s coefficient was estimated with the “most common = 1” convention. Ratified canonically on 2026-05-06 alongside the Conrado 2014 DDMORE extraction.
CONMED_AED (canonical for any concomitant antiepileptic drug coadministration)
- Description: 1 = subject is on at least one concomitant antiepileptic drug (AED) other than the modelled AED at the PK observation, 0 = on the modelled AED as monotherapy. Time-varying when concurrent AEDs cycle on / off across study occasions, time-fixed when the source paper analyses chronic-maintenance cohorts whose concurrent therapy is unchanged across the analysis window.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (modelled-AED monotherapy).
-
Source aliases:
-
CO_AED– used inYukawa_1990_phenytoin.R(paper’sCOindicator inverted: sourceCO = 1if PHT alone, theta_co otherwise; canonicalCONMED_AED = 1 - CO_indicatorso 0 is the PHT-monotherapy reference).
-
-
Example models:
Yukawa_1990_phenytoin.R(multiplicative^CONMED_AEDfactor on Vmax (1.08) and Km (1.32) for chronic phenytoin patients on at least one of phenobarbital, carbamazepine, valproate, primidone, clonazepam, sultiame, ethotoin, ethosuximide, acetazolamide, or diazepam). -
Notes: Generic concomitant-AED indicator covering
the heterogeneous mix of older AEDs (PB, CBZ, VPA, primidone,
clonazepam, etc.) studied alongside the modelled drug; the per-paper
list of qualifying AEDs must be documented in
covariateData[[CONMED_AED]]$notes. Distinct from drug-specific concomitant-AED indicators (e.g., a futureCONMED_PBfor concomitant phenobarbital alone) which would warrant separate canonicals when a paper distinguishes effects by AED class. Follows theCONMED_*family pattern (CONMED_AZA,CONMED_NSAID, etc.). Ratified canonically on 2026-05-10 alongside the Yukawa 1990 phenytoin extraction.
CONMED_AMIO (canonical for concomitant amiodarone coadministration indicator)
- Description: 1 = subject is coadministered amiodarone (Class III antiarrhythmic; CYP3A4 / CYP2C9 / P-gp inhibitor) during the study, 0 = no concomitant amiodarone. Time-varying when amiodarone start / stop events are captured; the Xia 2024 source treats amiodarone as time-fixed at the analysis baseline.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no concomitant amiodarone).
-
Source aliases:
-
CM1– used inXia_2024_warfarin.R(Xia 2024 Figure 1 / Table 2 covariate-screening variable labelling:CM1 = 1indicates combined amiodarone,0no combination).
-
-
Example models:
Xia_2024_warfarin.R(piecewise multiplicative effect on warfarin EC50:ec50 *= (1 + e_amio_ec50 * CONMED_AMIO)withe_amio_ec50 = -0.602, i.e. amiodarone reduces EC50 by ~60% in the Han Chinese cohort). -
Notes: Amiodarone is the canonical CYP2C9 / CYP3A4
inhibitor that potentiates warfarin’s anticoagulant effect in clinical
practice; the Xia 2024 cohort prevalence was 25.7% (Table 1). The
per-paper definition (any amiodarone use vs current loading-dose use vs
steady-state use) should be documented in
covariateData[[CONMED_AMIO]]$notes. Ratified canonically on 2026-05-16 alongside the Xia 2024 warfarin extraction.
CONMED_AMINO (canonical for concomitant aminosalicylate therapy)
- Description: 1 = on concomitant aminosalicylate (5-aminosalicylic acid / mesalamine / mesalazine / olsalazine / sulfasalazine etc.) therapy at the PK observation, 0 = not.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no concomitant aminosalicylate).
-
Source aliases:
AMINO– used inRosario_2015_vedolizumab.R. -
Example models:
Rosario_2015_vedolizumab.R(power-form on CLL:CLL * 1.02^CONMED_AMINO). -
Notes: Covers the full aminosalicylate class (5-ASA
is the single active moiety shared by most agents); use
CONMED_AMINOrather thanCONMED_5ASAunless the source paper explicitly restricts the indicator to 5-ASA monotherapy.
CONMED_AVD (canonical for brentuximab vedotin + AVD (adriamycin/doxorubicin, vinblastine, dacarbazine) combination indicator)
- Description: 1 = subject is receiving brentuximab vedotin in combination with the AVD chemotherapy backbone (adriamycin a.k.a. doxorubicin, vinblastine, dacarbazine) for newly diagnosed advanced-stage Hodgkin lymphoma; 0 = otherwise (single-agent brentuximab vedotin). Encodes the A+AVD frontline regimen as a study-design covariate on ADC clearance.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (single-agent brentuximab vedotin – no AVD coadministration).
-
Source aliases:
-
DOX– used inZhou_2025_brentuximab.R(the NONMEM dataset uses the doxorubicin-administration flag as the AVD-coadministration indicator because doxorubicin is given on the same days as the other AVD agents in this regimen).
-
-
Example models:
Zhou_2025_brentuximab.R(power-form effect on ADC clearance:CL * 2.12^CONMED_AVD– ADC clearance is ~2.1-fold higher under A+AVD vs single-agent BV). -
Notes: Distinct from
CONMED_CHEMO(which is nivolumab + platinum-based chemotherapy). The A+AVD regimen is the standard chemotherapy backbone for frontline classical Hodgkin lymphoma; promote to general scope if a second BV paper reports the same A+AVD covariate effect with a comparable encoding.
CONMED_AZA (canonical for concomitant azathioprine)
- Description: 1 = on concomitant azathioprine at the PK observation, 0 = not on azathioprine.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no concomitant azathioprine).
-
Source aliases:
AZA– used inRosario_2015_vedolizumab.R. -
Example models:
Rosario_2015_vedolizumab.R(power-form on CLL:CLL * 0.998^CONMED_AZA; effect ~= null). - Notes: Thiopurine immunomodulator used as maintenance therapy in IBD. Standard convention is baseline-use-only, but time-varying use is permitted; document per-model.
CONMED_AZOLE (canonical for concomitant azole antifungal therapy (CYP3A4/P-gp inhibitor))
- Description: 1 = patient coadministered an azole antifungal (itraconazole, voriconazole, fluconazole, ketoconazole, posaconazole, isavuconazole, or another systemic azole) during the observation interval, 0 = no concomitant azole antifungal. Time-varying per subject because azole exposure starts and stops during the observation period.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no concomitant azole antifungal).
-
Source aliases:
-
AZOLE– used inKirubakaran_2022_tacrolimus.R.
-
-
Example models:
Kirubakaran_2022_tacrolimus.R(state-dependent typical CL/F: 21.1 L/h without azole and 4.2 L/h with azole, an 80% reduction; also a state-dependent BSV magnitude on CL/F: 61% CV without azole vs 89.5% CV with azole). -
Notes: Azoles are mechanism-based CYP3A4 and
P-glycoprotein inhibitors with different inhibitor potencies
(itraconazole > voriconazole > fluconazole). The per-model
covariateData[[CONMED_AZOLE]]$notesmust document (1) which azoles are pooled into the indicator, (2) any post-cessation lag (Kirubakaran 2022 carriesCONMED_AZOLE = 1for one week after azole discontinuation to allow tacrolimus apparent clearance to stabilize given itraconazole’s long half-life), and (3) whether the indicator is a baseline-only proxy or a true time-varying flag.
CONMED_CBZ (canonical for concomitant carbamazepine coadministration indicator)
- Description: 1 = subject is taking carbamazepine (CBZ) as a concomitant antiepileptic drug at the PK observation, 0 = no concomitant carbamazepine. Carbamazepine is a strong CYP3A4 / UGT / P-gp inducer that increases the apparent clearance of co-administered drugs. Time-varying when carbamazepine starts / stops within the observation window; time-fixed when the source paper analyses chronic-maintenance cohorts whose AED therapy is stable across the analysis window.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no concomitant carbamazepine).
-
Source aliases:
-
CBZ– used inSchoemaker_2017_brivaracetam.R(paper covariateCBZfor carbamazepine coadministration).
-
-
Example models:
Schoemaker_2017_brivaracetam.R(multiplicative effect on apparent oral clearance:cl *= (1 + 0.479 * CONMED_CBZ); +47.9% relative to no-CBZ reference, corresponding to ~32% lower brivaracetam exposure, Schoemaker 2017 Table 1). -
Notes: Drug-specific CONMED_* indicator anticipated
in the [[CONMED_AED]] notes; used when a paper estimates a separate
CBZ-induction effect distinct from the pooled EIAED / AED class effect.
Distinct from the broader [[CONMED_EIAED]] (any enzyme-inducing AED) and
[[CONMED_AED]] (any concomitant AED). When a paper distinguishes
individual AEDs separately (Schoemaker 2017: PB, CBZ, VPA each carry
their own coefficient), use the drug-specific canonicals [[CONMED_PB]],
CONMED_CBZ, [[CONMED_VPA]] rather than collapsing into the class-level indicator. Ratified canonically on 2026-05-20 alongside the Schoemaker 2017 brivaracetam paediatric extraction.
CONMED_CHEMO (canonical for anti-PD-(L)1 mAb + chemotherapy combination indicator)
- Description: 1 = subject is receiving an anti-PD-(L)1 monoclonal antibody in combination with platinum-based chemotherapy (gemcitabine + cisplatin, pemetrexed + cisplatin, paclitaxel + carboplatin, or platinum-doublet); 0 = otherwise. Encodes chemotherapy coadministration as a study-design covariate on the antibody’s CL.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no chemotherapy coadministration – monotherapy or, where applicable, a non-chemotherapy combination such as anti-PD-1 + anti-CTLA-4).
-
Source aliases:
-
CHEMO– used inZhang_2019_nivolumab.R. -
MONOTR– used inKuchimanchi_2024_dostarlimab.R(the paper’s structural-equation indicator for monotherapy; the canonical column carries the inverse value, i.e.CONMED_CHEMO = 1 - MONOTR, so the canonical column is 1 when the patient is on combo-chemotherapy).
-
-
Example models:
Zhang_2019_nivolumab.R(exponential effect on baseline CL:exp(-0.104)~= 0.90 fold, i.e. ~9.7% lower CL relative to monotherapy),Kuchimanchi_2024_dostarlimab.R(multiplicative effect on baseline CL:1 - 0.0779= 0.922, i.e. 7.79% lower CL on dostarlimab + carboplatin/paclitaxel relative to dostarlimab monotherapy). -
Notes: Promoted from specific to general scope on
2026-04-27 after the Kuchimanchi 2024 dostarlimab +
carboplatin/paclitaxel analysis ratified the same pooling convention
(any chemotherapy backbone collapsed into a single binary indicator).
The two papers use different functional forms for the effect on CL –
Zhang 2019 uses
exp(theta * CONMED_CHEMO)(exponential) and Kuchimanchi 2024 uses(1 + theta * CONMED_CHEMO)(multiplicative); these are different parameterisations of the same underlying study-design indicator and the canonical column meaning is unchanged. Document the per-model functional form incovariateData[[CONMED_CHEMO]]$notes.
CONMED_EFV (canonical for concomitant efavirenz indicator)
- Description: 1 = subject is receiving efavirenz (EFV)-based antiretroviral therapy as the third agent in a combination ART regimen; 0 = subject is on the comparator regimen specified by the source paper (typically a protease-inhibitor-based regimen such as standard lopinavir/ritonavir 4:1). Efavirenz is a CYP3A and UGT inducer, so the indicator is used to flag PXR-mediated induction of metabolic clearance for co-administered antiretrovirals.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-EFV reference regimen, paper-defined; e.g., LPV/r 4:1 in Tikiso 2021).
-
Source aliases:
-
EFV– used inTikiso_2021_abacavir.R(the dataset’s paper-defined indicator, 1 = on EFV-based ART, 0 = on standard LPV/r 4:1).
-
-
Example models:
Tikiso_2021_abacavir.R(multiplicative effect on apparent oral clearance:cl *= (1 + 0.120 * CONMED_EFV); +12.0% relative to the LPV/r 4:1 reference). - Notes: Specific scope because the comparator regimen (LPV/r 4:1 in Tikiso 2021) is paper-defined; future ART population-PK models that test EFV-vs-other contrasts should extend the example list when the comparator matches, or register a finer-grained sibling indicator otherwise.
CONMED_EIAED (canonical for concomitant enzyme-inducing antiepileptic drug indicator)
- Description: 1 = subject is taking at least one enzyme-inducing antiepileptic drug (EIAED) such as carbamazepine, phenobarbital, or phenytoin during the study; 0 = no EIAED coadministration. EIAEDs induce hepatic metabolism (CYP3A4/UGT-mediated pathways) and increase the clearance of co-administered antiepileptic drugs and their active metabolites.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no EIAED coadministration).
-
Source aliases:
-
MED(with the value convention inverted: source codes 1 = absence of EIAEDs and 0 = presence; canonical inverts this so 1 = presence and 0 = absence) – used inRodrigues_2017_oxcarbazepine.R.
-
-
Example models:
Rodrigues_2017_oxcarbazepine.R(exponential effect on MHD apparent clearance:cl_mhd *= exp(e_eiaed_cl_mhd * (1 - CONMED_EIAED)); CL_MHD is 29.3% higher with EIAEDs vs without, encoded as +0.257 on the absence-indicator in the source paper). -
Notes: Per-model
covariateData[[CONMED_EIAED]]$notesshould list the specific EIAEDs counted as “EIAED = 1” since inclusion criteria vary across antiepileptic-drug studies. Rodrigues 2017 counts carbamazepine, phenobarbital, and phenytoin as EIAEDs; other AEDs in the dataset (vigabatrin, clobazam, valproic acid, clonazepam, lamotrigine, diazepam, ethosuccimide, progabide) are not. The source paper uses an “absence of EIAED” indicator (MED = 1if no EIAED,MED = 0if EIAED present); the canonical column inverts this so that the 1 group is “on EIAED”, matching the convention for other CONMED_* indicators.
CONMED_ERA (canonical for concomitant endothelin-receptor-antagonist monotherapy indicator in PAH)
- Description: 1 = patient is on a concomitant endothelin-receptor-antagonist (ERA) PAH therapy (e.g., bosentan, macitentan, ambrisentan) alone – i.e. on an ERA but NOT also on a phosphodiesterase type 5 inhibitor; 0 = otherwise. Time-fixed per subject for the PK/PD analysis (PAH comedication at baseline, stable dose required before selexipag start).
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (PAH-comedication-naive OR on
PDE5 inhibitor alone OR on both ERA+PDE5; mutually exclusive with
CONMED_PDE5IandCONMED_ERA_PDE5I). -
Source aliases:
-
PAHCOMEDcategory 1 (“ERA only”) – decomposed from the categoricalPAHCOMEDsource column with levels {naive, ERA, PDE5, ERA+PDE5} inKrause_2017_selexipag.R.
-
-
Example models:
Krause_2017_selexipag.R(multiplicative effect on the ACT-333679 elimination rate constant:km *= (1 + 0.15 * CONMED_ERA); +15% relative to the PAH-comedication-naive reference, Krause 2017 Table 1). -
Notes: Used together with
CONMED_PDE5IandCONMED_ERA_PDE5Ito decompose a four-level PAH-comedication categorical (naive / ERA-only / PDE5-only / ERA-and-PDE5) into three orthogonal mutually-exclusive binary indicators, with the PAH-comedication-naive group as the reference (all three indicators = 0). The four-level categorical encoding preserves the source paper’s stratum-specific coefficient set (Krause 2017 reports a separate categorical b-coefficient for each non-reference stratum rather than independent class-level effects). Specific scope because the indicator’s semantics are tied to the GRIPHON study’s PAH-comedication taxonomy.
CONMED_ERA_PDE5I (canonical for concomitant ERA + PDE5-inhibitor combination indicator in PAH)
- Description: 1 = patient is on both a concomitant endothelin-receptor-antagonist (ERA) and a phosphodiesterase type 5 inhibitor (PDE5I) as PAH therapy; 0 = otherwise. Time-fixed per subject for the PK/PD analysis.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (PAH-comedication-naive OR on
ERA alone OR on PDE5I alone; mutually exclusive with
CONMED_ERAandCONMED_PDE5I). -
Source aliases:
-
PAHCOMEDcategory 3 (“ERA and PDE5 inh.”) – decomposed from the categoricalPAHCOMEDsource column with levels {naive, ERA, PDE5, ERA+PDE5} inKrause_2017_selexipag.R.
-
-
Example models:
Krause_2017_selexipag.R(multiplicative effect on the ACT-333679 elimination rate constant:km *= (1 + 0.37 * CONMED_ERA_PDE5I); +37% relative to the PAH-comedication-naive reference, Krause 2017 Table 1; the combined stratum has its own categorical coefficient distinct from the sum of the ERA-only and PDE5I-only effects). -
Notes: Used together with
CONMED_ERAandCONMED_PDE5Ito decompose a four-level PAH-comedication categorical (naive / ERA-only / PDE5-only / ERA-and-PDE5) into three orthogonal mutually-exclusive binary indicators. The standalone combined-stratum coefficient (rather than a product of class-level indicator effects) preserves Krause 2017’s full categorical-effect parameterisation. Specific scope because the indicator’s semantics are tied to the GRIPHON study’s PAH-comedication taxonomy.
CONMED_EZE (canonical for concomitant ezetimibe coadministration indicator)
- Description: 1 = patient is taking ezetimibe (with or without other lipid-lowering comedication), 0 = not on ezetimibe.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (not on ezetimibe).
-
Source aliases:
- Derived from an ezetimibe-identifier column in the source.
-
Example models:
Kuchimanchi_2018_evolocumab.R(multiplicative effect 1.20 on Vmax:Vmax * 1.20^CONMED_EZE; labeled “Statin + ezetimibe exponent” in Kuchimanchi 2018 Table 3 because ~99% of ezetimibe users in the dataset were also on a conmed_statin, so the effect effectively captures combination therapy). - Notes: Scope: specific because Kuchimanchi 2018 interprets the ezetimibe indicator as a combination-therapy marker rather than a pure ezetimibe effect. Future popPK/PD models with cleaner ezetimibe separation should add themselves here or register a more specific canonical.
CONMED_H2RA (canonical for concomitant H2-receptor-antagonist use)
- Description: 1 = patient on concomitant histamine H2-receptor-antagonist therapy (e.g., ranitidine, famotidine), 0 = no CONMED_H2RA use. Captures another class of gastric-pH-modifying co-medication that may reduce bioavailability of pH-sensitive orally administered drugs.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no CONMED_H2RA use).
-
Source aliases:
-
H2– used inGoel_2016_Sonidegib.R(Goel 2016 dataset; defined as “significant” CONMED_H2RA use, i.e. duration of CONMED_H2RA use >= 80% of the PK assessment phase).
-
-
Example models:
Goel_2016_Sonidegib.R(multiplicative effect on F:0.996^CONMED_H2RA, no clinically meaningful effect; reported alongsideCONMED_PPIfor completeness). -
Notes: Per-model
covariateData[[CONMED_H2RA]]$notesmust document the operational definition (Goel 2016: >= 80% of PK assessment phase). Distinct fromCONMED_PPI.
CONMED_IFNB1A (canonical for concomitant interferon beta-1a coadministration indicator)
- Description: 1 = patient coadministered subcutaneous interferon beta-1a (Rebif or equivalent recombinant IFN beta-1a product) during the observation interval, 0 = no concomitant IFN beta-1a. Time-varying per subject because the source studies enrol both monotherapy and IFN beta-1a combination periods.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no concomitant IFN beta-1a).
-
Source aliases:
-
IFNB1A– used inSavic_2017_cladribine.R.
-
-
Example models:
Savic_2017_cladribine.R(multiplicative effect on cladribine non-renal clearance:cl_nonrenal *= (1 + e_ifn_clnr * CONMED_IFNB1A)withe_ifn_clnr = 0.21, i.e. a 21% increase in non-renal CL when coadministered with IFN beta-1a). -
Notes: Captured at the dose-record level in Savic
2017 (multiple-dose study 26486 alternated between cladribine alone and
cladribine + IFN beta-1a periods). The interaction mechanism is not
definitively established in the source paper; Savic 2017 discusses that
the observed effect could alternatively be modelled on bioavailability
or interpreted as a period-effect / interoccasion variability artefact.
Future cladribine + immunomodulator extractions should reuse this
canonical when IFN beta-1a is the specific concomitant agent; register a
sibling canonical (e.g.
CONMED_IFNB1B,CONMED_IFNALPHA) if a different interferon species is intended.
CONMED_IPI_1Q6W (canonical for nivolumab + ipilimumab 1 mg/kg q6w combination indicator)
- Description: 1 = subject is receiving nivolumab in combination with ipilimumab 1 mg/kg every 6 weeks (continuous maintenance); 0 = otherwise.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (any non-1Q6W regimen – monotherapy, chemotherapy combination, or another ipilimumab schedule).
-
Source aliases:
-
IPI1Q6W– used inZhang_2019_nivolumab.R.
-
-
Example models:
Zhang_2019_nivolumab.R(exponential effect on baseline CL:exp(0.159)~= 1.17 fold increase relative to monotherapy). -
Notes: Paired with
CONMED_IPI_3Q3W. See the CONMED_IPI_3Q3W note for how the other ipilimumab schedules collapse into the reference group.
CONMED_IPI_3Q3W (canonical for nivolumab + ipilimumab 3 mg/kg q3w combination indicator)
- Description: 1 = subject is receiving nivolumab in combination with ipilimumab 3 mg/kg every 3 weeks (4-dose induction); 0 = otherwise. Encodes the high-intensity ipilimumab combination regimen as a study-design covariate on nivolumab CL.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (any non-3Q3W regimen – monotherapy, chemotherapy combination, or another ipilimumab schedule).
-
Source aliases:
-
IPI3Q3W– used inZhang_2019_nivolumab.R.
-
-
Example models:
Zhang_2019_nivolumab.R(exponential effect on baseline CL:exp(0.227)~= 1.25 fold increase relative to monotherapy). -
Notes: Paired with
CONMED_IPI_1Q6W; both indicators can coexist in one population, but a single subject has at most one set to 1 in the Zhang 2019 cohort. The remaining ipilimumab schedules (1 mg/kg q3w x 4 induction, 1 mg/kg q12w) had no statistically significant effect on nivolumab CL and were therefore folded into the reference (0) group along with monotherapy, leaving only IPI3Q3W and IPI1Q6W as named non-reference indicators.
CONMED_IPI_ANY (canonical for any-ipilimumab-coadministration indicator)
- Description: 1 = subject is receiving nivolumab in combination with any ipilimumab regimen (regardless of dose or schedule); 0 = nivolumab monotherapy or nivolumab + chemotherapy. Encodes the “is there ipilimumab in the regimen” question as a single binary covariate, distinct from the regimen-specific CONMED_IPI_3Q3W and CONMED_IPI_1Q6W indicators above.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (no ipilimumab coadministration).
-
Source aliases:
-
IPICO– used inZhang_2019_nivolumab.R.
-
-
Example models:
Zhang_2019_nivolumab.R(additive effect on the time-varying-CL Emax parameter: Emax += -0.0668 when CONMED_IPI_ANY = 1). - Notes: Logically the union of the regimen-specific indicators (CONMED_IPI_3Q3W, CONMED_IPI_1Q6W, plus the unmodeled 1 mg/kg q3wx4 and 1 mg/kg q12w schedules). Zhang 2019 uses it on the time-varying Emax (a different structural parameter from baseline CL), which is why it coexists with the regimen-specific indicators on baseline CL rather than substituting for them.
CONMED_METFORMIN (canonical for concomitant metformin co-administration indicator)
-
Description: 1 = subject is on concomitant
metformin during the modelled treatment period, 0 = not. Time-fixed in
source datasets where metformin is a study-design “add-on” arm (e.g.,
Retlich 2015 Study 4 add-on-to-metformin design); permits a time-varying
form for cohorts with on/off metformin transitions, document per-model
via
covariateData[[CONMED_METFORMIN]]$notes. - Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no concomitant metformin).
- Source aliases: none known.
-
Example models:
Retlich_2015_linagliptin.R(1 = study 4 add-on-to-metformin cohort; multiplicative effect on linagliptin relative bioavailability F: +69% F for metformin co-administration vs the monotherapy reference; the effect is attributed to a metformin – linagliptin drug-drug interaction consistent with a separately published DDI study, Graefe-Mody 2009). -
Notes: Follows the
CONMED_*concomitant-medication pattern (AZA / MP / MTX / AMINO / NSAID / PARA / AD / RITUX / AED / CHEMO / EIAED / EFV / AZOLE). Metformin is a widely-co-prescribed first-line T2DM oral antidiabetic; future T2DM-popPK / -DDI extractions should reuse this canonical. Ratified canonically alongside the Retlich 2015 linagliptin extraction.
CONMED_MP (canonical for concomitant 6-mercaptopurine)
- Description: 1 = on concomitant 6-mercaptopurine (6-MP), 0 = not.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no concomitant 6-MP).
-
Source aliases:
MP– used inRosario_2015_vedolizumab.R. -
Example models:
Rosario_2015_vedolizumab.R(power-form on CLL:CLL * 1.04^CONMED_MP). -
Notes: Second thiopurine immunomodulator used in
IBD maintenance; typically mutually exclusive with
CONMED_AZAfor a given subject.
CONMED_MTX (canonical for concomitant methotrexate)
- Description: 1 = on concomitant methotrexate, 0 = not.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no concomitant methotrexate).
-
Source aliases:
MTX– used inRosario_2015_vedolizumab.R. -
Example models:
Rosario_2015_vedolizumab.R(power-form on CLL:CLL * 0.983^CONMED_MTX). - Notes: Immunomodulator used especially in CD maintenance. Generic concomitant-MTX indicator that may also appear in non-IBD models; start as scope: general.
CONMED_NSAID (canonical for concomitant NSAID use)
- Description: 1 = on concomitant non-steroidal anti-inflammatory drug (NSAID) therapy at baseline, 0 = not.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no concomitant NSAID use; typical patient).
-
Source aliases:
-
NSAID– used inLi_2019_abatacept.R.
-
-
Example models:
Li_2019_abatacept.R(exponential effect on CL:CL * exp(0.0640 * CONMED_NSAID); ~6.6% higher CL, not clinically relevant per Li 2019). -
Notes: Baseline-use-only in Li 2019; time-varying
use is permitted, document per-model. Follows the
CONMED_*concomitant-medication pattern established for IBD models (AZA / MP / MTX / AMINO).
CONMED_PARA (canonical for concomitant paracetamol (acetaminophen) use)
- Description: 1 = on concomitant paracetamol (acetaminophen) at the observation, 0 = not.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no concomitant paracetamol).
-
Source aliases:
-
PCM– used inPlan_2012_pain.R(DDMORE Foundation Model Repository entry DDMODEL00000194).
-
-
Example models:
Plan_2012_pain.R(additive shift on the placebo-effect logit of the typical pain-score lambda:phl = logit(TVLAM) + 0.364 * CONMED_PARA; mean pain score ~9% higher on the 0-10 Likert scale during paracetamol use). -
Notes: Distinct from
CONMED_NSAID– paracetamol is not classed as an NSAID (no anti-inflammatory mechanism, distinct AE profile). Time-varying use is permitted; the daily Likert measurements in Plan 2012 carry PCM as a per-observation flag. Follows theCONMED_*concomitant-medication pattern established for IBD models (AZA / MP / MTX / AMINO) andCONMED_NSAID(Li 2019). Ratified canonically alongside the Plan 2012 DDMORE extraction.
CONMED_PB (canonical for concomitant phenobarbital coadministration indicator)
- Description: 1 = subject is taking phenobarbital (PB) – including primidone, which is metabolised to phenobarbital and is conventionally pooled with PB – as a concomitant antiepileptic drug at the PK observation, 0 = no concomitant phenobarbital / primidone. Phenobarbital is a broad-spectrum CYP and UGT inducer that increases the apparent clearance of co-administered drugs. Time-varying when phenobarbital starts / stops within the observation window; time-fixed when the source paper analyses chronic-maintenance cohorts whose AED therapy is stable.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no concomitant phenobarbital or primidone).
-
Source aliases:
-
PB– used inSchoemaker_2017_brivaracetam.R(paper covariatePBfor phenobarbital or primidone coadministration; the source pools primidone with phenobarbital because primidone is metabolised to phenobarbital).
-
-
Example models:
Schoemaker_2017_brivaracetam.R(multiplicative effect on apparent oral clearance:cl *= (1 + 0.408 * CONMED_PB); +40.8% relative to no-PB reference, corresponding to ~29% lower brivaracetam exposure, Schoemaker 2017 Table 1). -
Notes: Drug-specific CONMED_* indicator anticipated
in the [[CONMED_AED]] notes; used when a paper estimates a separate
phenobarbital-induction effect distinct from the pooled EIAED / AED
class effect. Per-model
covariateData[[CONMED_PB]]$notesshould document whether primidone is pooled with phenobarbital (Schoemaker 2017 pools them because primidone is metabolised to phenobarbital). Distinct from the broader [[CONMED_EIAED]] (any enzyme-inducing AED) and [[CONMED_AED]] (any concomitant AED). When a paper distinguishes individual AEDs separately, use the drug-specific canonicals [[CONMED_CBZ]],CONMED_PB, [[CONMED_VPA]] rather than collapsing into the class-level indicator. Ratified canonically on 2026-05-20 alongside the Schoemaker 2017 brivaracetam paediatric extraction.
CONMED_PDE5I (canonical for concomitant PDE5-inhibitor monotherapy indicator in PAH)
- Description: 1 = patient is on a concomitant phosphodiesterase type 5 inhibitor (PDE5I) PAH therapy (e.g., sildenafil, tadalafil) alone – i.e. on a PDE5 inhibitor but NOT also on an endothelin-receptor antagonist; 0 = otherwise. Time-fixed per subject for the PK/PD analysis (PAH comedication at baseline, stable dose required before selexipag start).
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (PAH-comedication-naive OR on
ERA alone OR on both ERA+PDE5; mutually exclusive with
CONMED_ERAandCONMED_ERA_PDE5I). -
Source aliases:
-
PAHCOMEDcategory 2 (“PDE5 inh.”) – decomposed from the categoricalPAHCOMEDsource column with levels {naive, ERA, PDE5, ERA+PDE5} inKrause_2017_selexipag.R.
-
-
Example models:
Krause_2017_selexipag.R(multiplicative effect on the ACT-333679 elimination rate constant:km *= (1 + 0.07 * CONMED_PDE5I); +7% relative to the PAH-comedication-naive reference, Krause 2017 Table 1. The PDE5-only coefficient is statistically not significant (p = 0.19) but retained in the final model so the four-level PAH-comedication categorical is preserved end-to-end). -
Notes: Used together with
CONMED_ERAandCONMED_ERA_PDE5Ito decompose a four-level PAH-comedication categorical (naive / ERA-only / PDE5-only / ERA-and-PDE5) into three orthogonal mutually-exclusive binary indicators. Specific scope because the indicator’s semantics are tied to the GRIPHON study’s PAH-comedication taxonomy.
CONMED_PPI (canonical for concomitant proton-pump inhibitor use)
- Description: 1 = patient on concomitant proton-pump inhibitor (CONMED_PPI) therapy, 0 = no CONMED_PPI use. Captures gastric-pH-elevating co-medication that can reduce the bioavailability of solubility-limited (typically weakly basic) orally administered drugs.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no CONMED_PPI use).
-
Source aliases:
-
CONMED_PPI– used inGoel_2016_Sonidegib.R(Goel 2016 dataset; defined as “significant” CONMED_PPI use, i.e. duration of CONMED_PPI use >= 80% of the PK assessment phase).
-
-
Example models:
Goel_2016_Sonidegib.R(multiplicative effect on F:0.696^CONMED_PPI, ~30% lower bioavailability under CONMED_PPI coadministration). -
Notes: Per-model
covariateData[[CONMED_PPI]]$notesmust document the operational definition (e.g., Goel 2016 requires CONMED_PPI use covering >= 80% of the PK assessment window; other studies may use a simpler ever-vs-never indicator or a per-record time-varying flag). Distinct fromCONMED_H2RA(H2-receptor antagonist) which acts on gastric pH via a different mechanism.
CONMED_PROBENECID (canonical for concomitant probenecid co-administration indicator)
- Description: 1 = subject coadministered probenecid (organic-anion transport inhibitor; multidrug resistance protein (MRP) family modulator with documented activity at the blood-brain barrier, blood-CSF barrier, and renal tubular secretion sites), 0 = no concomitant probenecid. Time-varying per subject because probenecid exposure can start and stop within the observation period.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (no probenecid coadministration).
-
Source aliases:
- Paper-specific day-2 treatment-arm indicator – used in
Xie_2000_m3g_rat.R(the paper distinguishes a probenecid-treatment arm from a control arm by experimental day; the canonical column is 1 during the day-2 probenecid co-infusion and 0 otherwise, including on day 1 of the probenecid arm before the probenecid loading dose).
- Paper-specific day-2 treatment-arm indicator – used in
-
Example models:
Xie_2000_m3g_rat.R(multiplicative exponential effect on the unbound BBB influx clearance CL_u,in:cluin *= exp(e_conmed_probenecid_cluin * CONMED_PROBENECID)withe_conmed_probenecid_cluin = log(0.17 / 0.11) = 0.4353, i.e. a 1.55-fold increase in CL_u,in into rat brain ECF under probenecid co-administration; CL_u,out and the intercompartmental brain clearance Q_br are not statistically affected by probenecid in the paper’s model selection and so are not paired with this canonical). -
Notes: Scope: specific because the only on-disk
source is a preclinical microdialysis rat BBB-transport paper (Xie 2000)
and the column meaning is intrinsically tied to the day-2 probenecid
co-infusion design rather than a general clinical-coadministration
indicator. Per-model
covariateData[[CONMED_PROBENECID]]$notesmust document the dose / regimen of probenecid used (Xie 2000: 70 umol/kg IV loading dose plus 70 umol/kg/h constant infusion in male Sprague-Dawley rats), the parameter the indicator modifies, and any per-subject vs per-record time-varying convention.
CONMED_RIF_LPVR4 (canonical for concomitant rifampicin-based antitubercular treatment with super-boosted lopinavir/ritonavir 4:4 indicator)
- Description: 1 = subject is receiving rifampicin-based antitubercular treatment together with super-boosted lopinavir/ritonavir 4:4 (extra ritonavir added to standard 4:1 LPV/r to counter rifampicin-driven LPV induction); 0 = subject is on the comparator regimen specified by the source paper (typically standard LPV/r 4:1 without rifampicin in Tikiso 2021). Used to flag the combined induction effect of rifampicin (PXR-mediated UGT induction) plus extra ritonavir on co-administered antiretroviral PK.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-RIF + standard LPV/r 4:1 reference).
-
Source aliases:
-
RIF– used inTikiso_2021_abacavir.R(the dataset’s paper-defined indicator, 1 = on rifampicin-based TB treatment with super-boosted LPV/r 4:4, 0 = standard LPV/r 4:1 or EFV).
-
-
Example models:
Tikiso_2021_abacavir.R(multiplicative effect on bioavailability:f_depot *= (1 + (-0.294) * CONMED_RIF_LPVR4); -29.4% relative to the LPV/r 4:1 reference). - Notes: Specific scope because the joint rifampicin + super-boosted-LPV/r contrast is paper-defined; the underlying induction is plausibly rifampicin-driven (the paper’s discussion notes that LPV concentrations were similar with vs without super-boosting, weakening a separate ritonavir contribution), but the canonical column captures the joint indicator the paper modeled. Future TB/HIV co-treatment models that test the same regimen contrast should extend the example list rather than register a new canonical.
CONMED_RITUX (canonical for concomitant rituximab combination therapy)
- Description: 1 = on concomitant rituximab combination therapy (with or without backbone chemotherapy), 0 = not on rituximab.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no concomitant rituximab; single-agent or non-rituximab combination).
-
Source aliases:
-
RITUX– used inWu_2024_inotuzumab.R.
-
-
Example models:
Wu_2024_inotuzumab.R(additive fractional change on CL1:CL1 * (1 + (-0.132) * CONMED_RITUX)~= 13% lower CL1 with concomitant rituximab). -
Notes: Wu 2024 Table 3 footnote b explicitly flips
the reference category vs. the predecessor Garrett 2019 adult model: in
Garrett 2019 the reference was “with rituximab” (RITUX = 0 meant
on-rituximab), whereas in Wu 2024 the reference is “without rituximab”
(RITUX = 0 means no concomitant rituximab). Future models that pool an
analogous rituximab-combination cohort with a single-agent reference
should use this canonical with the Wu 2024 sign convention; if a paper
retains the Garrett 2019 reverse-coded convention, document the value
transformation in
covariateData[[CONMED_RITUX]]$notes(CONMED_RITUX = 1 - source$RITUX) rather than registering a second canonical. Ratified canonically on 2026-04-26.
CONMED_SPART (canonical for spartalizumab (PDR001, anti-PD-1) coadministration indicator)
- Description: 1 = the analyzed therapeutic mAb is coadministered with spartalizumab (PDR001, anti-PD-1 IgG4), 0 = no spartalizumab coadministration. Time-fixed per subject in source analyses to date.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (no spartalizumab coadministration – monotherapy or combination with non-spartalizumab agents such as a hypomethylating agent).
-
Source aliases:
-
HASPDR– used inXu_2023_MBG453.R(Monolix supplement Appendix S2; the source describes the column as “this patient HAS received PDR001 [spartalizumab, anti PD-1 mAb]”).
-
-
Example models:
Xu_2023_MBG453.R(exponential effect on CL:exp(0.0194 * CONMED_SPART); not statistically significant in the full covariate model but retained because Xu 2023 used the full-covariate-model approach). -
Notes: Parallels
COMBO_NIVO(ipilimumab + nivolumab) andCOMBO_DURVA(durvalumab combinations) but for spartalizumab. Promote to general scope if a second paper reports a spartalizumab-coadministration covariate with comparable encoding.
CONMED_STATIN (canonical for concomitant conmed_statin (HMG-CoA reductase inhibitor) therapy)
- Description: 1 = patient coadministered a conmed_statin (HMG-CoA reductase inhibitor) during the study, 0 = no conmed_statin coadministration.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no conmed_statin coadministration).
- Source aliases: none known.
-
Example models:
Martinez_2019_alirocumab.R(additive effect on linear clearance CLL:CLL = TVCLL + COV1*(WT-82.9) + COV2*CONMED_STATIN; +0.00644 L/h when conmed_statin is coadministered). -
Notes: Per-model
covariateData[[CONMED_STATIN]]$notesmust document which statins and dose thresholds are included in the “CONMED_STATIN = 1” category, since inclusion criteria vary by study. Martinez 2019 codes CONMED_STATIN = 1 for coadministration of rosuvastatin (< 20 mg/day), atorvastatin (< 40 mg/day), or simvastatin (any dose); other conmed_statin regimens are coded as 0.
CONMED_STATIN_MONO (canonical for concomitant statin-monotherapy indicator)
- Description: 1 = patient is on a conmed_statin and no other lipid-lowering comedication (conmed_statin monotherapy), 0 = not on conmed_statin monotherapy (either on no lipid-lowering therapy or on a multi-drug lipid-lowering combination such as conmed_statin + ezetimibe).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (not on conmed_statin monotherapy).
-
Source aliases:
- Derived from a conmed_statin-identifier column in the source (any of atorvastatin, rosuvastatin, simvastatin, lovastatin, pravastatin, pitavastatin, fluvastatin) with an AND over “no other lipid-lowering comedication”.
-
Example models:
Kuchimanchi_2018_evolocumab.R(multiplicative effect 1.13 on Vmax:Vmax * 1.13^CONMED_STATIN_MONO). -
Notes: Scope: specific because Kuchimanchi 2018
narrowly defines the conmed_statin covariate as monotherapy only
(“patients on a conmed_statin only and no other comedication”). Mutually
compatible with
CONMED_EZE: a subject on conmed_statin+ezetimibe hasCONMED_STATIN_MONO = 0andCONMED_EZE = 1; a subject on conmed_statin alone hasCONMED_STATIN_MONO = 1andCONMED_EZE = 0; a subject on no lipid-lowering therapy has both 0. Future popPK/PD models that adopt a broader “any conmed_statin” definition should register a separateCONMED_STATINorCONMED_STATINcanonical rather than reusing this name.
CONMED_STEROID (canonical for baseline/concomitant systemic corticosteroid use)
- Description: 1 = patient on systemic corticosteroid therapy at baseline (typically continued as concomitant medication during the study), 0 = no baseline corticosteroid use. Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no baseline corticosteroid use).
-
Source aliases:
-
BSTEROID– used inNarwal_2013_sifalimumab.RandZheng_2016_sifalimumab.R.
-
-
Example models:
Narwal_2013_sifalimumab.R(multiplicative on CL:CL * (1 + 0.195 * CONMED_STEROID)),Zheng_2016_sifalimumab.R(multiplicative on CL(1 + 0.11 * CONMED_STEROID)and on V1(1 - 0.09 * CONMED_STEROID)in the SLE phase IIb cohort, which was ~85% conmed_steroid-treated at baseline). -
Notes: Distinct from
PRICORT, which is strictly a prior (pre-study) indicator.CONMED_STEROIDcaptures concurrent corticosteroid use at / from study baseline in diseases where background conmed_steroid use is standard of care (SLE, severe asthma, etc.). When a future paper needs the two jointly, both can coexist on the same subject. The nameSTEROID_BLwas used as an alias in earlier register drafts and is retired; useCONMED_STEROIDfor all future models.
CONMED_VPA (canonical for concomitant valproate (valproic acid) coadministration indicator)
- Description: 1 = subject is taking valproate (valproic acid, sodium valproate, divalproex) as a concomitant antiepileptic drug at the PK observation, 0 = no concomitant valproate. Valproate is a broad-spectrum AED that inhibits UGT and CYP2C9, and chronic use is associated with weight gain. Time-varying when valproate starts / stops within the observation window; time-fixed when the source paper analyses chronic-maintenance cohorts whose AED therapy is stable.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no concomitant valproate).
-
Source aliases:
-
VPA– used inSchoemaker_2017_brivaracetam.R(paper covariateVPAfor valproate coadministration).
-
-
Example models:
Schoemaker_2017_brivaracetam.R(multiplicative effect on apparent oral clearance:cl *= (1 - 0.101 * CONMED_VPA); -10.1% relative to no-VPA reference, corresponding to ~11% higher brivaracetam exposure, Schoemaker 2017 Table 1). -
Notes: Drug-specific CONMED_* indicator anticipated
in the [[CONMED_AED]] notes. Schoemaker 2017 retained the VPA effect in
the final model even though it did not formally meet the SCM inclusion
criteria (forward p < 0.01) because quantifying its contribution was
considered informative; the authors note the apparent VPA-induced
exposure rise may be confounded with VPA-driven weight / fat gain in
chronic users. Distinct from the broader [[CONMED_AED]] (any concomitant
AED). When a paper distinguishes individual AEDs separately, use the
drug-specific canonicals [[CONMED_CBZ]], [[CONMED_PB]],
CONMED_VPArather than collapsing into the class-level indicator. Ratified canonically on 2026-05-20 alongside the Schoemaker 2017 brivaracetam paediatric extraction.
PRICORT (canonical for prior corticosteroid use indicator)
- Description: 1 = patient received systemic corticosteroid treatment prior to study entry, 0 = no prior corticosteroid use. Time-fixed per subject.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no prior corticosteroid use).
- Source aliases: none.
-
Example models:
Ma_2020_sarilumab_das28crp.R(multiplicative on DAS28-CRP Kout:Kout * theta^PRICORT),Ma_2020_sarilumab_anc.R(power-form on Emax:Emax * 0.819^PRICORT). -
Notes: Ma 2020 applies it as a multiplicative
effect of the form
param * theta^PRICORTin both DAS28-CRP and ANC PD models. Generally applicable clinical-history indicator.
PRIOR_ANTHRACYCLINE_DOSE (canonical for prior cumulative anthracycline dose)
- Description: Cumulative dose of anthracycline chemotherapy received by the subject prior to the first dose analysed in the current popPK / popPK-PD model, expressed in doxorubicin-equivalent body-surface-area-normalised mg/m^2. Time-fixed per subject for the analysis window (the running cumulative anthracycline dose at the first observed dose).
- Units: mg/m^2 (doxorubicin-equivalent)
- Type: continuous
- Scope: specific
-
Reference category: n/a – typically used as a
linear shift
(
(1 + theta * (PRIOR_ANTHRACYCLINE_DOSE - ref))) on a baseline parameter (e.g., baseline cardiac troponin I before the next anthracycline cycle). Reference values observed: 90 mg/m^2 (Kunarajah 2017, cohort median). -
Source aliases:
-
PCAMT– Kunarajah 2017 NM-TRAN convention (“Prior Cumulative Anthracyclines aMounT”; doxorubicin-equivalent mg/m^2).
-
-
Example models:
Kunarajah_2017_doxorubicin.R(linear shift on baseline cardiac troponin I:bl_cTnI * (1 + 0.00308 * (PRIOR_ANTHRACYCLINE_DOSE - 90))– ~0.31% increase in baseline cTnI per 1 mg/m^2 of prior cumulative anthracycline exposure). -
Notes: Distinct from
PRIOR_ANTICANCER(a binary modality indicator, 1 = any prior anticancer therapy) –PRIOR_ANTHRACYCLINE_DOSEcarries the actual cumulative dose, restricted to the anthracycline drug class (doxorubicin, daunorubicin, epirubicin, idarubicin), and is the column needed when the source paper’s effect is dose-response in the prior-exposure regime rather than presence / absence. When a paper records anthracycline exposure as anthracycline-class-by-class doses and the model effect aggregates them, sum to a single doxorubicin-equivalent value before populating this column (use the published bone-marrow / cardiotoxicity isoeffective conversion factors). When a paper distinguishes the type of anthracycline (e.g., doxorubicin vs daunorubicin separately), register parallel canonicals (PRIOR_DOXORUBICIN_DOSE,PRIOR_DAUNORUBICIN_DOSE) rather than overloading this name. Scope: specific because the column meaning is intrinsically tied to anthracycline-class chemotherapy exposure; promote to general if a second paper ratifies the same definition.
PRIOR_ANTICANCER (canonical for prior anticancer therapy of any modality)
-
Description: 1 = subject received any prior
anticancer therapy (cytotoxic chemotherapy, radiotherapy, hormonal
therapy, targeted therapy, immunotherapy, or surgical debulking with
adjuvant intent) before the start of the analyzed treatment, 0 =
treatment-naive. Broader than
LINE_1L, which is specifically a treatment-line indicator restricted to systemic drug therapy lines.PRIOR_ANTICANCERcaptures the full clinical concept of prior cancer treatment exposure as used in cytotoxic-chemotherapy myelosuppression analyses, where any prior anticancer modality may have depleted the bone-marrow proliferating pool and therefore affects baseline ANC. - Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (treatment-naive; no prior anticancer therapy of any modality).
-
Source aliases:
-
PC(Kloft 2006 / Netterberg 2017 NM-TRAN convention for “previous anticancer therapy”; values 0 = naive, 1 = had prior anticancer therapy) – used inNetterberg_2017_docetaxel.R.
-
-
Example models:
Netterberg_2017_docetaxel.R(multiplicative effect on baseline ANC:BACOV *= (1 + theta * PRIOR_ANTICANCER)with theta = -0.147; prior-anticancer patients have ~14.7% lower baseline ANC than treatment-naive patients). -
Notes: Distinct from
LINE_1L(which is the inverse semantics for systemic-drug therapy lines only) andPRIOR_TNF/PRIOR_BIO(which are modality-specific to anti-TNF / biologic exposure in inflammatory-disease cohorts). UsePRIOR_ANTICANCERwhen the source paper’s covariate counts any anticancer modality (including radiotherapy and surgery) as prior exposure. When a future paper restricts the indicator to cytotoxic chemotherapy alone, useLINE_1L(with values inverted: paper’sPRIOR_CHEMO = 1 - LINE_1L). When a paper distinguishes prior chemotherapy from prior radiotherapy, register a parallelPRIOR_RADIATIONcanonical.
PRIOR_BIO (canonical for prior biologic exposure)
-
Description: 1 = subject previously treated with
any biologic (broader than
PRIOR_TNF: includes anti-TNF agents plus anti-integrin, anti-IL-12/23, anti-IL-17, anti-IL-23, anti-IL-6, etc.), 0 = biologic-naive. - Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (biologic-naive).
-
Source aliases:
-
bio-naive(Aguiar 2021 source-paper variable, with the indicator inverted: paper’sbio-naive = 1 - PRIOR_BIO). Effect coefficients in the source paper apply tobio-naive; inmodel()derivebio_naive <- 1 - PRIOR_BIOto preserve the paper’s reported coefficient.
-
-
Example models:
Aguiar_2021_ustekinumab.R(Aguiar 2021 Table 2 footnote a; multiplicative effect on CL: factor(1 - 0.227 * (1 - PRIOR_BIO)), so bio-naive patients have ~23% lower CL than previously-exposed patients). -
Notes: Distinct from
PRIOR_TNF(a strict subset). UsePRIOR_BIOwhen the source paper’s covariate counts any biologic as prior exposure (anti-TNF, anti-integrin, anti-IL-12/23, anti-IL-17, anti-IL-23, anti-IL-6, etc.); usePRIOR_TNFwhen the source paper specifically tested anti-TNF exposure. When the source paper uses the inverted “bio-naive” indicator (1 = naive), document the inversion incovariateData[[PRIOR_BIO]]$notesand apply1 - PRIOR_BIOinmodel()so the canonical column stores 1 = previously exposed.
PRIOR_TNF (canonical for prior anti-TNF biologic exposure indicator)
- Description: 1 = subject previously treated with an anti-TNF (tumor necrosis factor) inhibitor, 0 = TNF-naive.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (TNF-naive).
-
Source aliases:
-
PRIORTNF(all caps, no underscore) – acceptable alternative spelling.
-
-
Example models:
Moein_2022_etrolizumab.R(multiplicative fractional effect on CL, +4.9%). - Notes: Use when the source paper reports a binary “prior anti-TNF inhibitor” covariate on any PK parameter. Generally applicable across RA/PsA/IBD/axSpA biologic PK models.
PRIOR_IPI (canonical for prior ipilimumab treatment indicator)
- Description: 1 = subject previously treated with ipilimumab (anti-CTLA-4 monoclonal antibody) before the start of the current anti-PD-1 / anti-PD-L1 (or other) checkpoint-inhibitor regimen, 0 = ipilimumab-naive.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (ipilimumab-naive). Document per-paper how subjects with missing IPI status are pooled (Ahamadi 2017 pools “missing” with naive in the final coefficient; some papers may report a separate “missing” effect).
-
Source aliases:
-
IPI(Ahamadi 2017; categorical with levelsIPI-naive,IPI-treated,missing) – decompose intoPRIOR_IPI = as.integer(IPI == "IPI-treated")and treat the missing category like naive unless the source paper retains a separate “missing” coefficient.
-
-
Example models:
Ahamadi_2017_pembrolizumab.R(proportional changes on CL of +14.0% and on Vc of +7.36% for IPI-treated relative to IPI-naive; “missing” 26.4% of cohort is pooled with naive in the canonical encoding because Table 3 reports only the naive-vs-treated coefficient). -
Notes: Distinct from
PRIOR_ANTICANCER(any modality),PRIOR_BIO(any biologic),PRIOR_TNF(anti-TNF biologic). UsePRIOR_IPIwhen the source paper specifically tested prior ipilimumab exposure as a covariate; this is a common covariate in advanced-melanoma popPK analyses where ipilimumab was the standard-of-care immune-checkpoint inhibitor preceding PD-1 / PD-L1 entrants. Ratified canonically on 2026-05-17 alongside the Ahamadi 2017 pembrolizumab extraction.
Rheumatoid-arthritis disease-activity covariates
RHEUMATOID_FACTOR (canonical for serum rheumatoid factor concentration)
-
Description: Serum rheumatoid factor (an
autoantibody, predominantly IgM, directed against the Fc portion of IgG)
concentration. Baseline value typical; document time-varying use in
per-model
notes. -
Units: U/mL or IU/mL (interchangeable in the
clinical-PK literature). Document per-model via
covariateData[[RHEUMATOID_FACTOR]]$units. - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
on a log-transformed value:
(log(RHEUMATOID_FACTOR) / log(ref))^exponent(or, equivalently, the source-paper form(LRF / log(ref))^exponentwhereLRF = log(RHEUMATOID_FACTOR)). Reference value observed: 110 U/mL (Frey 2010, corresponding to LRF = 4.7 in the paper’s final-model equation). -
Source aliases:
-
LRF– log-transformed RF (natural log of the value in U/mL); Frey 2010 fits the covariate on the log scale and reports the reference asLRF = 4.7(i.e.,log(110) ~= 4.7). The canonical column carries the raw RF concentration in U/mL; the log transform is applied insidemodel(). -
RF– universal NONMEM/clinical-PK abbreviation; rejected as the canonical name on 2026-04-28 because the bare two-letter abbreviation is uncommon in published popPK papers and could be confused with other shortenings.
-
-
Example models:
Frey_2010_tocilizumab.R(U/mL, reference 110 U/mL == LRF = 4.7; small positive exponent +0.1 on linear CL applied tolog(RHEUMATOID_FACTOR)). - Notes: RF concentrations span several orders of magnitude across the rheumatoid-arthritis population (Frey 2010 observed range 15-11,800 U/mL across the four phase-III studies; reference paper: Frey 2010 Table I), motivating the log transform before power scaling. The mechanistic rationale (Frey 2010 Discussion, p764) is that RF – being an anti-IgG autoantibody – could in principle bind the Fc region of the therapeutic IgG monoclonal antibody and accelerate clearance, but the observed CL effect was small (-4.9% to +6.5% across the observed RF range) and the paper acknowledges that high RF concentrations may also reduce the assay’s ability to detect the drug, leading to an apparent CL increase. Ratified canonically on 2026-04-28 alongside the Frey 2010 extraction.
BLPHYVAS (canonical for baseline physician’s global assessment VAS)
- Description: Baseline Physician’s Global Assessment of Disease Activity, 100-mm visual analogue scale (0 = no disease activity, 100 = maximum). Time-fixed per subject.
- Units: mm (0-100 VAS)
- Type: continuous
- Scope: general
-
Reference category: n/a – used as a power term
(BLPHYVAS / <ref>)^exponent. Reference 66 used in Ma 2020. - Source aliases: none.
-
Example models:
Ma_2020_sarilumab_das28crp.R. - Notes: One of the components of the DAS28 composite score; in Ma 2020 it appears as a baseline covariate on the DAS28-CRP disease-activity BASE rather than on the score itself. Applicable to any rheumatology model where baseline physician-assessed disease activity is used as a PK/PD covariate.
BLHAQ (canonical for baseline HAQ-DI score)
- Description: Baseline Health Assessment Questionnaire Disability Index (HAQ-DI; 0 = no disability, 3 = maximum disability). Time-fixed per subject.
- Units: unitless (0-3 composite score)
- Type: continuous
- Scope: general
-
Reference category: n/a – used as a power term
(BLHAQ / <ref>)^exponent. Reference 1.75 used in Ma 2020. - Source aliases: none.
-
Example models:
Ma_2020_sarilumab_das28crp.R. - Notes: Patient-reported disability score frequently used as a baseline covariate in rheumatoid-arthritis PK/PD analyses.
PAIN (canonical for patient-reported global pain visual analogue score)
-
Description: Patient-reported global pain on a
100-mm visual analogue scale (PAIN; 0 = no pain, 100 = worst imaginable
pain). Distinct from
BLPHYVAS(the physician’s global assessment of disease activity). Both baseline and time-varying usages are covered; document per-model incovariateData[[PAIN]]$noteswhether the column is baseline-only. - Units: mm (0-100 VAS).
- Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(PAIN / ref)^exponent. Reference value observed: 60 in Frey 2013 (approximate dataset median across OPTION/TOWARD). - Source aliases: none known (the PAIN column name is used directly in the Frey 2013 NONMEM control stream).
-
Example models:
Frey_2013_tocilizumab.R(baseline; power effect on the indirect-response BASE parameter with reference 60 and exponent 0.062). -
Notes: PAIN can take the value 0 in real cohorts (a
patient who reports no pain at the assessment), which makes the bare
power form
(PAIN/60)^expreturn 0. Frey 2013 documents an explicit 0.010 floor on PAIN in its Table 2 covariate range; the model file applies the same floor insidemodel()so simulation under PAIN = 0 returns a finite BASE rather than collapsing to zero. Canonical name follows theEOS/EASIconvention (noBLprefix); baseline-vs-time-varying status is per-model.
SWOL_28JOINT (canonical for 28-joint swollen joint count)
-
Description: Swollen joint count on the 28-joint
(DAS28) scale (integer 0-28; component of the DAS28 composite). Baseline
value is typical; document time-varying use in per-model
notes. - Units: count (0-28)
- Type: continuous
- Scope: general
-
Reference category: n/a – used as a shifted power
term
((SWOL_28JOINT + 1)/(<ref> + 1))^exponentto avoid the zero-count edge case. Reference value observed: 16 in Li 2019 (approximate dataset median of the popPK cohort). -
Source aliases:
-
SWOL– used inLi_2019_abatacept.R(Li 2019 Methods abbreviation).
-
-
Example models:
Li_2019_abatacept.R(power effect on CL with exponent 0.0965; not clinically relevant per Li 2019). -
Notes: The
_28JOINTsuffix distinguishes this from the 66/68-joint swollen count used in some earlier RA scales – register a separate canonical (SWOL_66JOINTor similar) if a future paper uses a different joint-count scale. Canonical name drops theBLprefix to match theEASI/AGE/WT/ALBconvention where baseline-vs-time-varying status is documented incovariateDatanotes rather than the column name.
Pharmacogenetics
CYP2D6 (canonical for CYP2D6 individual metabolic-activity score)
-
Description: Continuous individual-level CYP2D6
metabolic-activity score. The intent is a single canonical column for
any CYP2D6 phenotype proxy that the source paper reports as a continuous
number (probe-substrate model-based individual clearance,
copy-number-corrected expression score, activity-score sum from
*allelegenotypes, etc.); the per-modelcovariateData[[CYP2D6]]$units,description, andnotesdocument which proxy is in force and the population-median reference value used inside the model. Time-invariant in all known examples (germline genotype or one-time probe-substrate measurement). -
Units: Paper-specific – document per-model (e.g.,
ng/Lin Ter Heine 2014 where the value is the dextromethorphan-probe model-based individual CYP2D6 clearance). - Type: continuous
- Scope: general
-
Reference category: n/a (continuous). Models center
on a population median (e.g., 1560 ng/L in Ter Heine 2014); document the
reference value per-model in
covariateData[[CYP2D6]]$notes. -
Source aliases:
-
CYP2D6– used directly inTerHeine_2014_tamoxifen.R.
-
-
Example models:
TerHeine_2014_tamoxifen.R(power-law effect on the tamoxifen -> endoxifen formation clearance:(CYP2D6 / 1560)^e_CYP2D6_cl_endx). -
Notes: TODO – consolidate the various CYP2D6
phenotype encodings (continuous probe-derived activity,
copy-number-corrected score, categorical PM/IM/EM/UM phenotype,
activity-score sum) into a single canonical
CYP2D6column with an enumeratednotes-documentedencodingfield, OR introduce companion canonicals (CYP2D6_PHENO_GROUPfor the categorical PM/IM/EM/UM grouping,CYP2D6_ACTSCOREfor the AS sum) when a future model needs the categorical form. The current general-scope continuous canonical is sufficient for the Ter Heine 2014 use case but will likely need refinement as more CYP2D6-aware models are added.
CYP3A4 (canonical for CYP3A4 / CYP3A4-and-CYP3A5 individual metabolic-activity score)
-
Description: Continuous individual-level CYP3A4 (or
combined CYP3A4 + CYP3A5) metabolic-activity score. Same intent and
documentation policy as
CYP2D6above. Some sources measure CYP3A4 alone via probe substrate; others (Ter Heine 2014) report a combined CYP3A4/5 activity because the chosen probe (dextromethorphan N-demethylation) cannot distinguish CYP3A4 from CYP3A5. The per-modelnotesfield documents whether the value is CYP3A4-only or the CYP3A4 + CYP3A5 combined score. -
Units: Paper-specific – document per-model (e.g.,
ng/Lin Ter Heine 2014 where the value is the dextromethorphan-probe model-based individual CYP3A4/5 clearance). - Type: continuous
- Scope: general
- Reference category: n/a (continuous). Models center on a population median (e.g., 44.7 ng/L in Ter Heine 2014); document the reference value per-model.
-
Source aliases:
-
CYP3A4– used directly inTerHeine_2014_tamoxifen.R(the column carries combined CYP3A4 + CYP3A5 activity per the source paper). -
CYP3A4/5– long form sometimes used in source manuscripts; standardize the column name toCYP3A4and document the combined-isoform semantics in per-modelnotes.
-
-
Example models:
TerHeine_2014_tamoxifen.R(power-law effect on the tamoxifen -> endoxifen formation clearance:(CYP3A4 / 44.7)^e_CYP3A4_cl_endx). -
Notes: TODO – register the rest of the canonical
drug-metabolizing-CYP set prospectively (
CYP1A2,CYP2A6,CYP2B6,CYP2C8,CYP2C9,CYP2C19,CYP2E1,CYP3A5) using the same continuous-individual-activity-score pattern, so future popPK models that report CYP-probe-derived covariates can drop straight into the existing convention rather than re-deliberating the encoding each time. Coordinate with the consolidation TODO onCYP2D6so categorical-vs-continuous encoding is handled uniformly across all CYPs.
CYP3A5_EXPR (canonical for CYP3A5 expresser status)
- Description: 1 = subject carries at least one functional CYP3A51 allele (genotype 1/1 or 1/3, equivalent to one or two A alleles at rs776746); 0 = homozygous CYP3A53/*3 (G/G at rs776746) – i.e., a nonexpresser. Time-fixed per subject (germline genotype).
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (CYP3A53/3 nonexpresser).
-
Source aliases:
-
X– used inBergmann_2014_tacrolimus.R(Bergmann 2014 Table 2 footnote:X = 1for 1/1 and 1/3;X = 0for 3/3). -
CYP3A5 expresser– used inStorset_2014_tacrolimus.R(Storset 2014 Table 2 final theory-based model;*1/*1and*1/*3pooled as expressers because Storset 2014 had only n = 3 1/1 subjects).
-
-
Example models:
Bergmann_2014_tacrolimus.R(multiplicative effect on tacrolimus CL/F:theta_CYP3A5 ^ CYP3A5_EXPR, withtheta_CYP3A5 = 1.60; expressers have 60% higher apparent oral clearance than nonexpressers),Storset_2014_tacrolimus.R(multiplicative effects on apparent plasma clearance:cl *= 1.30^CYP3A5_EXPR; and on oral bioavailability:fdepot *= 0.82^CYP3A5_EXPR; Storset 2014 Table 2 final theory-based model). -
Notes: Distinct from the SNP-pattern canonical
SNP_<GENE>_<RSID>(which encodes “mutant allele presence” – 1 = at least one variant allele). For CYP3A5 the 3 allele (rs776746 G) is the variant that abolishes function, so a literal “mutant-allele-presence” indicator (1 = any G allele) would group 1/3 heterozygotes with the 3/3 nonexpressers, which is the opposite of the clinically meaningful expresser-vs-nonexpresser dichotomy used by every CYP3A5-aware popPK model. TheCYP3A5_EXPRcanonical preserves the expresser-equals-1 orientation directly. Future CYP3A5 papers using a 3/3 indicator (rather than 1 carrier) should still record their values underCYP3A5_EXPRand document the value inversion innotes(CYP3A5_EXPR = 1 - source_indicator); registering a parallelCYP3A5_NONEXPRis discouraged. The canonical name follows the<gene>_<phenotype>rather than the<gene>_<rsid>pattern because the column captures derived metabolic phenotype rather than raw genotype. Distinct fromCYP3A4(continuous individual-activity score for CYP3A4 / CYP3A4 + CYP3A5 combined): the binaryCYP3A5_EXPRis the right fit for source papers that report only the rs776746 genotype, while the continuousCYP3A4is for sources that report a probe-substrate-derived activity number. Ratified canonically on 2026-05-08 alongside the Bergmann 2014 extraction.
CYP2C9_EM (canonical for CYP2C9 extensive-metabolizer phenotype indicator)
- Description: 1 = subject is a CYP2C9 extensive metabolizer (genotype 1/1; wild-type homozygote with full enzyme activity); 0 = subject is a CYP2C9 intermediate or poor metabolizer (heterozygous or homozygous carrier of any reduced-function allele such as 2, 3, *13). Time-fixed per subject (germline genotype-derived phenotype).
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (intermediate or poor metabolizer; reduced-function-allele carrier).
-
Source aliases:
-
CYP2C9 phenotype– Jeong 2022 (paper Table 1 phenotype classification:*1/*1= EM;*1/*3,*1/*13= IM).
-
-
Example models:
Jeong_2022_torsemide.R(linear-deviation effect on apparent clearance and inter-compartmental clearance:CL/F = tvCL/F * (1 + 0.510 * CYP2C9_EM)andQ/F = tvQ/F * (1 + 0.365 * CYP2C9_EM); CYP2C9 extensive metabolizers have 51% higher apparent clearance and 36.5% higher apparent inter-compartmental clearance than intermediate metabolizers; Jeong 2022 Table 4 final Pop-PK model). -
Notes: Follows the
CYP3A5_EXPRprecedent above (functional-allele-carrier = 1) rather than theSNP_<GENE>_<RSID>mutant-presence pattern, because (a) clinical CYP2C9 phenotype is reported as EM / IM / PM and the canonical name should mirror the clinically meaningful axis, and (b) Jeong 2022 chose IM as the model reference, so the EM-equals-1 orientation preserves the paper’s reported coefficient signs and typical-value parameters directly. The canonical name pools reduced-function alleles (2, 3, *13, etc.) into the0category because the population-PK literature typically does not separately resolve them; future papers that distinguish IM from PM should propose a paired companion (CYP2C9_PM) so the three-level EM / IM / PM phenotype can be encoded with two binary indicators on theSLCO1B1_HAP15_HET/SLCO1B1_HAP15_HOMpattern. Distinct fromCYP2D6/CYP3A4(continuous-activity-score canonicals): useCYP2C9_EMwhen the source paper reports a discrete phenotype label; use a continuousCYP2C9canonical (not yet ratified; see TODO onCYP3A4) when a future paper reports a probe-derived activity number. Ratified canonically on 2026-05-17 alongside the Jeong 2022 torsemide extraction.
CYP3A4_INH (canonical for concomitant CYP3A4 inhibitor coadministration indicator)
-
Description: 1 = subject coadministered any CYP3A4
inhibitor during the study, 0 = no concomitant CYP3A4 inhibitor.
Distinct from the
CYP3A4continuous-activity-score canonical above:CYP3A4_INHcaptures concomitant-medication exposure (a drug-drug-interaction indicator), not intrinsic enzyme activity. Use this canonical when the source paper enters CYP3A4-inhibitor coadministration into the popPK model as a binary indicator, regardless of which inhibitor strengths (strong / moderate / weak) the paper pools into the1category. - Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no CYP3A4 inhibitor coadministration).
-
Source aliases: none standardized; source datasets
typically encode the column as
CYP3AI,CYP3A4I,CYP3AINH, or a free-text concomitant-medication indicator. Document the source-column name per-model incovariateData[[CYP3A4_INH]]$source_name. -
Example models:
Yassen_2025_asundexian.R(proportional-shift effect on CL/F:(1 + e_cyp3a4_inh_cl * CYP3A4_INH)withe_cyp3a4_inh_cl = -0.0531; the asundexian dataset pools weak + moderate CYP3A4 inhibitors into theCYP3A4_INH = 1category because strong inhibitors were a Phase II exclusion criterion). -
Notes: Per-model
covariateData[[CYP3A4_INH]]$notesmust document which inhibitor strengths (strong / moderate / weak) and which specific drug examples are pooled into theCYP3A4_INH = 1category, since inclusion criteria vary by study. Future models that need stratified encoding (separate strong / moderate / weak indicators) should register companion canonicals (e.g.CYP3A4_INH_STRONG,CYP3A4_INH_MOD,CYP3A4_INH_WEAK) rather than overloadingCYP3A4_INH. The complementary CYP3A4-inducer indicator should follow the same pattern as a separate canonical (CYP3A4_IND) when first needed. Ratified canonically on 2026-05-08 alongside the Yassen 2025 asundexian extraction.
CYP3A4_IND (canonical for concomitant CYP3A4 inducer coadministration indicator)
-
Description: 1 = subject coadministered any CYP3A4
inducer during the study, 0 = no concomitant CYP3A4 inducer. Sibling
indicator to
CYP3A4_INH; both capture concomitant-medication exposure (a drug-drug-interaction indicator), not intrinsic enzyme activity (distinct from theCYP3A4continuous-activity-score canonical above). Use this canonical when the source paper enters CYP3A4-inducer coadministration into the popPK model as a binary indicator, regardless of which inducer strengths (strong / moderate / weak) the paper pools into the1category. - Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (no CYP3A4 inducer coadministration).
-
Source aliases: none standardized; source datasets
typically encode the column as
CYP3AIND,CYP3A4IND,INDU,INDUCER, or a free-text concomitant-medication indicator. Document the source-column name per-model incovariateData[[CYP3A4_IND]]$source_name. -
Example models:
Gupta_2016_lenvatinib.R(multiplicative power-form effect on CL/F:1.30^CYP3A4_INDwithe_cyp3a4_ind_cl = log(1.30) ~ 0.262; the Gupta dataset pools any concomitant CYP3A4 inducer reported in the per-subject medication log into theCYP3A4_IND = 1category, withn = 19(2.4%) of the 779-subject pooled cohort flagged positive). -
Notes: Per-model
covariateData[[CYP3A4_IND]]$notesmust document which inducer strengths (strong / moderate / weak) and which specific drug examples are pooled into theCYP3A4_IND = 1category, since inclusion criteria vary by study. Future models that need stratified encoding (separate strong / moderate / weak indicators) should register companion canonicals (e.g.CYP3A4_IND_STRONG,CYP3A4_IND_MOD,CYP3A4_IND_WEAK) rather than overloadingCYP3A4_IND. Sibling canonical toCYP3A4_INH(anticipated by that entry’s notes). Ratified canonically alongside the Gupta 2016 lenvatinib extraction.
APOE4_COUNT (canonical for APOE-epsilon4 allele count)
- Description: Continuous individual-level APOE-epsilon4 allele count: 0 = non-carrier, 1 = heterozygous (one epsilon4 allele), 2 = homozygous (two epsilon4 alleles). Time-invariant (germline genotype). Models in the Alzheimer’s-disease-progression literature treat the 0 / 1 / 2 count as a continuous effect on baseline cognitive score and / or disease-progression slope, with the population-mean carrier-allele count used as the centring value (e.g., 0.72 in the Conrado 2014 CAMD cohort).
- Units: (count, 0 / 1 / 2 alleles per subject; population-mean centring value documented per-model)
- Type: continuous
- Scope: general
-
Reference category: n/a (continuous). Models centre
on the dataset / population mean APOE-epsilon4 count; document the
centring value per-model in
covariateData[[APOE4_COUNT]]$notes. -
Source aliases:
-
APOE4C– used directly inConrado_2014_alzheimer.R. The “C” suffix in the source distinguishes the cleaned continuous APOE-epsilon4 count column (0 / 1 / 2 withunknownrecoded to the population mean) from the upstream rawAPOE4column (0 = non-carrier, 1 = heterozygous, 2 = homozygous, 3 = unknown).
-
-
Example models:
Conrado_2014_alzheimer.R(centring 0.72; multiplicative effect on baseline ADAS-Cog and on disease-progression slope:factor = 1 + e * (APOE4_COUNT - 0.72)withe_blapoe4 = 0.0372on baseline ande_slapoe4 = 0.195on slope). -
Notes: APOE-epsilon4 carrier status is the
strongest established genetic risk factor for late-onset Alzheimer’s
disease; the allele-count form (rather than a binary carrier indicator)
is preferred when the source paper distinguishes heterozygous from
homozygous carriers. Future models that report only a binary carrier
indicator (any-epsilon4 vs none) should register a separate canonical
(
APOE4_CARRIER) rather than overloadingAPOE4_COUNT. Theunknowncategory (often recorded asAPOE4 = 3in CDISC datasets) is conventionally recoded by the source paper to the population-mean count to avoid dropping subjects; document the recoding rule used per-model. Ratified canonically on 2026-05-06 alongside the Conrado 2014 DDMORE extraction.
FCGR3A_VV (canonical for FCGR3A 158 V/V homozygote indicator)
- Description: 1 = subject is homozygous for valine at amino-acid position 158 of the FcgammaRIIIa receptor (V/V), encoded by the rs396991 polymorphism in the FCGR3A gene; 0 = otherwise (heterozygote V/F or homozygote F/F pooled). The dominant V/V vs (V/F + F/F) grouping is the encoding used in the Aguiar 2021 source paper after testing dominant and recessive groupings during covariate model building.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 = V/F heterozygote or F/F homozygote (combined).
-
Source aliases:
-
FCGR3A(genotype string, e.g.,"V/V"/"V/F"/"F/F"): deriveFCGR3A_VV = as.integer(FCGR3A == "V/V"). -
rs396991(raw allele coding, often"AA"/"AC"/"CC"or"GG"/"GT"/"TT"depending on assay strand): map V allele -> 1, F allele -> 0 with the assay-specific allele convention; deriveFCGR3A_VV = as.integer(genotype is V-homozygous).
-
-
Example models:
Aguiar_2021_ustekinumab.R(Aguiar 2021 Table 2 footnote; logit-scale effect on subcutaneous bioavailability F: 88.8% in V/V vs 71.0% in V/F + F/F). -
Notes: rs396991 (FCGR3A 158V>F) is a
well-studied pharmacogenetic polymorphism affecting FcgammaRIIIa-IgG
affinity and has been associated with response to several IgG monoclonal
antibodies (rituximab, infliximab, ustekinumab). The V allele is the
higher-affinity variant. Document the assay-strand allele convention
used in the source paper in
covariateData[[FCGR3A_VV]]$notes. Future models that use a recessive (F/F vs V/* combined) or codominant (additive 0/1/2) coding should register a separate canonical (e.g.,FCGR3A_FF,FCGR3A_VCTfor V-allele count) rather than overloadingFCGR3A_VV.
Immunogenicity
ADA_POS (canonical for anti-drug antibody positive status indicator)
- Description: 1 = antidrug-antibody-positive, 0 = ADA-negative.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (ADA-negative).
-
Source aliases:
-
ADA(semantically “ever positive”) – used inZhu_2017_lebrikizumab.R. When translating from a paper that usesADAas “ever positive,” verify the time-frame matches ADA_POS semantics before renaming. -
ADA(time-varying positivity, primary covariate in Xu 2019) – used inXu_2019_sarilumab.R. -
NAB(neutralizing antibody positive – used inPetrov_2024_romiplostim.R). Strictly a subset of total ADA-positive (ADA antibodies that neutralize the drug’s biological effect). Document per-model when the source assay measured NAB only and the canonical column thus excludes binding-only ADA. -
ATAPOSNEW(ADA-positive in the newer/updated-assay cohort) – used inSuri_2018_brentuximab.Ras the modern-assay arm of a two-era ADA decomposition. -
ADA_POSNEW(retired intermediate name; renamed toADA_POSon 2026-04-29 for consistency across single- and multi-assay models).
-
-
Example models:
Clegg_2024_nirsevimab.R,Hu_2026_clesrovimab.R,Petrov_2024_romiplostim.R,Suri_2018_brentuximab.R(multi-assay; paired withADA_POSOLDandADA_MISSING;cl *= (1 + 0.125 * ADA_POS)),Xu_2019_sarilumab.R. -
Notes: In single-assay studies this is a
straightforward binary. In studies pooling data across assay generations
(different sensitivity / drug-tolerance characteristics),
ADA_POSrepresents modern/current-assay positivity; companion indicatorsADA_POSOLDandADA_MISSINGcapture historical-assay-positive and missing-result sub-groups respectively. All three are mutually exclusive; reference is ADA-negative (all three = 0).
ADA_POSOLD (canonical for ADA-positive in older-assay study indicator)
- Description: 1 = subject is anti-drug-antibody-positive in a study that used the older lower-sensitivity, lower-drug-tolerance ADA assay; 0 = otherwise.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (ADA-negative or not in an
older-assay study). Mutually exclusive with
ADA_POSandADA_MISSING. -
Source aliases:
-
ATAPOSOLD– used inSuri_2018_brentuximab.R.
-
-
Example models:
Suri_2018_brentuximab.R(multiplicative additive effect on ADC clearance:cl *= (1 + 0.177 * ADA_POSOLD)). -
Notes: Companion to
ADA_POS(multi-assay form); see that entry’s Notes for the decomposition rationale. The “newer” vs “older” assay split is paper-specific (Suri 2018 newer assay: sensitivity 23.573 ng/mL, drug tolerance 25 ug/mL; older assay: sensitivity 4 ng/mL, drug tolerance 3,125 ng/mL). Time-varying once positive. Ratified canonically on 2026-04-28.
ADA_MISSING (canonical for ADA-result-missing indicator)
- Description: 1 = ADA value is missing (subject did not have a measured ADA result, distinct from a measured negative); 0 = ADA result is reported (positive or negative).
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (ADA result reported).
Mutually exclusive with
ADA_POSandADA_POSOLD. -
Source aliases:
-
ATAMISSING– used inSuri_2018_brentuximab.R.
-
-
Example models:
Suri_2018_brentuximab.R(multiplicative additive effect on ADC clearance:cl *= (1 + 0.192 * ADA_MISSING)). -
Notes: Used when a substantial fraction of the
pooled cohort has no ADA measurement (Suri 2018: 205 of 380 patients)
and the modeler retains ADA-missing as a separate level rather than
collapsing missingness onto the ADA-negative reference. The non-zero
positive estimate of
e_adam_adc_clindicates ADA-missing patients are not exchangeable with the ADA-negative reference – interpret with caution given the missingness mechanism is not random. Ratified canonically on 2026-04-28.
ADA_TITER (canonical for continuous antidrug-antibody titer/titre)
-
Description: Continuous antidrug-antibody
titer/titre (time-varying; matched in time to the PK sample). Covers
both the British-spelling reciprocal-dilution convention
(
ADA_TITRE, withADA_TITRE = 1for ADA-negative solog_e(1) = 0cancels a log-linear effect) and the American-spelling linear-titer convention (ADA_TITER, withADA_TITER = 0for ADA-negative). The per-modelcovariateData[[ADA_TITER]]$descriptionandnotesmust state which zero-encoding convention is in force so the covariate column cannot be misinterpreted. -
Units: Reciprocal dilution (e.g., 10, 20, 40, …,
2560) OR assay units (log2 or arbitrary) – document per-model in
covariateData[[ADA_TITER]]$units. - Type: continuous
- Scope: general
- Reference category: n/a – ADA-negative encoded per-model (see zero-encoding note).
-
Source aliases:
-
ADA_TITRE– British spelling (reciprocal-dilution convention;1for negative). -
ADA titre– British spelling long form. -
ADAT– used inMoein_2022_etrolizumab.R(American linear-titer convention;0for negative).
-
-
Example models:
Jackson_2022_ixekizumab.R(reciprocal-dilution reference convention withADA_TITER = 1for negatives and(1 + coef * log_e(ADA_TITER))on CL),Moein_2022_etrolizumab.R(linear-titer convention withADA_TITER = 0for negatives andexp(theta * ADA_TITER)on CL, per-unit-titer theta = 0.0365),Robbie_2012_palivizumab.R(reciprocal-dilution values 0/10/20/40/>=80 with category-specific multiplicative effects per titer bin; 0 = ADA negative reference). -
Notes: The prior separate
ADA_TITRE(British,1= negative) andADA_TITER(American,0= negative) canonicals were merged on 2026-04-20 into a single general-scopeADA_TITER. The zero-encoding convention is the load-bearing semantic and must be documented per-model. Distinct fromADA_POS(binary presence/absence); when the paper reports both, the final model usually keeps only one. Imputation rules (LOCF / NOCB / baseline-as-negative) should be documented per-model.
Disease / treatment history
HCT_COND_RIC (canonical for reduced-intensity conditioning regimen indicator)
- Description: 1 = subject received reduced-intensity conditioning (RIC) chemotherapy prior to allogeneic hematopoietic cell transplantation, 0 = subject received myeloablative conditioning (MAC). Conditioning intensity is fixed per subject for the analysis window (the conditioning regimen was completed before transplantation, before any of the post-transplant tacrolimus PK observations). RIC regimens use lower-dose chemotherapy / radiotherapy to preserve some host haematopoiesis and rely on graft-versus-tumour effect for cytoreduction; MAC regimens deliver high-dose chemotherapy and / or total-body irradiation that fully ablates host marrow.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (myeloablative conditioning, MAC).
-
Source aliases:
-
RIC(Dunlap 2025 NM-TRAN convention; binary 0 / 1) – used directly inDunlap_2025_tacrolimus.R.
-
-
Example models:
Dunlap_2025_tacrolimus.R(Dunlap 2025 Table 2 reduced-covariate-model column; exponential effect on apparent oral clearance:cl *= 0.63 ^ HCT_COND_RIC, so RIC recipients have ~37% lower apparent oral tacrolimus clearance than MAC recipients). -
Notes: Conditioning regimen intensity has been
reported to associate with post-transplant tacrolimus apparent
clearance, likely via gut / hepatic CYP3A activity, GVHD-related
inflammatory response, and post-transplant haematopoietic state. The
paper-specific definition of “RIC” follows the source publication’s own
classification (e.g., Dunlap 2025 follows the institutional protocol at
UNCMC, which pools non-myeloablative conditioning regimens into the RIC
category when assigning the binary indicator); document the source
paper’s RIC criteria in
covariateData[[HCT_COND_RIC]]$notes. When a future paper distinguishes a third intensity tier (non-myeloablative, NMA) as a separate covariate level rather than pooling NMA into RIC, register a parallel canonical (e.g.HCT_COND_NMA) instead of overloadingHCT_COND_RIC. Scope: specific because the column is meaningful only for allo-HCT recipients. Ratified canonically on 2026-05-09 alongside the Dunlap 2025 tacrolimus extraction.
DISEXT_EP (canonical for extensive colitis / pancolitis indicator)
- Description: 1 = extensive colitis or pancolitis disease extension, 0 = otherwise (any non-extensive disease extension, e.g. left-sided colitis or proctitis).
- Units: (binary)
- Type: binary
- Scope: general
-
Reference category: 0. In papers that decompose the
disease-extension categorical into both
DISEXT_EPandDISEXT_OTHER,DISEXT_EP = 0 AND DISEXT_OTHER = 0corresponds to the left-sided-colitis reference group; in papers that use a single binary indicator for extensive colitis, the reference is pooled non-extensive (left-sided + any other extension). -
Source aliases:
-
EXTCOL– used inFaelens_2021_infliximab.R(binary 0/1 for extensive colitis at baseline; no separate “other” category). - Derived from a multi-level
DISEXTcolumn in the source (levels: left-sided colitis, extensive/pancolitis, other):DISEXT_EP = as.integer(DISEXT == "extensive/pancolitis").
-
-
Example models:
Moein_2022_etrolizumab.R(paired withDISEXT_OTHER; multiplicative effect on CL, +8.2% vs. left-sided colitis),Faelens_2021_infliximab.R(single-binary encoding; multiplicative fold-change on V of 1.25 when DISEXT_EP = 1). -
Notes: Optionally paired with
DISEXT_OTHERwhen the source paper decomposes a three-level disease-extension categorical (left-sided / extensive-pancolitis / other) into two indicators; the pairing is paper-specific and not required. Promoted from scope: specific to scope: general on 2026-04-27 because the binary “extensive colitis vs not” semantics generalize across UC popPK papers regardless of whether the original dataset additionally distinguished an “other” disease-extension category.
DISEXT_OTHER (canonical for ‘other disease extension’ indicator)
- Description: 1 = disease extension other than left-sided colitis or extensive/pancolitis, 0 = not.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (left-sided colitis, when
paired with
DISEXT_EP = 0). -
Source aliases: Derived from a multi-level
DISEXTcolumn:DISEXT_OTHER = as.integer(DISEXT == "other"). -
Example models:
Moein_2022_etrolizumab.R(multiplicative effect on CL, +18% vs. left-sided colitis; large uncertainty due to 2% prevalence). -
Notes: Paired with
DISEXT_EP; together they encode the three-level disease-extension categorical.
PRIOR_TAXANE (canonical for binary prior-taxane chemotherapy indicator)
- Description: 1 = subject received any prior taxane regimen (docetaxel, paclitaxel, cabazitaxel, etc.) before study entry, 0 = taxane-naive. Time-invariant within a subject (records treatment history at baseline). Oncology-pretreatment indicator: relevant for cohorts in which prior taxane exposure plausibly alters disease biology (e.g., advanced / castration-resistant prostate cancer, where taxane pretreatment is associated with more advanced disease and selects for taxane-resistant clones).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (taxane-naive).
-
Source aliases:
-
PTAX– used invanHasselt_2015_eribulin.R(van Hasselt 2015 paper notation; binary 0 / 1 for prior docetaxel pretreatment).
-
-
Example models:
vanHasselt_2015_eribulin.R(multiplicative effect on baseline serum PSA:psa0 = exp(lpsa0 + etalpsa0) * e_prior_taxane_psa0^PRIOR_TAXANEwithe_prior_taxane_psa0 = 3.23– prior-taxane patients have ~3.2x higher baseline PSA than taxane-naive patients, consistent with more advanced disease at study entry). -
Notes: Pairs with
PRIOR_TAXANE_DAYSwhen a continuous duration-of-pretreatment is also relevant (e.g., van Hasselt 2015 uses both: PTAX on PSA0 and NTRT on KD). Distinct fromCONMED_*(which is concomitant medication during the study, not pretreatment history) and from genericPRIOR_*chemotherapy indicators (which would warrant a separate canonical when a paper differentiates by drug class rather than collapsing to taxanes). Scope: specific because the population semantics (CRPC) and the “any prior taxane” pooling are tied to van Hasselt 2015; future papers that distinguish per-drug pretreatment (docetaxel vs paclitaxel vs cabazitaxel separately) should register parallel canonicals rather than overloadingPRIOR_TAXANE.
PRIOR_TAXANE_DAYS (canonical for cumulative days of prior taxane treatment)
-
Description: Cumulative number of days of prior
taxane chemotherapy at study entry. 0 for taxane-naive patients (i.e.,
for any subject with
PRIOR_TAXANE = 0). Time-invariant within a subject (records the pretreatment history at baseline). - Units: days
- Type: continuous
- Scope: specific
-
Reference category: n/a – normalised by a
population reference value (van Hasselt 2015 uses the population median
of 720 days among prior-taxane-pretreated patients) and entered as a
power covariate
(1 + PRIOR_TAXANE_DAYS / 720) ^ e_prior_taxane_days_<param>. The+1inside the bracket makes the covariate effect collapse to a multiplier of 1 for taxane-naive patients (PRIOR_TAXANE_DAYS = 0) regardless of the estimated exponent. -
Source aliases:
-
NTRT– used invanHasselt_2015_eribulin.R(van Hasselt 2015 paper notation; cumulative number of days of prior taxane treatment).
-
-
Example models:
vanHasselt_2015_eribulin.R(van Hasselt 2015 Eq. 4 power covariate on drug PSA inhibition rate KD0:kd0 = exp(lkd0 + etalkd0) * (1 + PRIOR_TAXANE_DAYS / 720)^e_prior_taxane_days_kd0withe_prior_taxane_days_kd0 = -4.00– KD0 decreases with longer prior-taxane exposure, encoding cross-resistance between docetaxel and eribulin via the shared microtubule-inhibition mechanism). -
Notes: Pairs with
PRIOR_TAXANE(binary). The paper also considered an alternative continuous parameterisation in cycles of prior taxane (NCYCL, median 30 cycles) which was deemed slightly less informative (dOFV = -8 for NCYCL vs -10 for NTRT) and was not retained in the final model. If a future model needs the cycle-count form, register a parallel canonical (e.g.PRIOR_TAXANE_CYCLES) rather than overloading this one. Scope: specific because the 720-day normalisation reference is tied to the van Hasselt 2015 study population (post-docetaxel mCRPC patients).
Hypercholesterolemia biomarkers
PCSK9 (canonical for baseline unbound serum PCSK9 concentration)
- Description: Baseline unbound serum proprotein convertase subtilisin/kexin type 9 (PCSK9) concentration.
-
Units: ng/mL (document per-model in
covariateData[[PCSK9]]$unitsif a different unit – typically nM – is used in a given model; conversion uses a PCSK9 molecular weight of ~72 kDa, so 1 nM ~= 72 ng/mL). - Type: continuous
- Scope: general
-
Reference category: n/a – used with power scaling
(PCSK9 / ref)^exponent. Reference values observed: 425 ng/mL (= 5.9 nM) inKuchimanchi_2018_evolocumab.R(population median). - Source aliases: none known.
-
Example models:
Kuchimanchi_2018_evolocumab.R(power exponent 0.194 on Vmax:Vmax * (PCSK9/425)^0.194). - Notes: PCSK9 is the pharmacological target of anti-PCSK9 monoclonal antibodies (evolocumab, alirocumab, etc.); baseline PCSK9 drives the magnitude of target-mediated elimination and is a recurring covariate in anti-PCSK9 popPK models. Baseline (time-fixed) covariate; patients with missing baseline PCSK9 are typically excluded from analyses that include PCSK9 as a covariate.
Pharmacogenomic SNPs
Canonical pattern:
SNP_<GENE>_<RSID>. Use one binary
indicator per SNP genotype that the source paper tests as a model
covariate. The SNP_ prefix makes the category unambiguous;
the gene symbol disambiguates rsIDs grouped by gene; the rsID provides a
globally unique identifier. Encoding follows the most common
pharmacogenomic convention (also used by Papachristos 2020):
1 = mutant allele present (heterozygous or homozygous
mutant); 0 = homozygous wild-type. When a paper uses a
different encoding (e.g., per-allele dosage 0/1/2, dominant
model with mutant homozygotes only, or recessive model), document the
encoding explicitly in covariateData[[<COL>]]$notes
and consider registering a separate canonical name. SNP indicators
default to scope: specific because the parameter on which they act and
the encoded reference category are tied to the source paper’s analysis
plan; promote to general when a second paper ratifies identical
semantics.
SNP_ABCG2_RS4148157 (canonical for ABCG2 rs4148157 variant indicator)
- Description: Binary genotype indicator for the ABCG2 (BCRP) rs4148157 single-nucleotide polymorphism (G > A; intron 11, population MAF ~0.10). 1 = at least one variant (A) allele present (heterozygous AG or homozygous AA carrier); 0 = homozygous wild-type (GG).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (homozygous wild-type GG).
-
Source aliases:
-
GENECAT– Roberts 2016 (paper Results ‘Population Pharmacokinetic Analysis’ section: the individual-Ka equation Ka_i = Ka_pop * exp(theta_1 * GENECAT), with GENECAT = 1 for AG / AA and 0 for GG).
-
-
Example models:
Roberts_2016_topotecan.R(exponential effect on oral topotecan absorption rate constant Ka:Ka = Ka_pop * exp(1.06 * SNP_ABCG2_RS4148157); AG/AA carriers have Ka approximately 2.89x higher than GG homozygotes, corresponding to an observed Cmax approximately 1.7x higher). - Notes: Time-fixed per subject (germline genotype). Variant carrier rate in the Roberts 2016 paediatric brain-tumour cohort: 19% (10 of 52 successfully genotyped patients; 9 AG heterozygotes plus 1 AA homozygote, with AA and AG pooled because only one AA homozygote was present). The rs4148157 variant is intronic and is in strong linkage disequilibrium with the better-characterised rs2231142 variant (Q141K, exon 6); Roberts 2016 Discussion proposes that rs4148157 may be acting as a surrogate marker for rs2231142 in this analysis. ABCG2 (also called BCRP, breast cancer resistance protein) is an intestinal and hepatic efflux transporter that influences oral bioavailability of topotecan and other substrates.
SNP_ICAM1_RS1799969 (canonical for ICAM-1 rs1799969 mutant indicator)
- Description: Binary genotype indicator for the ICAM1 rs1799969 single-nucleotide polymorphism (G > A; Gly241Arg / K469E in some references). 1 = at least one mutant (A) allele present (heterozygous or homozygous mutant); 0 = homozygous wild-type (GG).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (homozygous wild-type).
-
Source aliases:
-
cat– Papachristos 2020 (the paper writes the indicator ascatin the CL covariate equation; no formal column name is given in the published narrative).
-
-
Example models:
Papachristos_2020_bevacizumab_pk.R,Papachristos_2020_bevacizumab_qss.R,Papachristos_2020_bevacizumab_pkpd.R(multiplicative effect on bevacizumab CL:CL * exp(-0.423 * SNP_ICAM1_RS1799969)in the PK and PK/PD models;CL * exp(-0.33 * SNP_ICAM1_RS1799969)in the binding QSS model – mutant carriers have lower CL and higher trough levels). -
Notes: Time-fixed per subject. Mutant carrier rate
in the Papachristos 2020 mCRC cohort: 20% (
NotesTable 1 of the paper). The biological mechanism by which the ICAM1 mutant slows bevacizumab clearance is unknown; the association is empirical and may be specific to mCRC.
SNP_VEGFA_RS1570360 (canonical for VEGF-A rs1570360 mutant indicator)
- Description: Binary genotype indicator for the VEGFA rs1570360 single-nucleotide polymorphism (-1154 G > A; promoter region). 1 = at least one mutant (A) allele present; 0 = homozygous wild-type (GG).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (homozygous wild-type).
-
Source aliases:
-
cat1– Papachristos 2020 (used as the first categorical indicator in the inter-compartmental clearance equation of the PK model; no formal column name in the narrative).
-
-
Example models:
Papachristos_2020_bevacizumab_pk.R(multiplicative effect on bevacizumab Q:Q * exp(0.378 * SNP_VEGFA_RS1570360)– mutant carriers have higher inter-compartmental clearance). - Notes: Time-fixed per subject. Mutant carrier rate in the Papachristos 2020 mCRC cohort: 33%. The covariate is significant in the standalone PK model but does not appear in the binding QSS or PK/PD models because in those models the inter-compartmental clearance covariate effects are absorbed into the rs699947 effect on Q (PK/PD) or into the K_ss / BM0 effects (QSS).
SNP_VEGFA_RS699947 (canonical for VEGF-A rs699947 mutant indicator)
- Description: Binary genotype indicator for the VEGFA rs699947 single-nucleotide polymorphism (-2578 C > A; promoter region). 1 = at least one mutant (A) allele present; 0 = homozygous wild-type (CC).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (homozygous wild-type).
-
Source aliases:
-
cat2– Papachristos 2020 PK model (second categorical indicator on Q). -
cat– Papachristos 2020 binding QSS model (effect on K_ss and BM0) and PK/PD model (effect on Q).
-
-
Example models:
Papachristos_2020_bevacizumab_pk.R(effect on Q: -0.429),Papachristos_2020_bevacizumab_qss.R(effect on K_ss: +1.22, on BM0: -0.851),Papachristos_2020_bevacizumab_pkpd.R(effect on Q: -0.414). - Notes: Time-fixed per subject. Mutant carrier rate in the Papachristos 2020 mCRC cohort: 52%. The mutant allele is associated with lower baseline free VEGF-A levels and a higher in-vivo affinity (higher K_ss), consistent with reports that rs699947 mutants have prolonged overall survival on bevacizumab-based therapy.
SNP_SLCO1B1_RS11045819 (canonical for SLCO1B1 rs11045819 mutant indicator)
- Description: Binary genotype indicator for the SLCO1B1 rs11045819 single-nucleotide polymorphism (C > A; OATP1B1 transporter, exon 5, P155T). 1 = at least one mutant (A) allele present (heterozygous AC or homozygous AA); 0 = homozygous wild-type (CC). The Hennig 2015 cohort (n = 35 successfully genotyped of 44) reported 5 AC heterozygotes, 30 CC homozygotes, and 0 AA homozygotes, so the indicator is effectively heterozygous-vs-CC in that cohort.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (homozygous wild-type CC).
-
Source aliases:
-
SLCO1B1 rs11045819 genotype– Hennig 2015 (paper text; the source NONMEM control stream is in the unrecovered AAC supplement, so the formal column name is not on disk).
-
-
Example models:
Hennig_2015_rifabutin.R(multiplicative effect on rifabutin bioavailability F:F * (1 + 0.304 * SNP_SLCO1B1_RS11045819)– AC carriers have ~30% higher rifabutin F than CC reference; dOFV = -6.5). - Notes: Time-fixed per subject. Carrier rate in the Hennig 2015 South-African HIV/TB cohort: 14% (5 of 35 genotyped). SLCO1B1 encodes OATP1B1, a hepatic uptake transporter; rs11045819 has been associated with reduced rifampicin and lopinavir concentrations in prior studies but in Hennig 2015 was associated with INCREASED rifabutin bioavailability (note opposite direction of effect across rifamycins).
SLCO1B1_HAP15_HET (**canonical for SLCO1B1*15 haplotype heterozygote indicator**)
-
Description: Binary haplotype indicator for the
SLCO1B1
*15reduced-function haplotype (the cis combination of the 388A>G / rs2306283 and 521T>C / rs4149056 variants; encodes the OATP1B1 N130D + V174A double mutant). 1 = subject carries exactly one 15 allele (heterozygous:*1a/*15or*1b/*15), 0 = otherwise (the union of 15-noncarriers and *15-homozygotes; the paired indicatorSLCO1B1_HAP15_HOMflags the homozygous group). Time-fixed per subject (germline haplotype). - Units: (binary)
- Type: binary
- Scope: general
-
Reference category: 0 (homozygous 1a/1a,
1a/1b, or 1b/1b – i.e., no 15 allele). The
reference group is the union of the three non-15 diplotypes;
SLCO1B1_HAP15_HOMflags the 15/15 group. -
Source aliases:
-
HT– Ide 2009 (paper text Eq. forFrel = 1 * theta1^HT * theta2^HMwhereHT = 1for heterozygotes*1a/*15and*1b/*15). -
OATP1B1 phenotype– Jeong 2022 (paper Section 3.2 + Table 1 phenotype-to-haplotype mapping: ET = 1a/1a + 1a/1b + 1b/1b, IT = 1a/15 + 1b/15, PT = 15/15; IT subjects map to SLCO1B1_HAP15_HET = 1).
-
-
Example models:
Ide_2009_pravastatin.R(multiplicative effect on relative bioavailability Frel:Frel = 1.50^SLCO1B1_HAP15_HET * 1.95^SLCO1B1_HAP15_HOM– 15 heterozygotes have 50% higher Frel than 15-noncarriers; dOFV = 32.2 in backward elimination, p < 0.001),Jeong_2022_torsemide.R(linear-deviation effect on apparent central volume V/F:V/F = tvV/F * (1 + (-0.410) * SLCO1B1_HAP15_HET + (-0.646) * SLCO1B1_HAP15_HOM)– 15 heterozygotes have 41% lower V/F than 15-noncarriers; Jeong 2022 Table 4). -
Notes: Paired with
SLCO1B1_HAP15_HOMto encode a three-level haplotype categorical (noncarrier / heterozygote / homozygote) with*15-noncarrier as the implicit reference (both indicators = 0). Distinct from the SNP-level canonicalSNP_SLCO1B1_RS11045819(which encodes only the C>A variant at a different position; rs11045819 = P155T) and from the 15 component SNPs rs2306283 (388A>G) and rs4149056 (521T>C) individually: future Ide-style extractions that pool 5 (521T>C only) with *15 should still record their values under this canonical and document the pooling rule incovariateData[[SLCO1B1_HAP15_HET]]$notes. Distribution in the Ide 2009 cohort of 57 healthy Japanese male volunteers (Table I): 28 noncarriers, 23 heterozygotes, 6 homozygotes; in the Jeong 2022 cohort of 112 healthy Korean male volunteers (Table 1): 86 ET noncarriers (76.8%), 23 IT heterozygotes (20.5%), 3 PT homozygotes (2.7%). Ratified canonically on 2026-05-12 alongside the Ide 2009 extraction; scope promoted from specific to general on 2026-05-17 alongside the Jeong 2022 torsemide extraction (second model using the same haplotype encoding across an unrelated drug class, demonstrating the canonical generalizes beyond statins).
SLCO1B1_HAP15_HOM (**canonical for SLCO1B1*15 haplotype homozygote indicator**)
-
Description: Binary haplotype indicator for the
SLCO1B1
*15reduced-function haplotype. 1 = subject carries two 15 alleles (homozygous:*15/*15), 0 = otherwise (the union of 15-noncarriers and *15-heterozygotes; the paired indicatorSLCO1B1_HAP15_HETflags the heterozygous group). Time-fixed per subject (germline haplotype). - Units: (binary)
- Type: binary
- Scope: general
-
Reference category: 0 (homozygous 1a/1a,
1a/1b, or 1b/1b – i.e., no 15 allele). The
reference group is the union of the three non-15 diplotypes;
SLCO1B1_HAP15_HETflags the *15-heterozygote group. -
Source aliases:
-
HM– Ide 2009 (paper text Eq. forFrel = 1 * theta1^HT * theta2^HMwhereHM = 1for homozygotes*15/*15). -
OATP1B1 phenotype– Jeong 2022 (paper Section 3.2 + Table 1 phenotype-to-haplotype mapping: PT = 15/15 corresponds to SLCO1B1_HAP15_HOM = 1).
-
-
Example models:
Ide_2009_pravastatin.R(multiplicative effect on relative bioavailability Frel:Frel = 1.50^SLCO1B1_HAP15_HET * 1.95^SLCO1B1_HAP15_HOM– 15 homozygotes have 95% higher Frel than 15-noncarriers; dOFV = 33.7 in backward elimination, p < 0.001),Jeong_2022_torsemide.R(linear-deviation effect on apparent central volume V/F:V/F = tvV/F * (1 + (-0.410) * SLCO1B1_HAP15_HET + (-0.646) * SLCO1B1_HAP15_HOM)– 15 homozygotes have 64.6% lower V/F than 15-noncarriers; Jeong 2022 Table 4). -
Notes: Paired with
SLCO1B1_HAP15_HETto encode a three-level haplotype categorical (noncarrier / heterozygote / homozygote) with*15-noncarrier as the implicit reference (both indicators = 0). SeeSLCO1B1_HAP15_HETNotes for the broader context; population distribution in Ide 2009 was 6 of 57 (10.5%) homozygotes and in Jeong 2022 was 3 of 112 (2.7%) homozygotes. Ratified canonically on 2026-05-12 alongside the Ide 2009 extraction; scope promoted from specific to general on 2026-05-17 alongside the Jeong 2022 torsemide extraction (second model using the same haplotype encoding across an unrelated drug class).
CYP2C9_S1_COUNT (**canonical for CYP2C9*1 (wild-type) allele count**)
-
Description: Continuous individual-level
CYP2C91 allele count: 0 = no 1 allele, 1 = one 1 allele
(heterozygous), 2 = two 1 alleles (homozygous wild-type).
Time-invariant (germline genotype). Paired with
CYP2C9_S2_COUNTandCYP2C9_S3_COUNTto encode the three loss-of-function-allele dosage form used by Hamberg-family warfarin models, where the subject’s CL is the sum of per-allele CL contributions across the two CYP2C9 alleles. The three count columns sum to 2 for each subject. - Units: (count, 0/1/2 alleles per subject)
- Type: continuous
- Scope: general
- Reference category: n/a (continuous). Used directly as a multiplier on a fixed per-allele CL contribution (0.174 L/h per *1 allele in Hamberg / Xia 2024).
-
Source aliases:
-
CYP2C9(genotype string such as"*1/*1","*1/*3"): deriveCYP2C9_S1_COUNT = (length of *1 matches in the genotype string). The modelXia_2024_warfarin.Rcarries the sourceCYP2C9genotype string mapped to the three count columns.
-
-
Example models:
Xia_2024_warfarin.R(per-allele CL contributions:cl = CYP2C9_S1_COUNT * 0.174 + CYP2C9_S2_COUNT * 0.0879 + CYP2C9_S3_COUNT * 0.0422, times an age effect). -
Notes: Hamberg’s warfarin K-PD model parameterises
CL as a sum of two per-allele CL contributions (one per CYP2C9 allele on
each chromosome), which is more flexible than a single genotype
indicator because it naturally accommodates any combination of 1,
2, 3 alleles (six diplotypes: 1/1, 1/2,
1/3, 2/2, 2/3, 3/3). When a paper
reports additional CYP2C9 alleles (e.g. 5, 6, 8, *11),
register parallel canonicals (
CYP2C9_S5_COUNTetc.) rather than overloading the existing three counts. Distinct from the categorical phenotype canonicalsCYP3A5_EXPR(binary expresser) and from continuous-activity scores likeCYP3A4– the count form preserves loss-of-function-allele dosage exactly. Ratified canonically on 2026-05-16 alongside the Xia 2024 warfarin extraction.
CYP2C9_S2_COUNT (**canonical for CYP2C9*2 reduced-function allele count**)
-
Description: Continuous individual-level
CYP2C92 allele count: 0 = no 2 allele, 1 = one 2 allele
(heterozygous), 2 = two 2 alleles (homozygous). Time-invariant
(germline genotype). Paired with
CYP2C9_S1_COUNTandCYP2C9_S3_COUNT; the three counts sum to 2 for each subject. The *2 allele (rs1799853, R144C) encodes a reduced-function CYP2C9 isoform. - Units: (count, 0/1/2 alleles per subject)
- Type: continuous
- Scope: general
- Reference category: n/a (continuous). Used directly as a multiplier on a fixed per-allele CL contribution (0.0879 L/h per *2 allele in Hamberg / Xia 2024).
-
Source aliases:
-
CYP2C9(genotype string such as"*1/*2","*2/*2","*2/*3"): deriveCYP2C9_S2_COUNT = (length of *2 matches in the genotype string).
-
-
Example models:
Xia_2024_warfarin.R(per-allele CL contributions; the Xia 2024 Han cohort had no *2 carriers, but the model retains the term for general use across CYP2C9 papers). -
Notes: See
CYP2C9_S1_COUNTfor the broader rationale. Ratified canonically on 2026-05-16 alongside the Xia 2024 warfarin extraction.
CYP2C9_S3_COUNT (**canonical for CYP2C9*3 reduced-function allele count**)
-
Description: Continuous individual-level
CYP2C93 allele count: 0 = no 3 allele, 1 = one 3 allele
(heterozygous), 2 = two 3 alleles (homozygous). Time-invariant
(germline genotype). Paired with
CYP2C9_S1_COUNTandCYP2C9_S2_COUNT; the three counts sum to 2 for each subject. The *3 allele (rs1057910, I359L) encodes a strongly reduced-function CYP2C9 isoform and is the dominant CYP2C9 pharmacogenomic risk variant in East-Asian populations (warfarin / phenytoin sensitivity). - Units: (count, 0/1/2 alleles per subject)
- Type: continuous
- Scope: general
- Reference category: n/a (continuous). Used directly as a multiplier on a fixed per-allele CL contribution (0.0422 L/h per *3 allele in Hamberg / Xia 2024).
-
Source aliases:
-
CYP2C9(genotype string such as"*1/*3","*3/*3"): deriveCYP2C9_S3_COUNT = (length of *3 matches in the genotype string).
-
-
Example models:
Xia_2024_warfarin.R(per-allele CL contributions; 5.7% of the Han cohort were 1/3 heterozygous per Xia 2024 Table 1). -
Notes: See
CYP2C9_S1_COUNTfor the broader rationale. Ratified canonically on 2026-05-16 alongside the Xia 2024 warfarin extraction.
VKORC1_1639G_COUNT (canonical for VKORC1 -1639G allele count)
-
Description: Continuous individual-level count of
VKORC1 -1639G alleles (rs9923231, also reported as VKORC1 -1639 G > A
or 1173 C > T depending on numbering convention). 0 = AA homozygous
(warfarin-sensitive), 1 = GA heterozygous, 2 = GG homozygous
(warfarin-resistant). Time-invariant (germline genotype). The
complementary -1639A count is
2 - VKORC1_1639G_COUNT. - Units: (count, 0/1/2 alleles per subject)
- Type: continuous
- Scope: general
-
Reference category: n/a (continuous). In the
Hamberg / Xia 2024 EC50 model the typical EC50 is a per-allele sum:
ec50_typ = VKORC1_1639G_COUNT * ec50_per_G + (2 - VKORC1_1639G_COUNT) * ec50_per_A. Distribution in the Xia 2024 Han cohort (Table 1): AA 80.3%, GA 18.7%, GG 0.9% (G allele frequency ~10%, consistent with East-Asian populations). -
Source aliases:
-
VKORC1(genotype string such as"AA","GA","GG"or"1639AA"etc.): deriveVKORC1_1639G_COUNT = (count of G in the two-letter genotype).
-
-
Example models:
Xia_2024_warfarin.R(per-allele EC50 contributions, re-estimated for the Han Chinese cohort: 4.3 mg/L per G allele, 1.14 mg/L per A allele). - Notes: VKORC1 -1639G > A is the strongest single-SNP determinant of warfarin sensitivity (the A allele reduces VKORC1 expression via a promoter-region effect, requiring less warfarin to achieve target anticoagulation). The per-allele count form is preferred over a binary carrier indicator because the heterozygous and homozygous mutant subjects respond detectably differently to warfarin. Ratified canonically on 2026-05-16 alongside the Xia 2024 warfarin extraction. ### CYP2C19_S2_CARRIER (**canonical for CYP2C19*2 loss-of-function allele carrier indicator**)
-
Description: Binary indicator for carriage of the
CYP2C192 loss-of-function allele (rs4244285, c.681G>A;
creates a cryptic splice site that abolishes CYP2C19 enzyme activity). 1
= subject carries at least one 2 allele (heterozygous 1/2
or homozygous 2/2); 0 = no 2 allele. Time-fixed per
subject (germline genotype). Het and hom carriers are pooled in studies
where the 2/2 (poor metabolizer) frequency is too low for a
separate phenotype layer; future extractions that distinguish
heterozygous-intermediate-metabolizer from homozygous-poor-metabolizer
effects should register paired
CYP2C19_S2_HET/CYP2C19_S2_HOMindicators (or, where the source uses the per-allele count form, aCYP2C19_S2_COUNTcolumn following theCYP2C9_S2_COUNTprecedent). - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (no 2 allele – 1/1, 1/17, or 17/*17).
-
Source aliases:
-
CYP2C19*2– Danielak 2017 (paper Methods ‘Determination of genetic polymorphisms’ and Table 2 final-modelEffect of CYP2C19*2 on FM (COV)row; PCR-RFLP genotyping for rs4244285). The Danielak 2017 cohort had no 2/2 homozygous-poor-metabolizers, so heterozygous 1/2 carriers were pooled into the binary CYP2C19_S2_CARRIER = 1 group with no information loss.
-
-
Example models:
Danielak_2017_clopidogrel.R(linear-deviation effect on the fraction of clopidogrel metabolised to the active thiol H4:fm = TVFM * (1 + e_cyp2c19_s2_fm * CYP2C19_S2_CARRIER)withe_cyp2c19_s2_fm = -0.45; carriers convert 45% less of the absorbed clopidogrel to the active H4 metabolite, giving a 36.7% lower predicted AUC of H4 vs non-carriers; Danielak 2017 Table 2 final-model and Results page 1628). -
Notes: CYP2C192 is the dominant
loss-of-function* CYP2C19 allele in clopidogrel pharmacogenetics;
it reduces clopidogrel’s metabolic activation to the antiplatelet-active
H4 thiol and is associated with elevated rates of stent thrombosis and
major adverse cardiovascular events on clopidogrel therapy (FDA boxed
warning, 2010). The paired *17 ultra-rapid-metabolizer allele
(rs12248560) is typically registered separately when present (a
CYP2C19_S17_CARRIERindicator following the same pattern). The continuous-individual-activity-score consolidation TODO logged onCYP2D6line 3321 also applies prospectively to CYP2C19, but the binary carrier indicator remains the standard discrete encoding used by most published clopidogrel popPK / PD models. Ratified canonically on 2026-05-20 alongside the Danielak 2017 clopidogrel extraction.
ABCB1_HAP_TTT (canonical for ABCB1 TTT haplotype carrier indicator)
-
Description: Binary haplotype indicator for the
ABCB1
TTThaplotype across the rs1128503 (1236C>T, exon 12, synonymous Gly412Gly) / rs2032582 (2677G>T/A, exon 21, Ala893Ser/Thr) / rs1045642 (3435C>T, exon 26, synonymous Ile1145Ile) SNP block. 1 = subject carries at least oneTTThaplotype (heterozygous or homozygous; pooled because the homozygote frequency was < 0.1 in the de Wit 2016 cohort); 0 = noTTThaplotype. Time-fixed per subject (germline haplotype). - Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (no
TTThaplotype). -
Source aliases:
-
ABCB1 TTT haplotype– de Wit 2016 (paper text Methods ‘Pharmacogenetic analysis’ and Table 2 final-modeltheta TTT on Frow; haplotypes phased in gPLINK with certainty > 0.97).
-
-
Example models:
deWit_2016_everolimus.R(multiplicative effect on apparent bioavailability F:F = 1 * 0.792^ABCB1_HAP_TTT– carriers have 20.8% lower F than non-carriers; dOFV = 9.6 in backward elimination, P < 0.01). -
Notes: The
TTThaplotype of ABCB1 (P-glycoprotein, MDR1 efflux transporter) is associated with enhanced P-gp efflux activity and reduced everolimus bioavailability (de Wit 2016 Discussion paragraph 5); de Wit cites prior evidence that the sameTTThaplotype also reduces exposure / efficacy of other P-gp substrates [refs 20-22 in the paper], although directionally inconsistent results have been reported [refs 7, 23, 24]. Het and hom carriers were pooled in de Wit 2016 because the homozygote frequency was < 0.1 (Methods ‘Pharmacogenetic analysis’); future extractions that estimate separate het / hom effects should register pairedABCB1_HAP_TTT_HETandABCB1_HAP_TTT_HOMindicators following theSLCO1B1_HAP15_HET/SLCO1B1_HAP15_HOMprecedent above. Distinct from any individual ABCB1 SNP indicator (rs1128503, rs2032582, rs1045642 alone) because the haplotype is the cis combination tested jointly. Ratified canonically on 2026-05-16 alongside the de Wit 2016 everolimus extraction.
ALDH2_S2_CARRIER (**canonical for ALDH2*2 inactive-variant carrier indicator**)
-
Description: Binary genotype indicator for the
ALDH2
*2(inactive) variant allele (rs671 G>A; Glu487Lys, aldehyde dehydrogenase 2 mitochondrial isoform). 1 = subject carries at least one ALDH22 allele (heterozygous 1/2 or homozygous 2/2); 0 = ALDH21/*1 wild-type. Time-fixed per subject (germline genotype). - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (ALDH21/1 wild-type, fully active aldehyde dehydrogenase 2).
-
Source aliases:
-
ALDH2(the source NONMEM indicator used inNemoto_2017_ethanol.R; 1 for 1/2 carriers, 0 for 1/1 wild-type; the Nemoto 2017 cohort had no 2/2 homozygotes so the indicator is effectively heterozygote-vs-wild-type in that cohort).
-
-
Example models:
Nemoto_2017_ethanol.R(additive shift on Vd/F: -20.4 L when ALDH2_S2_CARRIER = 1 vs ALDH21/1 reference; Nemoto 2017 Table II final model). -
Notes: ALDH22 is the canonical East-Asian
aldehyde-dehydrogenase deficiency allele (rs671 GAA -> AAA,
Glu487Lys); homozygous 2/2 subjects are essentially ALDH2-null
and experience severe acetaldehyde-flush after even small ethanol doses,
so are typically excluded from alcohol-PK studies. The Nemoto 2017
Japanese cohort contained 21/34 (62%) 1/1 and 13/34 (38%)
1/2 subjects; no 2/2 homozygotes were enrolled. The
single binary carrier indicator pools heterozygous and homozygous
variant carriers; future ethanol-PK extractions that distinguish
1/2 from 2/*2 effects should register paired
ALDH2_S2_HETandALDH2_S2_HOMindicators following theSLCO1B1_HAP15_HET/SLCO1B1_HAP15_HOMprecedent. Ratified canonically on 2026-05-18 alongside the Nemoto 2017 ethanol extraction.
ADH1B_S2_HOM (**canonical for ADH1B*2 homozygote indicator**)
-
Description: Binary genotype indicator for the
ADH1B
*2(high-activity) variant allele (rs1229984 G>A; Arg47His, alcohol dehydrogenase 1B class I beta-subunit). 1 = subject is homozygous 2/2; 0 = otherwise (the default reference covers ADH1B2/1 heterozygotes; ADH1B1/1 wild-type subjects fall outside the published parameterization of Nemoto 2017 and are conventionally assigned the same reference value as heterozygotes). Time-fixed per subject (germline genotype). - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (ADH1B2/1 heterozygous; ADH1B1/1 wild-type by extension when the model is applied beyond the published parameter range).
-
Source aliases:
-
ADH1B(the source NONMEM indicator used inNemoto_2017_ethanol.R; 1 for 2/2 homozygotes, 0 for 2/1 heterozygotes; Nemoto 2017 Table II structural model encodes the Vmax conditional via two separate THETAs).
-
-
Example models:
Nemoto_2017_ethanol.R(additive shift on Vmax: +176 mg/h for 2/2 homozygotes relative to the 2/1 reference Vmax of 7790 mg/h; Nemoto 2017 Table II final model). -
Notes: ADH1B2 is the East-Asian high-activity
alcohol-dehydrogenase variant (rs1229984 Arg47His); the 2 allele
encodes a ~40x faster ethanol oxidation rate than the 1 wild-type.
Allele frequency in Japanese populations is ~70%, so most subjects are
2/2 or 1/2 with a small 1/1 minority (~9%
expected). Nemoto 2017 inherits the two-Vmax parameterization from Seng
et al. 2014 (which estimated Vmax separately for 2/1 and
2/2 in a Chinese + Indian cohort) and does not report an ADH1B
genotype distribution for its 34-subject Japanese cohort; the structural
model is silent on ADH1B1/*1 subjects. Future ADH1B-aware
ethanol-PK extractions that explicitly model all three genotypes should
register
ADH1B_S2_HETas a companion indicator following theSLCO1B1_HAP15_HET/SLCO1B1_HAP15_HOMprecedent. Ratified canonically on 2026-05-18 alongside the Nemoto 2017 ethanol extraction.
Lifestyle / medical history
SMOKE (canonical for current-smoker binary indicator)
- Description: 1 = current smoker at baseline, 0 = non-smoker.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (non-smoker).
-
Source aliases:
-
Smoking(case-insensitive) – used inMa_2020_sarilumab_anc.R.
-
-
Example models:
Ma_2020_sarilumab_anc.R(power-form on baseline ANC:BASE * 1.15^SMOKE). -
Notes: Baseline-only indicator; does not track
within-study smoking-cessation changes. Use this two-level (current vs
non-smoker) encoding when the source paper does not split former and
never smokers. When the source uses a 3-level smoking-status categorical
(never / former / current), use the paired
SMOKE_CURRENT+SMOKE_NEVERindicators below instead – the 3-level encoding cannot be reduced to a singleSMOKEcolumn without losing information.
SMOKE_CURRENT (canonical for current-smoker indicator (paired with SMOKE_NEVER))
-
Description: 1 = current smoker at baseline, 0
otherwise (former or never smoker). Paired with
SMOKE_NEVERto encode a 3-level smoking-status categorical with former smoker as the implicit reference (both indicators = 0). - Units: (binary)
- Type: binary
- Scope: general
-
Reference category: 0 (former smoker, when paired
with
SMOKE_NEVER= 0). The pairing follows theRACE_<GROUP>convention for paired indicators. -
Source aliases:
-
Smoking status = Current/SMOK = 2(case-insensitive) – derived from a 3-level smoking-status column.
-
-
Example models:
Hwang_2023_monalizumab.R(proportional-shift effect on V1:(1 + 0.0484)^SMOKE_CURRENT; reference category former smoker). -
Notes: Baseline-only indicator. See also
SMOKE_NEVER(paired indicator) andSMOKE(binary current-vs-non-smoker encoding when the source paper does not split former vs never).
SMOKE_NEVER (canonical for never-smoker indicator (paired with SMOKE_CURRENT))
-
Description: 1 = never smoker at baseline, 0
otherwise (former or current smoker). Paired with
SMOKE_CURRENTto encode a 3-level smoking-status categorical with former smoker as the implicit reference (both indicators = 0). - Units: (binary)
- Type: binary
- Scope: general
-
Reference category: 0 (former smoker, when paired
with
SMOKE_CURRENT= 0). The pairing follows theRACE_<GROUP>convention for paired indicators. -
Source aliases:
-
Smoking status = Never/SMOK = 0(case-insensitive) – derived from a 3-level smoking-status column.
-
-
Example models:
Hwang_2023_monalizumab.R(proportional-shift effect on V1:(1 - 0.141)^SMOKE_NEVER; reference category former smoker). -
Notes: Baseline-only indicator. See also
SMOKE_CURRENT(paired indicator) andSMOKE(binary current-vs-non-smoker encoding when the source paper does not split former vs never).
Formulation / assay / study
ROUTE_IV (canonical for IV-vs-SC administration route indicator)
- Description: 1 = subject received intravenous (IV) administration, 0 = subcutaneous (SC) administration. Per-subject (study-fixed) covariate flagging the dosing route when a population analysis pools cohorts that differ by route, with covariate effects on PK parameters that capture route-specific disposition behaviour.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (SC).
-
Source aliases:
- “Admin route = IV” (categorical effect column in Yu 2022 covariate equations).
-
IV– used inZierhut_2008_osteoprotegerin.R(DDMODEL00000233dataObjcolumn flagging IV vs SC cohort, switching the PK observation residual SD betweenCcpropSdIVandCcpropSdSC). - “IV” / “SC arm” – used in
Wang_2021_pertuzumab.R(per-subject FeDeriCa arm indicator P+H IV vs PH FDC SC, switching the proportional residual SD betweenCcpropSdIvandCcpropSdSc).
-
Example models:
-
Yu_2022_ofatumumab.R(exponential effect on R0, CL, Q, ksyninf). -
Zierhut_2008_osteoprotegerin.R(per-subject indicator switching the PK observation residual SD between the IV cohort (CcpropSdIV) and the SC cohort (CcpropSdSC)). -
Wang_2021_pertuzumab.R(per-subject indicator switching the proportional residual SD between the IV (CcpropSdIv= 0.175) and SC (CcpropSdSc= 0.155) cohorts of the FeDeriCa popPK). -
Fiedler-Kelly_2019_fremanezumab.R(per-subject indicator switching both the central volume of distribution (Vc,IV = 2.98 L FIXED vs Vc,SC = 1.88 L) and the residual-error structure (IV: proportional-only with SD = sqrt(0.0467) = 0.21610; SC: combined additive sqrt(0.204) + proportional sqrt(0.0531)) in the pooled phase 1/2b/3 fremanezumab popPK).
-
-
Notes: This is the per-subject covariate-equation
indicator, distinct from the dosing-event
cmtcolumn that names the target compartment. When simulating, setROUTE_IV = 1for IV cohorts and dose into the central compartment; setROUTE_IV = 0for SC cohorts and dose into the depot. Scope: specific because the set of parameters that differ by route is paper-specific (Yu 2022 carries route-specific exponential effects on disposition parameters; Zierhut 2008, Wang 2021, and Fiedler-Kelly 2019 carry route-specific PK observation residual SDs; Fiedler-Kelly 2019 additionally carries a route-specific central volume of distribution).
DEVICE_AI (canonical for autoinjector-vs-prefilled-syringe SC device indicator)
- Description: 1 = subject’s SC dose delivered via autoinjector (AI), 0 = prefilled syringe (PFS). Per-subject (study-fixed) covariate flagging the SC delivery device when a model carries device-specific PK effects.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (PFS).
- Source aliases: “Formulation = AI” (categorical effect column in Yu 2022 covariate equations).
-
Example models:
Yu_2022_ofatumumab.R(exponential effect on k_e(P) and R0),Diep_2026_donidalorsen.R(Phoenix linear-effect(1 + e_device_ai_ka * DEVICE_AI)on the typical SC absorption rate constant with theta = +0.262 -> multiplier 1.262 for autoinjector vs vial-and-syringe reference; characterized in the ISIS 721744-CS9 single-dose bioequivalence cohort). -
Notes: Set to 0 (PFS / vial reference) for IV
subjects, since the device is undefined for IV; the IV-specific effects
are captured by
ROUTE_IVinstead. Scope: specific because the AI / vial / PFS contrast and which parameters it affects depend on the study’s device-comparison design.
INJSITE_ARM (canonical for SC injection-site = arm indicator)
- Description: 1 = subject’s SC dose injected into the arm, 0 = abdomen (the universal SC reference site across the popPK literature). Per-dose-record covariate flagging the SC injection site when a population analysis estimates site-specific absorption parameters.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (abdomen).
- Source aliases: paper narrative “arm” / “abdomen” subgroup labels driving site-specific ka in Diep 2022.
-
Example models:
Diep_2022_eplontersen.R(additive log-shifte_injsite_arm_ka = log(ka_arm / ka_ab)on the typical absorption rate constant: ka_arm = 0.217 1/h vs ka_ab = 0.282 1/h, ~30% higher ka for abdomen; INJSITE_ARM = 1 selects the arm typical value),Diep_2026_donidalorsen.R(Phoenix linear-effect(1 + e_injsite_arm_ka * INJSITE_ARM)on the typical absorption rate constant with theta = -0.338 -> multiplier 0.662 for arm; the paper’s reference category is “abdomen or thigh” rather than “abdomen” alone, but is consistent with the canonical reference because abdomen is the universal SC reference site and the thigh effect is pooled into the reference category by the Diep 2026 model). -
Notes: Specific scope because the arm-vs-abdomen
contrast is paper-specific. Sister canonical to
INJSITE_THIGH(thigh-vs-abdomen indicator anticipated for future SC-route models with thigh-specific absorption). Per-administration rather than per-subject – a subject in a multi-dose simulation can switch SC injection sites between doses; supply the indicator on each dose record. Distinct fromROUTE_IV(IV vs SC route, not within-SC site) and fromDEVICE_AI(autoinjector vs prefilled syringe, device rather than anatomical site).
STUDY_APLIOS (canonical for APLIOS bioequivalence study indicator)
- Description: 1 = subject enrolled in the APLIOS bioequivalence study (NCT03560739; phase 2; ofatumumab AI vs PFS in RMS), 0 = other study in the Yu 2022 pooled analysis.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-APLIOS studies: OMS115102, MIRROR, ASCLEPIOS I, ASCLEPIOS II).
- Source aliases: “Study = APLIOS” (categorical effect column in Yu 2022 covariate equations).
-
Example models:
Yu_2022_ofatumumab.R(exponential effect on Emax of B cell lysis). - Notes: Captures a between-study shift in the maximum B-cell lysis stimulatory effect not explained by the other covariates in the final model.
STUDY_MIRROR (canonical for MIRROR dose-finding study indicator)
- Description: 1 = subject enrolled in the MIRROR dose-finding study (NCT01457924; phase 2; SC ofatumumab dose-ranging in RRMS), 0 = other study in the Yu 2022 pooled analysis.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-MIRROR studies: OMS115102, APLIOS, ASCLEPIOS I, ASCLEPIOS II).
- Source aliases: “Study = MIRROR” (categorical effect column in Yu 2022 covariate equations).
-
Example models:
Yu_2022_ofatumumab.R(exponential effect on B cell elimination rate kout). - Notes: Captures a between-study shift in the B cell elimination rate not explained by the other covariates in the final model.
REGI_BID (canonical for twice-daily dosing-regimen indicator)
- Description: 1 = subject’s dosing regimen is BID (twice daily), 0 = QD (once daily) or other non-BID regimen. Per-subject (regimen-fixed) categorical indicator for population analyses that pool QD and BID arms and test regimen as a covariate.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-BID regimen – typically QD).
-
Source aliases:
-
BID– used inGirard_2012_pimasertib.R.
-
-
Example models:
Girard_2012_pimasertib.R(additive shift on the cumulative-logit AE-score model:theta_bid * REGI_BID; -0.399 logit units for BID vs QD). -
Notes: Specific scope because the QD-vs-BID
contrast is study-specific; future regimen-comparison models that
contrast different schedules should either extend this entry’s example
list (when QD is the reference) or register a sibling indicator
(
REGI_TID,REGI_QW) following the same pattern. Distinct fromDOSE(dose level in mg) and from total-daily-dose aggregates: a 60 mg/day cohort can include either a 60 mg QD subgroup or a 30 mg BID subgroup, and both share the sameDOSE = 60while differing inREGI_BID.
MEAL_FLAG (canonical for intra-day meal-window indicator (time-varying))
-
Description: 1 = the current observation time falls
within a meal window (typically lunch or dinner, ~1 hour duration each);
0 = no meal-driven physiological perturbation active. Distinct from
FED(per-dose-record fed-vs-fasted indicator): MEAL_FLAG is a time-varying intra-day flag that switches on at meal onset, stays on through the meal duration, and switches off afterward. Used by enterohepatic-recirculation / gallbladder-contraction models that need to scale post-prandial transport rate constants for the duration of a meal. - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (no meal effect active at this time).
- Source aliases: paper narrative “meal effect on K_BI” / “post-prandial gallbladder contraction” in Zuo 2016.
-
Example models:
Zuo_2016_UDCA.R(multiplies the biliary-to-intestine rate constants K_BI,0/1/2 by E_meal = 35.33 during meal windows to simulate gallbladder contraction; two meals modelled per day, lunch at +4 h and dinner at +10 h after the morning dose, each 1 hour long). -
Notes: Specific scope because the operational
definition (meal duration, schedule, magnitude of the rate-constant
scaling) is paper-defined. Time-varying: must be supplied at every
observation row in the event dataset. Pair with
SNACK_FLAGfor studies that distinguish meal and snack effects. Distinct fromFEDandFED_HIGHFAT(per-dose-record meal-state indicators tied to a single dosing event); MEAL_FLAG is decoupled from any specific dose record and instead drives ongoing physiology over a multi-hour window.
SNACK_FLAG (canonical for intra-day snack-window indicator (time-varying))
-
Description: 1 = the current observation time falls
within a snack window (typically 0.5 hour duration, a smaller
perturbation than a meal); 0 = no snack-driven physiological
perturbation active. Used in tandem with
MEAL_FLAGby enterohepatic-recirculation / gallbladder-contraction models that scale post-prandial transport rate constants with a smaller magnitude for snacks than for full meals. - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (no snack effect active at this time).
- Source aliases: paper narrative “snack effect on K_BI” in Zuo 2016.
-
Example models:
Zuo_2016_UDCA.R(multiplies the biliary-to-intestine rate constants K_BI,0/1/2 by E_snack = 9.53 during snack windows; one snack modelled per day at +7 h after the morning dose in the Xiang 2011 single-dose study, 0.5 hour long). -
Notes: Specific scope; same per-paper-defined
operational definition as
MEAL_FLAG. Time-varying.
FRACABS (canonical for dose-record fractional absorption supplied as a data column)
-
Description: Per-dose-record fractional absorption
(0-1) supplied as a covariate when the source paper reports F as a
dose-dependent function rather than as a single estimable
bioavailability parameter. The model wires the covariate into a
bioavailability hook (
f(<depot>) <- FRACABS); the user supplies the per-dose F value derived from the paper’s regression / lookup table. - Units: fraction
- Type: continuous
- Scope: specific
- Reference category: n/a (continuous covariate).
- Source aliases: paper narrative “F = 0.66 at 150 mg” / “F = 0.31 at 1000 mg” / Walker 1992 / Crosignani 1991 / Dilger 2012 reported absorption values in Zuo 2016.
-
Example models:
Zuo_2016_UDCA.R(dose-dependent fractional absorption derived from a log-dose linear regression: F = 0.66 at 150 mg and F = 0.31 at 1000 mg, with R^2 = 0.99 over 200-2000 mg per the paper; combined with the UDCA molecular-weight conversion in the bioavailability hookf(stomach_udca) <- FRACABS / mw_udca). -
Notes: Specific scope because the operational
definition (which doses get which F value; whether F is treated as a
known input or as a parameter to be re-estimated) is paper-defined.
Distinct from
lfdepot/f(depot)in models where bioavailability is an estimable PK parameter rather than a supplied covariate. When the paper’s F regression is itself a function ofDOSE, the user can derive FRACABS from the DOSE column upstream ofrxSolverather than carrying the regression insidemodel().
FED (canonical for fed-vs-fasted dose-record indicator)
- Description: 1 = fed state at dosing, 0 = fasted.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (fasted).
-
Example models:
Kyhl_2016_nalmefene.R,Goel_2016_Sonidegib.R(the Goel 2016 healthy-fasted F effect is applied viae_healthy_fast_f ^ (DIS_HEALTHY * (1 - FED)); cancer-patient records have FED = 1, healthy-fasted-arm records have FED = 0, high-fat-meal arm records have FED = 1 and the additional FED_HIGHFAT = 1 indicator).
FED_HIGHFAT (canonical for high-fat-meal-at-dosing indicator)
-
Description: 1 = oral dose administered after a
high-fat meal, 0 = oral dose administered under any other meal condition
(typically fasted or light meal). Refines the more general
FEDindicator for studies that specifically test the high-fat-meal food effect on bioavailability or absorption rate (a common solubility-limited absorption phenotype). - Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (non-high-fat meal condition; most often “fasted” or “2 h post light meal” as defined per study protocol). Document the operational reference per model.
-
Source aliases:
-
Fatmeal– used inGoel_2016_Sonidegib.R(covariate on F). -
FATM– used inGoel_2016_Sonidegib.R(covariate on Ka; same indicator asFatmealin that paper).
-
-
Example models:
Goel_2016_Sonidegib.R(multiplicative effect on F:5.74^FED_HIGHFAT– ~5.7-fold higher F under high-fat meal vs 2 h post-light-meal reference; multiplicative effect on Ka:1.01^FED_HIGHFAT– no meaningful effect). -
Notes: Distinct from
FED(binary fed-vs-fasted):FED_HIGHFATcarries the specific “high-fat meal” semantic (typically >= 800 kcal, >= 50% calories from fat per FDA guidance). Document the per-protocol meal definition incovariateData[[FED_HIGHFAT]]$notes. When a paper reports a high-fat-meal arm and a separate fasted-healthy arm (e.g., Goel 2016), useFED_HIGHFATfor the high-fat semantic and the existingFED+DIS_HEALTHYindicators for the composite healthy-fasted effect (Goel 2016 applies the e_healthy_fast_f effect via(DIS_HEALTHY * (1 - FED))); the retiredHV_FASTcomposite indicator was deleted on 2026-05-11.
MULTI_DOSE_PT (canonical for multiple-dose-phase-in-patients indicator)
- Description: 1 = dose record from the multiple-dose phase of a clinical-pharmacology study in patients, 0 = otherwise (single-dose run-in records, healthy-volunteer records, or first-dose records in patient studies). Captures any systematic shift in apparent bioavailability between the controlled run-in and the longer multiple-dose phase, typically driven by variable food-restriction compliance over many dosing days.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (single-dose / run-in / healthy-volunteer dose records).
-
Source aliases:
-
FMDD– used inGoel_2016_Sonidegib.R(Goel 2016 covariate on F).
-
-
Example models:
Goel_2016_Sonidegib.R(multiplicative effect on F:1.16^MULTI_DOSE_PT– ~16% higher apparent F during the multiple-dose phase relative to first dose, attributed in the paper to occasional non-fasting compliance). -
Notes: Specific scope because the indicator’s exact
definition (dose-record level vs subject level, run-in inclusion,
occasion boundary) is paper-specific. In Goel 2016, the dataset
distinguishes the run-in single dose from the daily multiple-dose phase;
the indicator switches at the start of the multiple-dose phase for
cancer patients. Distinct from
FEDandFED_HIGHFAT(which are per-record meal-state indicators) and fromREGI_BID(regimen indicator). Future models that need a generic “occasion boundary” effect should consider the existingooc<n>IOV pattern instead.
FORM_TABLET (canonical for tablet vs solution formulation indicator)
- Description: 1 = tablet formulation, 0 = solution.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (solution).
-
Source aliases:
-
TABLET– earlier name used byKyhl_2016_nalmefene.RandTikiso_2021_abacavir.R; renamed toFORM_TABLETfor consistency with theFORM_*family (FORM_CAPSULE, futureFORM_SUSPENSION, etc.).
-
-
Example models:
Kyhl_2016_nalmefene.R(additive shift on residual error: tablet vs solution),Tikiso_2021_abacavir.R(multiplicative effect on the absorption mean transit time MTT: tablet (abacavir + lamivudine fixed-dose-combination tablet) vs liquid solution;mtt *= (1 + 0.249 * FORM_TABLET), i.e. 24.9% slower absorption for the FDC tablet relative to the abacavir liquid reference). -
Notes: Scoped specific because the “tablet vs
solution” contrast is tied to formulation-comparison study designs (Kyhl
2016 nalmefene tablet-vs-oral-solution; Tikiso 2021 abacavir +
lamivudine FDC tablet vs abacavir liquid). Distinct from the
FORM_FDCcanonical (Wilkins 2008 antitubercular fixed-dose-combination of multiple drugs, contrasted against single-drug tablets) because here the comparator is a non-tablet liquid / solution rather than a separate tablet product. Future formulation-comparison models should either extend this entry’s example list when the comparator is a liquid / solution, or register a sibling canonical when contrasting two tablet products.
FORM_CAPSULE (canonical for capsule formulation indicator)
-
Description: 1 = capsule formulation, 0 = the
per-paper comparator non-capsule formulation. The complement formulation
is paper-defined: solution for Hennig 2006 / 2007 itraconazole, tablet
for Gupta 2016 lenvatinib. Document the comparator and the
reference-category bioavailability per-model in
covariateData[[FORM_CAPSULE]]$notes. - Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (per-paper non-capsule
comparator: solution in Hennig 2006 with
fdepotfixed to 1 for the solution arm; tablet in Gupta 2016 with F fixed to 1 for the tablet arm). -
Source aliases:
-
PREP– used inHennig_2006_itraconazole.R(Clin Pharmacokinet 2006;45(11):1099-1114; PREP = 1 = capsule, PREP = 0 = oral solution) and inHennig_2007_itraconazole.R(Br J Clin Pharmacol 2007;63(4):438-450; DOI 10.1111/j.1365-2125.2006.02778.x; same orientation, capsule typical absorption parameters as the published reference). -
CAPSULE– earlier name used in theHennig_2006_itraconazole.RandHennig_2007_itraconazole.Rmodel files before theFORM_*rename.
-
-
Example models:
Hennig_2006_itraconazole.R(Hennig 2006 Table II final estimates: capsuleka0.09 h^-1 vs solutionka0.96 h^-1 and capsule relative bioavailability 0.55 vs solution 1; theetalfdepotIIV applies only to the capsule arm),Hennig_2007_itraconazole.R(selects betweenlka_capandlka_soltypical-value absorption rate constants and appliesf(depot) <- (1 - FORM_CAPSULE) + FORM_CAPSULE * fdepotso the relative bioavailabilityF_rel = 0.817is applied only to the capsule arm),Gupta_2016_lenvatinib.R(relative bioavailability of capsule vs tablet is 0.896; F1 fixed to 1 for the tablet reference; theetalfcapIIV (30.2% CV) applies only to the capsule arm). -
Notes: Scoped specific because the complement
reference category is paper-defined (solution for Hennig itraconazole,
tablet for Gupta lenvatinib). Sibling to
FORM_TABLET(Kyhl 2016 / Tikiso 2021 tablet vs solution) under theFORM_*family. Future formulation-comparison models that need a capsule indicator should reuse this canonical, extending the example list and documenting the comparator in per-model notes.
FORM_SUSPENSION (canonical for tablet-suspended-in-water formulation indicator)
- Description: 1 = subject received the modelled drug as solid oral tablets suspended in a small volume of water at bedside immediately before swallowing (i.e. tablets disintegrated into an extemporaneously-prepared liquid suspension rather than a formally manufactured suspension product); 0 = subject received the same solid oral tablets swallowed whole. Per-dose-occasion (not per-subject) indicator because a single participant can receive both formulations across study occasions in a crossover bioequivalence design.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (tablets swallowed whole; the typical-value MAT reference in Svensson 2018 Table 2).
-
Source aliases:
-
FORM– used inSvensson_2018_bedaquiline.R(paper’s narrative “whole vs suspended” with the suspended formulation as the 1 level in the analytical control stream).
-
-
Example models:
Svensson_2018_bedaquiline.R(multiplicative effect on the typical mean absorption time MAT:mat_typ = exp(lmat) * (1 + e_susp_mat * FORM_SUSPENSION)withe_susp_mat = +0.23(suspended-tablet MAT is 23% longer than whole-tablet MAT; Svensson 2018 Table 2, 95% CI 2.1-48%, P = 0.03); relative bioavailability F is held identical between formulations because the paper found no statistically significant difference (95% nonparametric CI 94-108% within the 80-125% bioequivalence criteria)). -
Notes: Specific scope because the
bedside-suspension manipulation is tied to the paediatric-dosing
motivation of Svensson 2018 (tablets crushed or suspended to enable
administration to children before a paediatric formulation is
available). Distinct from
FORM_TABLET(Kyhl 2016 / Tikiso 2021 tablet vs liquid solution) because the contrast here is the same tablet swallowed two different ways rather than two different drug products; distinct fromFORM_CAPSULEandFORM_POWDERfor the same reason. Future tablet-suspended-in-water bioequivalence extractions (a common paediatric-pharmacokinetics manipulation in tuberculosis, HIV, and oncology paediatric trials) should reuse this canonical and extend the example list. Ratified canonically on 2026-05-16 alongside the Svensson 2018 bedaquiline extraction.
FORM_POWDER (canonical for oral powder formulation indicator)
-
Description: 1 = subject received the oral powder
formulation of the modelled drug; 0 = subject received the comparator
solid oral formulation (typically a tablet, but document the per-paper
comparator in
covariateData[[FORM_POWDER]]$notes). - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (tablet; F = 1 fixed in Yukawa 1990 Model 2).
-
Source aliases:
-
FORM_POWDER– used inYukawa_1990_phenytoin.R(paper’sBAindicator inverted: sourceBA = 1if tablet, 0 if powder; canonicalFORM_POWDER = 1 - BA_indicatorso 0 is the tablet reference).
-
-
Example models:
Yukawa_1990_phenytoin.R(Yukawa 1990 Model 2 dose-dependent powder bioavailabilityF_powder = 1 - exp(-9.92 / DOSE_PHT_MGKGD); tablet F fixed at 1),Retlich_2015_linagliptin.R(multiplicative shift on the linagliptin first-order absorption rate constant Ka: powder-in-bottle Ka = 0.933 1/h vs tablet formulation 2 reference Ka = 0.441 1/h; the tablet formulation 1 comparator is captured by the sibling canonicalFORM_LINAG_TAB1). -
Notes: Specific scope because the “powder vs
tablet” contrast is tied to a particular drug-product manufacturing
comparison (Yukawa 1990 contrasts Aleviatin brand phenytoin powder with
Aleviatin tablets, both from Dainippon Pharmaceutical Co.; Retlich 2015
contrasts an early-phase linagliptin powder-in-bottle formulation
against the marketed linagliptin tablet). Mirrors the sibling
FORM_TABLET(Kyhl 2016 / Tikiso 2021 tablet vs solution) andFORM_CAPSULE(Hennig 2006 / Hennig 2007 capsule vs solution) under theFORM_*family. Future powder-formulation models should reuse this canonical, extending the example list and documenting the per-paper comparator. Ratified canonically on 2026-05-10 alongside the Yukawa 1990 phenytoin extraction.
FORM_SYRUP (canonical for oral syrup / liquid-suspension formulation indicator)
-
Description: 1 = subject received the modelled drug
as an oral syrup or liquid-suspension formulation (paediatric or
extemporaneously-compounded oral suspension); 0 = subject received the
comparator solid oral formulation (typically a capsule, but document the
per-paper comparator in
covariateData[[FORM_SYRUP]]$notes). Per-dose-occasion indicator in principle; in paediatric cohorts often time-fixed per subject by clinical convention. - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (per-paper non-syrup comparator: capsule in Nanga 2019 tacrolimus).
-
Source aliases:
-
syrup formulation– used inNanga_2019_tacrolimus_metaanalysis.R(Table 3 covariate-effect label; relative bioavailability of syrup vs capsule = 0.53).
-
-
Example models:
Nanga_2019_tacrolimus_metaanalysis.R(multiplicative effect on bioavailability per Eq. 4:f(depot) <- 0.53^FORM_SYRUP, so capsule users have F = 1 and syrup users have F = 0.53; Nanga 2019 Table 3 ‘Bioavailability for syrup formulation’ = 0.53 with 95% bootstrap CI 0.31 - 0.75). -
Notes: Specific scope because the comparator
solid-oral reference is paper-defined (capsule in Nanga 2019). Distinct
from
FORM_SUSPENSION(Svensson 2018: tablets extemporaneously suspended in water at bedside immediately before swallowing – same tablet swallowed two different ways, where the manipulation affects MAT rather than F) because here the contrast is between two distinct drug products (commercial capsules vs paediatric syrup / suspension). Distinct fromFORM_CAPSULE,FORM_TABLET, andFORM_POWDER, which compare those solid-oral forms against a liquid solution or against each other. Future paediatric-syrup / oral-suspension formulation comparisons should reuse this canonical; if a future model contrasts syrup against tablet (rather than capsule), extend the per-model notes rather than registering a sibling canonical. Ratified canonically on 2026-05-18 alongside the Nanga 2019 tacrolimus meta-analysis extraction.
FORM_FDC (canonical for fixed-dose-combination antitubercular formulation indicator)
- Description: 1 = subject received the rifampicin-containing fixed-dose-combination (FDC) antitubercular product (rifampicin co-formulated with isoniazid, pyrazinamide, and optionally ethambutol in a single tablet); 0 = subject received the same drugs as separate single-drug tablets (“SDC”, separate-drug-combination). Per-subject (regimen-fixed) categorical covariate flagging the formulation when a population analysis pools FDC and SDC arms and tests formulation as a covariate on absorption / disposition parameters.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 1 (FDC; the most-common
formulation in the Wilkins 2008 cohort and the typical-value reference
for
lmtt/lcl). -
Source aliases:
-
FDC– used inWilkins_2008_rifampicin.R(DDMODEL00000280 NMTRAN$INPUTcolumn; values 0 / 1 with the same orientation as the canonical, 1 = FDC).
-
-
Example models:
Wilkins_2008_rifampicin.R(multiplicative(1 + e_fdc0_mtt * (1 - FORM_FDC))shift on MTT and(1 + e_fdc0_cl * (1 - FORM_FDC))shift on CL; SDC subjects (FDC = 0) had 104% longer MTT and 23.6% higher CL than the FDC = 1 reference per Wilkins 2008 final estimates). -
Notes: Specific scope because the FDC vs SDC
contrast is tied to the antitubercular fixed-dose-combination context
(rifampicin + isoniazid + pyrazinamide +/- ethambutol). Future
antitubercular-FDC models should extend the example list rather than
register a new canonical. Distinct from
FORM_TABLET(Kyhl 2016 nalmefene tablet vs solution) and the rest of theFORM_*(drug-product-version) family because the FDC-vs-SDC contrast compares two tablet products, not a tablet vs a non-tablet, and the underlying mechanism is co-formulation-driven absorption rather than drug-product manufacturing.
RIA_ASSAY (canonical for radioimmunoassay vs LC-MS/MS bioanalytical method indicator)
- Description: 1 = radioimmunoassay; 0 = LC-MS/MS.
- Units: (binary)
- Type: binary
- Scope: specific
-
Example models:
Kyhl_2016_nalmefene.R. - Notes: Switches the additive residual-error magnitude.
FORM_NS0 (canonical for NS0 cell-line formulation indicator)
- Description: 1 = NS0 cell-line formulation, 0 = other.
- Units: (binary)
- Type: binary
- Scope: specific
-
Example models:
Zhu_2017_lebrikizumab.R.
FORM_CHO_PHASE2 (canonical for CHO Phase 2 formulation indicator)
- Description: 1 = CHO Phase 2 formulation, 0 = other.
- Units: (binary)
- Type: binary
- Scope: specific
-
Example models:
Zhu_2017_lebrikizumab.R.
CONMED_EOX (canonical for concomitant EOX (epirubicin + oxaliplatin + capecitabine) chemotherapy backbone indicator)
- Description: 1 = concomitant epirubicin + oxaliplatin + capecitabine (EOX) chemotherapy backbone, 0 = other backbone (e.g., mFOLFOX6, CAPOX, or single-agent).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-EOX backbone).
-
Source aliases:
COMB(used by Yamada 2025 Table 1 with the EOX level coded as the non-reference category; renamed toCONMED_EOXto preserve the semantic meaning of the 1-level). -
Example models:
Yamada_2025_zolbetuximab.R(fractional effect on V1). -
Notes: Disease-backbone indicator. If a future
model needs more backbone categories, encode each as its own indicator
(
COMB_CAPOX,COMB_FOLFOX, …) with a single reference group.
FORM_P2F2 (canonical for isatuximab P2F2 drug-material indicator)
- Description: 1 = isatuximab P2F2 drug material (intended commercial / phase III material, used in the EFC14335 / ICARIA-MM study), 0 = P1F1 drug material (early-phase material used in TED10893 / TED14154 / TCD14079).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (P1F1).
-
Source aliases:
-
Drug_mat– used inFau_2020_isatuximab.R. Values 0 / 1 with the same orientation as the canonical (1 = P2F2 / commercial-bound material).
-
-
Example models:
Fau_2020_isatuximab.R(exponential effect on Vc with coefficient -0.137; P2F2 patients had ~13% lower Vc than P1F1). - Notes: Phase III / commercial-bound formulation indicator for isatuximab; the FORM_* family stays scope-specific per nlmixr2lib policy that drug-product-version indicators are kept model-specific unless they generalize across multiple drugs. Set to 1 to simulate the marketed material.
FORM_LINAG_TAB1 (canonical for linagliptin tablet formulation 1 indicator)
-
Description: 1 = subject received the linagliptin
“tablet formulation 1” (used in Retlich 2015 Study 2), 0 = subject
received tablet formulation 2 (the marketed linagliptin tablet, used in
Studies 3 and 4) OR the powder-in-bottle formulation (used in Study 1).
The powder-vs-tablet contrast is captured by the sibling canonical
FORM_POWDER; this indicator switches between the two tablet formulations. - Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (tablet formulation 2 =
marketed linagliptin tablet, the typical-value Ka reference; or
FORM_POWDER = 1for the powder). - Source aliases: none known.
-
Example models:
Retlich_2015_linagliptin.R(multiplicative shift on the linagliptin first-order absorption rate constant Ka; typical Ka = 0.441 1/h for tablet formulation 2 (reference), 0.795 1/h for tablet formulation 1, 0.933 1/h for the powder formulation). -
Notes: Specific scope because the linagliptin
“tablet 1 vs tablet 2” distinction is a drug-product-version comparison
local to the Retlich 2015 popPK dataset; tablet formulation 1 was a
development formulation that is not marketed. Mirrors the
FORM_DP2(sarilumab) andFORM_P2F2(isatuximab) entries under theFORM_*family. Set to 0 for routine marketed-formulation simulation. Ratified canonically alongside the Retlich 2015 linagliptin extraction.
FORM_DP2 (canonical for sarilumab drug-product version 2 indicator)
- Description: 1 = sarilumab drug product 2 formulation (used in some phase I studies and the dose-ranging phase II study), 0 = other drug product (DP1 or DP3; DP3 is the commercial formulation).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (DP1 or DP3).
-
Source aliases:
-
DP2– used inXu_2019_sarilumab.R.
-
-
Example models:
Xu_2019_sarilumab.R. - Notes: Affects both CLO/F (1.30x multiplier) and Ka (0.663x multiplier) in Xu 2019. Set to 0 for routine commercial-formulation simulation.
FORM_VISMO_PHASEI (canonical for vismodegib Phase I (dry-blend capsule) formulation indicator)
- Description: 1 = subject received the Phase I clinical-development vismodegib formulation (dry-blend capsules used in early-phase studies SHH3925g and the Phase I cohort of SHH4610g); 0 = subject received the Phase II / commercial-bound formulation (wet-granulation capsules used in SHH4476g, SHH4433g, and most of SHH4683g / SHH4871g). Per-subject (regimen-fixed) categorical indicator.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (Phase II / commercial
wet-granulation capsule; F = 1 fixed as the reference in Lu 2015 Eq. 4
and the typical-value
kareference in Eq. 4). - Source aliases: none known; Lu 2015 reports formulation as a paper-defined “Phase I vs Phase II formulation” categorical without committing to a single column name.
-
Example models:
Lu_2015_vismodegib.R(multiplicative shifts on Ka and relative bioavailability F: typical Ka = 9.025 1/day for the Phase II reference in patients, withexp(-0.602) = 0.55xfor the Phase I formulation andexp(0.671) = 1.96xfor healthy volunteers; F = 1 for the Phase II reference, F = 0.346 for the Phase I formulation in patients and 0.836 in HV). -
Notes: Specific scope because the “Phase I (dry
blend) vs Phase II (wet granulation) capsule” contrast is tied to the
vismodegib drug-product-version comparison in Lu 2015. Mirrors the
FORM_DP2(sarilumab),FORM_P2F2(isatuximab), andFORM_LINAG_TAB1(linagliptin) entries under theFORM_*family of drug-product-version indicators. Set to 0 for routine commercial-formulation simulation. Ratified canonically alongside the Lu 2015 vismodegib extraction.
FORM_TAC_IR (canonical for tacrolimus immediate-release vs prolonged-release formulation indicator)
- Description: 1 = subject received the twice-daily immediate-release tacrolimus formulation (Prograf, Astellas) administered every 12 hours; 0 = subject received the once-daily prolonged-release tacrolimus formulation (Advagraf in Europe, Astagraf XL in the US; both Astellas) administered every 24 hours. Per-subject (regimen-fixed; per-occasion in cross-over conversion studies) categorical indicator used in popPK analyses that pool the two oral tacrolimus formulations and test formulation as a covariate on absorption and disposition parameters.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (Advagraf / Astagraf XL prolonged-release; the typical-value reference for Ktr and Vc/F in Woillard 2011 Table 4).
-
Source aliases:
-
study– used inWoillard_2011_tacrolimus.R(Woillard 2011 Methods: “study factor (assumed to be similar to drug formulation) … study = 1 for the Prograf cohort, study = 0 for the Advagraf cohort”). The source paper’sstudyandformulationfactors are aliased; the canonical preserves the source paper’s orientation (Prograf = 1 = IR, Advagraf = 0 = PR), which also matches the standard convention of treating the older immediate-release formulation as the non-reference level. -
FORMULATION(with 1 = PR-T, opposite polarity) – used inLu_2019_tacrolimus_industry_meta.R. Lu 2019 writes the Ka covariate equation asKa = 0.375 * (1 - (1 - theta_form) * FORMULATION)withFORMULATION = 1for prolonged-release; the model derivesform_pr = 1 - FORM_TAC_IRinsidemodel()to match the paper’s published equation while keeping the canonical column oriented Prograf = 1.
-
-
Example models:
Woillard_2011_tacrolimus.R(multiplicative effects per Woillard 2011 Table 4:Ktr = theta1 * theta2^FORM_TAC_IRwiththeta1 = 3.34/handtheta2 = 1.53(Prograf ~53 % faster absorption than Advagraf);Vc/F = theta6 * theta7^FORM_TAC_IRwiththeta6 = 486 Landtheta7 = 0.29(Prograf Vc/F is 29 % of the Advagraf reference)),Lu_2019_tacrolimus_industry_meta.R(linear effect on Ka per Lu 2019 Table 3 ‘Prolonged-release tacrolimus on Ka’ = 0.499, encoded asKa(PR-T) = Ka(IR-T) * 0.499viaform_pr = 1 - FORM_TAC_IR; 50 % slower absorption for PR-T vs IR-T). -
Notes: Scope promoted to
generalafter Lu 2019 corroborated the Prograf-vs-Advagraf contrast in a second, larger-cohort population (Woillard 2011 n = 173 + 174, Lu 2019 n = 408 across 8 Astellas Phase II studies). Mirrors the existingFORM_DP2(sarilumab),FORM_P2F2(isatuximab),FORM_LINAG_TAB1(linagliptin), andFORM_VISMO_PHASEI(vismodegib) entries under theFORM_*family of drug-product-version indicators. The Woillard 2011 paper notes the formulation effect partially confounds with time-post-transplant (Prograf cohort sampled within the first 6 months post-transplant, Advagraf cohort > 12 months post-transplant); Lu 2019 has no such confounding because most studies are within-subject IR-T-to-PR-T conversions. Future tacrolimus models that include Envarsus XR (modified-release once-daily granules) or LCP-Tacro (life-cycle-pharma melt-extrusion tablets) should register a sibling canonical (e.g.,FORM_TAC_ENVARSUS) rather than overloadingFORM_TAC_IR. Ratified canonically alongside the Woillard 2011 tacrolimus extraction.
dilution (canonical for diluted-drug-product indicator)
- Description: 1 = drug diluted (Soehoel 2022 study D2213C00001), 0 = not diluted.
- Units: (binary)
- Type: binary
- Scope: specific
-
Example models:
Soehoel_2022_tralokinumab.R. -
Notes: Lower-case preserved from source; future
models should rename to
DILUTION. Kept as alias here to match existing file.
nonECZTRA (canonical for non-ECZTRA-trial indicator)
- Description: 1 = not the ECZTRA trial; 0 = ECZTRA.
- Units: (binary)
- Type: binary
- Scope: specific
-
Example models:
Soehoel_2022_tralokinumab.R. -
Notes: Mixed case preserved from source; future
models should rename to
NON_ECZTRAorSTUDY_NON_ECZTRA.
SEASON2 (canonical for second RSV season at dosing indicator)
- Description: 1 = second RSV season at dosing, 0 = first RSV season.
- Units: (binary)
- Type: binary
- Scope: specific
-
Example models:
Clegg_2024_nirsevimab.R. - Notes: Study-specific but semantically general (second-exposure indicator). Promote to general if a second RSV-season model adopts the same semantics.
COHDOSE (canonical for randomized dose cohort (mg/kg))
- Description: Randomized dose cohort expressed in mg/kg. Subject-level (time-fixed) covariate carrying the per-subject cohort dose in a study where each subject remained on a single escalating-cohort dose for the full dosing period.
- Units: mg/kg
- Type: continuous
- Scope: specific
-
Reference category: n/a – used with power scaling
(COHDOSE / ref)^exponent. Reference value observed: 1 mg/kg in Narwal 2013. -
Source aliases:
-
DOSE– used inNarwal_2013_sifalimumab.R(the paper’s Eq. 3 variable name; renamed toCOHDOSEhere to avoid colliding with the rxode2/nlmixr2 event-column convention whereDOSEorAMTcarries the administered dose).
-
-
Example models:
Narwal_2013_sifalimumab.R(reference 1 mg/kg, exponent 0.0542 on CL). -
Notes: Scope: specific because the interpretation
depends on a study design where each subject stays on a single dose
cohort. For fixed-dose simulations, set
COHDOSE = nominal_dose_mg / WTper subject. When the subject receives a weight-based dose,COHDOSEis the mg/kg label (0.3, 1, 3, or 10 mg/kg for the MI-CP152 cohorts).
DOSE (canonical for current administered dose level supplied as a data column)
-
Description: Continuous covariate carrying the
administered dose level (in mg) as a per-record data column. Two
complementary use cases share this canonical:
- Per-subject assigned dose – each subject’s fixed assigned dose level (in mg) across the study, used as a power-style or stratified covariate when the population PK model detects a dose-dependent shift in a PK parameter (central volume, clearance, etc.).
-
Time-varying current administered dose – the
current daily dose at the time of the record, used in PD-only models
that derive a per-cycle exposure metric (e.g.,
AUC = DOSE / CLI) from a posthoc-CL covariate without instantiating a PK ODE. The column is set to 0 during off-treatment periods (drug holidays, placebo arm) so the derived exposure becomes 0.
-
Units: mg (document per-model in
covariateData[[DOSE]]$unitsif a different dose unit is used). - Type: continuous
-
Reference category: n/a – used with power scaling
(DOSE / ref)^exponentfor use case (a), or directly inside derived-exposure expressions for use case (b). Reference values observed: 600 mg inZheng_2016_sifalimumab.R(middle of the 200/600/1200 mg phase IIb dose range). -
Source aliases:
-
Dose– used inZheng_2016_sifalimumab.RandCastro-Surez_2020_nimotuzumab.R. -
DOS– used in the Hansson 2013 sunitinib biomarker / TGI / fatigue PD-model family (DDMODEL00000197 and siblings) as a per-record sunitinib dose column.
-
-
Example models:
Zheng_2016_sifalimumab.R(power effect on V1 with exponent 0.06),Castro-Surez_2020_nimotuzumab.R(binary-indicator usage(DOSE == 50)applying a 53 % decrease in V1 for the 50 mg cohort),Hansson_2013a_sunitinib.R(DDMODEL00000197; time-varying record-level dose feedingAUC = DOSE / CLI),Hansson_2013b_sunitinib.R(DDMODEL00000198; same time-varyingAUC = DOSE / CLIform for the tumor growth inhibition model),Schindler_2016_sunitinib.R(DDMODEL00000221; sameAUC = DOSE / CLIform, with the daily-dose column toggling between 50 mg/day on-cycle and 0 on dose-holiday records),Schindler_2017_imatinib.R(time-varying daily imatinib dose in mg/day feeding the size and density drug-effect terms asKdrug,S * (DOSE / 400) * exp(-k * t)andKdrug,D * (DOSE / 400), with 400 mg/day as the reference normalisation),Girard_2012_pimasertib.R(linear coefficient on the dropout-hazard log-rate:exp(beta * DOSE)Weibull multiplier; per-subject daily dose, observed range 1-255 mg). -
Notes: Distinct from
DOSE_70MG(binary indicator for a specific dose group in a trinary-dose design) and from the rxode2/nlmixr2 event columnamt(which carries the administered dose at dose events). For use case (a), the values are typically time-fixed per subject; for use case (b), the values are time-varying with on/off cycling – for sunitinib 4-weeks-on / 2-weeks-off cycling, setDOSE = nominal_daily_mg(e.g., 50) during on-cycles and 0 during off-cycles or for the placebo arm. Per-modelcovariateData[[DOSE]]$notesshould state which use case applies.
CD (canonical for cumulative cladribine dose (time-varying))
- Description: Time-varying cumulative cladribine dose (mg total dose, not body-weight-normalized) administered to each subject up to the current observation time. Stays at zero during the placebo arm and during the pre-dose baseline phase, rises stepwise across the dosing schedule (cladribine is given as short oral pulses), and remains constant between dose events.
- Units: mg
- Type: continuous
- Scope: specific
-
Reference category: n/a – used inside
EXPS = CD * 104.5 / CRLas an exposure surrogate driving an Emax-style symptomatic effect on disease progression, not as a power-form covariate. -
Source aliases:
CD– column name used in the DDMODEL00000223 input dataset (Simulated_Novakovic_2016_multiplesclerosis_cladribine_irt.csv). -
Example models:
Novakovic_2017_cladribine.R. -
Notes: Distinct from
DOSE(per-subject assigned dose level, time-fixed) andCOHDOSE(mg/kg cohort label, time-fixed).CDis the cumulative dose accrued at each timepoint, supplied as a time-varying covariate column rather than via dosing events because the Novakovic 2017 model does not carry an explicit cladribine-PK compartment. Scope: specific because the constant 104.5 inside the exposure-surrogate equation is hard-coded for cladribine in the source.
TRT (canonical for treatment-cohort indicator)
- Description: Treatment-cohort indicator used in the Novakovic 2017 cladribine IRT model. 0 = placebo, 1 = cladribine 3.5 mg/kg cumulative-dose cohort, 2 = cladribine 5.25 mg/kg cumulative-dose cohort.
- Units: (categorical)
- Type: categorical
- Scope: specific
- Reference category: 0 (placebo).
-
Source aliases:
TRT– column name used in the DDMODEL00000223 input dataset (Simulated_Novakovic_2016_multiplesclerosis_cladribine_irt.csv). -
Example models:
Novakovic_2017_cladribine.R. -
Notes: Gates the symptomatic and protective
drug-effect terms via
TRT >= 1 && t > 0; the categorical level (1 vs 2) is informational because the dose-response is driven by the time-varyingCDcovariate and the per-cohort dosing schedule, not byTRTitself. Scope: specific because the cohort labelling (3.5 vs 5.25 mg/kg cumulative dose over 2 years) is tied to the CLARITY-program cladribine dosing schedule. Future models that need a generic on-treatment indicator should register a new canonical name (e.g.,ON_TREATMENT) rather than reusingTRT.
DRUG_ORMU (canonical for Ormutivimab vs HRIG drug-product comparator indicator)
- Description: 1 = subject received Ormutivimab (a recombinant human anti-rabies IgG1 monoclonal antibody, also referred to in the source paper as rHRIG); 0 = subject received plasma-derived human rabies immunoglobulin (HRIG) comparator (or placebo + vaccine without passive antibody). Per-subject (time-fixed) categorical indicator carrying the head-to-head drug-arm assignment in a randomized rabies-vaccine pharmacodynamics study where rabies virus neutralizing antibody (RVNA) activity is modelled with a time-dependent Emax response to the vaccine and the passive antibody product modifies the typical-value Emax / ET50.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (HRIG / no passive antibody).
-
Source aliases:
-
antibody type– Zhang 2022 Results section 3.4 covariate-effect prose (“antibody type was determined as the covariate significantly affecting the model”); the source NMTRAN column name is not separately reported.
-
-
Example models:
Zhang_2022_ormutivimab.R(additive typical-value shifts on the linear-scale Emax and ET50 of the vaccine-induced RVNA Emax model:Emax_tv = exp(lEmax) + e_drug_ormu_Emax * DRUG_ORMUwithe_drug_ormu_Emax = +0.143 IU/mLandET50_tv = exp(lET50) + e_drug_ormu_ET50 * DRUG_ORMUwithe_drug_ormu_ET50 = -3.8 day, yielding a higher and faster vaccine-induced antibody peak in the Ormutivimab arms relative to the HRIG comparator). -
Notes: Distinct from the
FORM_*family (within-product formulation-version contrasts of a single drug) because the contrast here is between two biologically distinct products: HRIG is a polyclonal plasma-derived immunoglobulin, while Ormutivimab is a recombinant monoclonal antibody (CHO-cell-produced; the first rhRIG approved in China). Specific scope because the head-to-head HRIG-vs-rHRIG comparator design is tied to the Zhang 2022 phase II rabies-vaccine study. Future head-to-head rhRIG-vs-HRIG popPD models (e.g., SII Rabishield or Twinrab against HRIG) should register a sibling canonical (DRUG_SIIRMAB,DRUG_TWINRAB) rather than overloadingDRUG_ORMU; cross-product comparisons that need both indicators in the same dataset can carry them as independent binaries with HRIG as the shared reference. Ratified canonically alongside the Zhang 2022 ormutivimab extraction.
DRUG_PRED (canonical for prednisone vs fosdagrocorat drug-arm comparator indicator)
- Description: 1 = subject is in the prednisone (oral glucocorticoid comparator) arm; 0 = subject is in the fosdagrocorat (PF-04171327, dissociated agonist of the glucocorticoid receptor) arm or the placebo arm. Per-subject (time-fixed) binary indicator carrying the head-to-head drug-arm assignment in a phase II randomized trial in adults with rheumatoid arthritis where serum bone-formation biomarkers (P1NP, osteocalcin) are modelled with a K-PD framework and the two drugs share the rebound and response-side parameters but carry separate KDE / Imax / EDK50 values.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (fosdagrocorat or placebo).
- Source aliases: derived per subject from the trial-arm assignment; the source paper does not name an NMTRAN column.
-
Example models:
Shoji_2017_fosdagrocorat_p1np.R,Shoji_2017_fosdagrocorat_oc.R(binary multiplier switching the active KDE and the active sigmoid-Emax inhibition parameters between fosdagrocorat and prednisone:lkde_active = lkde_fos * (1 - DRUG_PRED) + lkde_pred * DRUG_PRED, with analogous switching oflogitimax_active/imax_activeandledk50_active; the rebound parametersRBmax,T50and the response-side parametersKd,BL,SLPare shared between the drugs and do not multiply byDRUG_PRED). -
Notes: Sibling of
DRUG_ORMU(rhRIG vs HRIG) and follows the same “head-to-head drug-arm” pattern: the contrast is between two structurally distinct active comparators in a randomized trial. Specific scope because the fosdagrocorat-vs-prednisone head-to-head is tied to the Shoji 2017 P1NP / OC analyses; future DAGR-vs-prednisone (or DAGR-vs-other glucocorticoid) popPK/PD models can extend this entry’sexample_modelslist, while a contrast between a different test drug and prednisone (e.g., methylprednisolone vs prednisone) should register a sibling canonical rather than reusingDRUG_PRED. Ratified canonically alongside the Shoji 2017 P1NP / OC extractions.
DOSE_70MG (canonical for 70 mg dose regimen indicator)
- Description: 1 = subject is on the 70 mg SC Q4W dose regimen, 0 = subject is on the 210 or 490 mg SC Q4W regimen.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (210 mg or 490 mg Q4W regimen).
- Source aliases: derived per subject from the trial-assigned dose level.
-
Example models:
Kotani_2022_astegolimab.R. -
Notes: Zenyatta-study categorical covariate
flagging the 70 mg group (lowest dose), modeled as a -15.3% relative
change on relative bioavailability. Modeled by Kotani 2022 as
70 mg vs {210 mg, 490 mg}combined reference.
DOSE_50MG (canonical for 50 mg dose administration indicator)
- Description: 1 = dose record is a 50 mg SC administration, 0 = all other SC doses (100, 150, 200, 300 mg) and all IV doses.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (100-300 mg SC or any IV dose).
-
Source aliases: derived per dose record from the
administered amount (
AMT). -
Example models:
Othman_2014_daclizumab.R,Diao_2016_daclizumab_cd25.R,Diao_2016_daclizumab_cd56bright.R,Diao_2016_daclizumab_treg.R. -
Notes: Othman 2014 estimated two separate absolute
bioavailabilities because of non-linear dose-normalized exposure at the
50 mg SC dose – F = 0.84 for the therapeutic 100-300 mg SC range and F =
0.57 for the 50 mg SC cohort. Encoded here as a record-level indicator
so the covariate effect
e_dose_50mg_f = 0.57/0.84 - 1 = -0.321scales bioavailability only on 50 mg SC doses. For clinical-range simulation (150 mg SC Q4W Phase III regimen) leaveDOSE_50MG = 0. The Diao 2016 PK/PD models inherit the Othman 2014 PK backbone verbatim; they carryDOSE_50MGeven though the Diao 2016 RRMS regimens are 150 / 300 mg SC only.
STUDY1 (canonical for Study-1 cohort indicator)
- Description: 1 = subject enrolled in Study 1 of the Cirincione 2017 pooled analysis, 0 = other. Used to switch the residual-error magnitude per study.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (all other studies; combined
with
STUDY5 = 0selects the pooled “other” residual error). -
Source aliases:
-
DVID = "study1"(character-valued study identifier;STUDY1 = as.integer(DVID == "study1")) – legacy form previously used inCirincione_2017_exenatide.R.
-
-
Example models:
Cirincione_2017_exenatide.R. -
Notes: Paired with
STUDY5. When both are 0, the subject is in the pooled “other studies” residual-error group.
STUDY5 (canonical for Study-5 cohort indicator)
- Description: 1 = subject enrolled in Study 5 of the Cirincione 2017 pooled analysis, 0 = other. Used to switch the residual-error magnitude per study.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (all other studies).
-
Source aliases:
-
DVID = "study5"(character-valued study identifier;STUDY5 = as.integer(DVID == "study5")) – legacy form previously used inCirincione_2017_exenatide.R.
-
-
Example models:
Cirincione_2017_exenatide.R. -
Notes: Paired with
STUDY1. When both are 0, the subject is in the pooled “other studies” residual-error group.
STUDY_PKU015 (canonical for sapropterin PKU-015 pediatric study cohort indicator)
- Description: 1 = subject enrolled in study PKU-015 (pediatric population pharmacokinetic study of sapropterin in infants and young children, 0-6 years old, of the Qi 2014 pooled analysis); 0 = study PKU-004 (adolescent / adult open-label extension study, >= 9 years old). Used to switch the residual-error magnitude per study under the log-transform-both-sides (LTBS) constant-CV residual model.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (PKU-004 adolescent / adult cohort).
-
Source aliases: derived per subject from the trial
identifier (
PKU-015-> 1,PKU-004-> 0). -
Example models:
Qi_2014_sapropterin.R. -
Notes: Qi 2014 Table 3 reports separate
residual-error estimates for the two studies – PKU-004 = 21.1% CV,
PKU-015 = 30.2% CV under the LTBS approach. The
STUDY_PKU015indicator selects between them. Specific scope because the indicator is tied to the BioMarin sapropterin clinical-development program (PKU-004 = phase 3b extension, PKU-015 = phase 3b pediatric).
PHASE2 (canonical for Phase II study cohort indicator)
- Description: 1 = subject enrolled in the Phase II study (MORAb-003-002) of the Farrell 2012 pooled analysis; 0 = Phase I study (MORAb-003-001). Used to switch the residual-error magnitude per study.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (Phase I).
-
Source aliases: derived per subject from the trial
identifier (
MORAb-003-001-> 0,MORAb-003-002-> 1). -
Example models:
Farrell_2012_farletuzumab.R. -
Notes: Farrell 2012 Table 3 reports separate
residual-error estimates for the two studies – Phase I uses a
proportional-only model (sigma = 20.5%); Phase II uses a combined
additive + proportional model (sigma_prop = 34.9%, sigma_add = 7.94
ug/mL). The
PHASE2indicator selects between them.
PHASE1 (canonical for Phase I study cohort indicator)
- Description: 1 = subject enrolled in a Phase 1 study of the Valenzuela 2025 pooled analysis (MOM-M281-001, MOM-M281-007, MOM-M281-010, EDI1001, EDI1002 – healthy participants); 0 = Phase 2 study (MOM-M281-004 / Vivacity-MG – participants with gMG). Used to switch the proportional PK residual-error magnitude per study phase.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (Phase 2).
-
Source aliases: derived per subject from the trial
identifier (Phase 1 protocols -> 1,
NCT03772587Vivacity-MG -> 0). -
Example models:
Valenzuela_2025_nipocalimab.R. -
Notes: Valenzuela 2025 Table 3 reports proportional
PK residual 0.0834 (Phase 1) vs 0.367 (Phase 2). Distinct from Farrell
2012
PHASE2– the reference category is inverted (Valenzuela 2025 picks Phase 1 as the 1-level).
STUDY_C2201 (canonical for Bienczak 2025 ligelizumab study C2201 cohort indicator)
- Description: 1 = subject enrolled in study C2201 (NCT02477332; Novartis Phase 2b ligelizumab dose-finding study in adult CSU patients) of the Bienczak 2025 pooled ligelizumab PopPK analysis; 0 = any other study in the pool (A2103, C2101, C2202, C2302, or C2303). Used to switch the typical CL/F magnitude in study C2201.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (any non-C2201 cohort in the Bienczak 2025 pool: pooled adult / adolescent CSU patients from C2202 / C2302 / C2303 and adult healthy volunteers from A2103 / C2101).
-
Source aliases: derived per subject from the trial
identifier (
C2201-> 1, else -> 0). -
Example models:
Bienczak_2025_ligelizumab.R(Table S6: study C2201 on CL/F = 0.176, log-additive;cl *= exp(0.176)for C2201 subjects). - Notes: Specific scope because the contrast is tied to the Novartis ligelizumab CSU development program. Subject-level / time-fixed; set once from the trial identifier on each subject record. The C2201 effect was retained in the final model because the residual unexplained CL/F differed between C2201 and the other studies after accounting for body weight, IgE, ADA, and disease-state covariates.
STUDY_LBSL (canonical for early-phase belimumab LBSL01 / LBSL02 study indicator)
- Description: 1 = subject enrolled in study LBSL01 (NCT00657007) or LBSL02 (NCT00071487) – the two early-phase belimumab studies that used a different ELISA-based bioanalytical assay; 0 = any other belimumab study in the Zhou 2021 pooled analysis. Used to switch CL and V1 magnitudes per study group (effectively an assay / early-development PK adjustment).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (later-phase studies using the electrochemiluminescence assay).
-
Source aliases:
-
INDR– used inZhou_2021_belimumab.R(Zhou 2021 Table 2 footnote: study indicator).
-
-
Example models:
Zhou_2021_belimumab.R(multiplicative factors 1.63 on CL and 1.26 on V1 when STUDY_LBSL = 1). -
Notes: Conceptually similar to
STUDY1/PHASE2/ELISA/PHASE1(per-study switches) but specific to the belimumab program. Subject-level (time-fixed); set from the trial identifier on each subject record.
SAMPLE_INTENSIVE (canonical for per-observation sampling-intensity indicator)
- Description: 1 = observation belongs to an intensive (rich, post-dose) PK sampling window; 0 = sparse (pre-dose / steady-state trough) sampling. Per-observation (record-level) indicator used to switch the proportional residual-error magnitude when a source paper estimates separate residual errors for intensive vs sparse sampling phases of the same dataset.
- Units: (binary)
- Type: binary
- Scope: general
- Reference category: 0 (sparse sampling).
- Source aliases: derived per observation from the sampling-design label in the source dataset (intensive 24-h profiles / dense post-dose schedules -> 1; pre-dose troughs and steady-state population samples -> 0).
-
Example models:
Macpherson_2015_rosuvastatin.R(Table 2 final model: 39.4% CV intensive vs 59.5% CV sparse residual error; switched per observation aspropSd <- propSdIntensive * SAMPLE_INTENSIVE + propSdSparse * (1 - SAMPLE_INTENSIVE)). -
Notes: Conceptually similar to
STUDY1/STUDY5/PHASE1/ELISA(per-record switches that select between estimated residual-error magnitudes) but the contrast is sampling design rather than study cohort or bioanalytical assay. The indicator is generally applicable: any pediatric / dense-vs-sparse pooled popPK design that estimates two residual errors can carry it. Within-subject variation is permitted (a single subject can have both intensive and sparse observations, as in the Macpherson 2015 CHARON PK-pilot cohort where 12 subjects had a Day-0 intensive profile followed by 2 years of sparse troughs).
ELISA (canonical for ELISA-vs-ECLIA bioanalytical assay indicator)
- Description: 1 = serum nipocalimab concentration measured by ELISA assay (LLOQ 0.150 ug/mL; studies MOM-M281-001, MOM-M281-007, MOM-M281-010); 0 = measured by ECLIA assay (LLOQ 0.010 ug/mL; studies EDI1001, EDI1002, MOM-M281-004). Used to switch the additive PK residual-error magnitude per assay.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (ECLIA).
- Source aliases: derived per study from the bioanalytical method (Text S2 of Valenzuela 2025).
-
Example models:
Valenzuela_2025_nipocalimab.R. - Notes: Valenzuela 2025 Table 3 reports additive PK residual 0.445 nmol/L (ELISA) vs 0.0342 nmol/L (ECLIA). Assay choice is study-fixed; include as a per-observation indicator so the correct assay residual is applied even when observations from different studies are pooled.
STUDY_M281_004 (canonical for MOM-M281-004 (Vivacity-MG) study indicator)
- Description: 1 = subject enrolled in study MOM-M281-004 (Vivacity-MG; NCT03772587; phase 2 in generalized myasthenia gravis); 0 = any other study in the Valenzuela 2025 pooled analysis. Used to switch the IgG-baseline scaling factor.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (non-Vivacity-MG studies).
-
Source aliases: derived per subject from the trial
identifier (
NCT03772587-> 1, else -> 0). -
Example models:
Valenzuela_2025_nipocalimab.R. -
Notes: Valenzuela 2025 estimated a slightly lower
IgG baseline in MOM-M281-004 participants (baseline scaled by
FRIgG0_M281_004 = 0.777vs. 1 in other studies). Distinct from the disease-state indicator implied bygMG– it is specifically the Vivacity-MG study flag because the IgG baseline factor was only estimated for that study.
STUDY_ABA2_HLA78 (canonical for ABA2 trial HLA 7/8-matched cohort indicator)
- Description: 1 = subject enrolled in the ABA2 hematopoietic-cell-transplant trial (IM101311; NCT01743131) HLA 7/8 (one-allele-mismatched donor) cohort, 0 = any other study in the Takahashi 2023 pooled abatacept population PK analysis (RA/JIA reference and ABA2 8/8 cohort).
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (non-ABA2-7/8 – pooled adult
RA / pediatric JIA cohort, plus ABA2 HLA 8/8 cohort which is flagged
separately by
STUDY_ABA2_HLA88). -
Source aliases: derived per subject from the
trial-cohort identifier (
Cohort = ABA2 7/8-> 1, else -> 0). Takahashi 2023 Supplemental Table 4 names the corresponding thetathetaCohort_CL/thetaCohort_Vc. -
Example models:
Takahashi_2023_abatacept.R(multiplicativeRatiofactors on CL = 0.70 and on Vc = 0.99 vs the RA/JIA reference; values from Takahashi 2023 Supplemental Table 4). -
Notes: Pairs with
STUDY_ABA2_HLA88to reproduce the three-level cohort categorical (RA/JIA, ABA2 7/8, ABA2 8/8) the paper reports as the only retained categorical PK covariate. At most one ofSTUDY_ABA2_HLA78andSTUDY_ABA2_HLA88is 1 per subject; both 0 reproduces the RA/JIA reference. Scope: specific because the contrast is tied to the ABA2-vs-RA/JIA pooling design.
STUDY_ABA2_HLA88 (canonical for ABA2 trial HLA 8/8-matched cohort indicator)
- Description: 1 = subject enrolled in the ABA2 hematopoietic-cell-transplant trial (IM101311; NCT01743131) HLA 8/8 (allele-matched donor) cohort, 0 = any other study in the Takahashi 2023 pooled abatacept population PK analysis (RA/JIA reference and ABA2 7/8 cohort).
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: 0 (non-ABA2-8/8 – pooled adult
RA / pediatric JIA cohort, plus ABA2 HLA 7/8 cohort which is flagged
separately by
STUDY_ABA2_HLA78). -
Source aliases: derived per subject from the
trial-cohort identifier (
Cohort = ABA2 8/8-> 1, else -> 0). Takahashi 2023 Supplemental Table 4 names the corresponding thetathetaCohort_CL/thetaCohort_Vc. -
Example models:
Takahashi_2023_abatacept.R(multiplicativeRatiofactors on CL = 0.91 and on Vc = 1.32 vs the RA/JIA reference; values from Takahashi 2023 Supplemental Table 4). -
Notes: Pairs with
STUDY_ABA2_HLA78. At most one of the two indicators is 1 per subject; both 0 reproduces the RA/JIA reference. Scope: specific.
STUDY_RIV201 (canonical for Tammara 2017 rivipansel phase II SCD study indicator)
- Description: 1 = subject enrolled in the phase II rivipansel study (NCT01119833; “study 201” in the Tammara 2017 integrated population PK analysis; Telen 2015 Blood 125:2656-2664 reports the trial results); 0 = healthy adult volunteers from the three rivipansel phase I studies (studies 101, 102, and 103) pooled into the integrated dataset. Used both as an additive shift on typical clearance (interpreted by the authors as the SCD-hyperfiltration component, ~23%) and to switch the additive / proportional residual-error magnitudes between the two cohorts (Table 1).
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (rivipansel phase I healthy-adult studies 101, 102, and study 103 healthy adults with SCD pooled with phase I per Tammara 2017; the paper labels these as “phase I studies” in the residual-error rows of Table 1).
-
Source aliases: derived per subject from the trial
identifier (
NCT01119833-> 1, else -> 0); the sourceSTUDcolumn in the NONMEM dataset described in Tammara 2017 Table 1 footnote b. -
Example models:
Tammara_2017_rivipansel.R(Table 1: additive effect 0.234 on CL via1 + 0.234 * STUDY_RIV201; selects the cohort-specific additive and proportional residual SDs inmodel()). -
Notes: Specific scope because the contrast is tied
to the rivipansel development program. The Tammara 2017 paper interprets
the 23% CL increment as a putative hyperfiltration effect of SCD; in
simulation use cases targeting the SCD population (the paper’s stated
goal) set
STUDY_RIV201 = 1for every subject. Subject-level / time-fixed; set once from the trial identifier on each subject record.
DLVL (canonical for source-protocol dose-level integer indicator)
-
Description: Integer dose-level / protocol-arm
indicator carried per subject in the DDMODEL00000281 lidocaine bundle’s
simulated dataset (
Simulated_Lid_B04_ddmore.csv). Values 1-4 (or higher) flag distinct study-protocol dose / regimen tiers; theNA_NA_lidocaine.Rmodel binarises asDLVL > 2to switch the typical-value baselines for both the GX elimination rate constant K30 and the lidocaine apparent central volume V1 between a “low” (DLVL <= 2) and a “high” (DLVL > 2) regimen. - Units: (integer-coded categorical)
- Type: categorical
- Scope: specific
-
Reference category: the binary form
DLVL <= 2is the reference (THETA(4) for K30 and THETA(14) for V1 in the source.ctl);DLVL > 2selects the higher-exposure regimen (THETA(5) and THETA(15) respectively). -
Source aliases:
DLVL– the column header used in the DDMORE bundle’s.ctl$INPUTand the Simulated_Lid_B04_ddmore.csv data file. -
Example models:
NA_NA_lidocaine.R(DDMODEL00000281; binary derivationDLVL_HIGH = as.integer(DLVL > 2)on K30 base and V1 base). -
Notes: Specific scope because the integer-coded
dose / regimen tiers are paper-specific to the lidocaine BAST.dat
(“4-cRUN249”) study and the linked publication is not on disk for this
extraction. The binary threshold
> 2reproduces the source.ctllineIF(DLVL.GT.2)P1=0. If a future model needs a different dose-level binarisation or a continuous treatment, register a distinct canonical name rather than overloadingDLVL.
S1A2 (canonical for source-protocol CYP1A2 substrate / co-medication categorical indicator)
-
Description: Categorical CYP1A2-modifying
co-medication / phenotype indicator carried per subject in the
DDMODEL00000281 lidocaine bundle’s simulated dataset. Integer code; in
the
NA_NA_lidocaine.Rmodel the valueS1A2 == 3selects the “CYP1A2 inducer present” sub-cohort (lidocaine N-deethylation to MEGX is CYP1A2-mediated, so the modifier acts on the GX elimination rate constant K30 in the source’s parameterisation). Other integer codes (0, 1, 2) are pooled into the reference. - Units: (integer-coded categorical)
- Type: categorical
- Scope: specific
-
Reference category:
S1A2 != 3(i.e., values 0, 1, 2) – pooled into the reference. -
Source aliases:
S1A2– the column header used in the DDMORE bundle’s.ctl$INPUTand the simulated dataset, with sibling columnsD1A2andH1A2carried in the data file but dropped via=DROPin the source.ctl. -
Example models:
NA_NA_lidocaine.R(DDMODEL00000281; binary derivationS1A2_IND = as.integer(S1A2 == 3)on the GX elimination rate constant K30). -
Notes: Specific scope because the integer codes for
S1A2are paper-specific to the lidocaine BAST.dat study and the linked publication is not on disk for this extraction. The exact biological meaning of each integer level (0/1/2/3) is not fully reconstructable from the bundle alone – the natural interpretation, given the column name encodes “CYP1A2” and the model attaches a sizeable positive K30 modifier of +0.853 to the level-3 cohort, is a CYP1A2-induction or smoking / inducer co-medication indicator. Sibling columnsD1A2(donor / inhibitor?) andH1A2(host / inhibitor?) are dropped in the source.ctlso only the level-3 indicator is structurally identifiable from the surviving model code. If a future model needs a richer encoding of CYP1A2 modulation, register a separate canonical (e.g.,CYP1A2_IND) rather than overloadingS1A2.
Occasion / period (IOV)
OCC (canonical for the integer-valued occasion / period column)
-
Description: Integer-valued occasion / period
indicator for inter-occasion-variability (IOV) modelling. Values
1,2, …,Nidentify the occasion to which each observation belongs (typically a dosing visit, study period, or sampling occasion). Time-varying within subject; constant within an occasion. - Units: (count)
- Type: categorical
- Scope: general
-
Reference category: n/a –
OCCis decomposed insidemodel()into mutually-exclusive binary indicators, e.g.,oc1 <- (OCC == 1),oc2 <- (OCC == 2), …, that are then multiplied against per-occasioneta*slots. -
Source aliases:
-
OCC– used inJonsson_2011_ethambutol.R(DDMODEL00000220 NMTRAN$INPUTcolumn; values 1..4).
-
-
Example models:
Jonsson_2011_ethambutol.R(4-occasion IOV on log-CL;cl <- exp(lcl + etalcl + oc1 * etalcl_oc1 + oc2 * etalcl_oc2 + oc3 * etalcl_oc3 + oc4 * etalcl_oc4) * (WT/50)^0.75, where eachetalcl_oc<k>is a separate~ fix(0.127)after the first to encode NONMEM$OMEGA BLOCK(1) SAME),Aregbe_2012_alvespimycin.R(5-occasion BOV on Q2 and V1),Oosten_2016_fentanyl.R(10-occasion IOV on transdermal Ka; onlyOCC >= 1records carry IOV, sc / non-transdermal records passOCC = 0so all indicators zero out). -
Notes:
OCCis the recommended canonical for new IOV-using models – the binaryooc1..oocNindicators below remain canonical for legacy / pre-existing models that ship the data already-decomposed. Ratified canonically on 2026-05-06.
ooc1, ooc2, ooc3, ooc4 (canonical for mutually-exclusive crossover occasion indicators)
- Description: Mutually exclusive occasion indicators for a crossover / multi-period design. Exactly one is 1 per observation.
- Units: (binary)
- Type: binary
- Scope: specific
-
Example models:
Xie_2019_agomelatine.R. -
Notes: Lower case preserved from source file.
Pre-existing legacy form; new models should prefer the integer-valued
OCCcanonical above and decompose into binary indicators insidemodel().
DAY14 (canonical for day-14-post-treatment-initiation landmark indicator)
- Description: Binary within-subject landmark indicator: 1 = the observation falls on or after day 14 of treatment (post-nutritional-rehabilitation steady state in the Archary 2019 / MATCH trial of severely malnourished HIV-infected children), 0 = the observation falls before day 14 (acute / pre-rehabilitation baseline; e.g., day 1 of antiretroviral treatment). Time-varying within subject – gates a step change in typical-value PK parameters that the source paper attributes to nutritional recovery + auto-induction of hepatic metabolism over the first ~2 weeks of treatment.
- Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (day 1 / pre-rehabilitation).
- Source aliases: none known; Archary 2019 reports the day-1-vs-day-14 contrast directly in the parameter table (separate typical-value rows for “day 1” and “day 14”).
-
Example models:
-
Archary_2019_abacavir.R(multiplicative additive shift on CL/F:cl <- exp(lcl + etalcl) * (WT/7)^0.75 * (1 + e_day14_cl * DAY14)withe_day14_cl = 0.760, encoding the day-1 typical CL/F = 3.33 -> day-14 CL/F = 5.86 L/h per 7 kg step). -
Archary_2019_lamivudine.R(multiplicative additive shift on ka:ka <- exp(lka) * (1 + e_day14_ka * DAY14) * exp(etalka)withe_day14_ka = 0.133, encoding the day-1 typical ka = 0.30 -> day-14 ka = 0.34 /h step).
-
-
Notes: Specific scope because the day-14 cutoff is
tied to the MATCH-trial nutritional-rehabilitation timeline (severely
malnourished children, two-week re-feeding window per WHO guidelines);
future studies that test a similar within-subject step change with a
different cutoff (day 7, day 28, etc.) should register a new canonical
(e.g.,
DAY28). Distinct fromOCCandooc<n>(which decompose multi-occasion sampling for IOV), fromTRT_PHASE(which gates active-vs-baseline study-phase contributions), and fromEARLY_ART(which is a between-subject randomization-arm indicator, not a within-subject landmark). Data assemblers can deriveDAY14 = as.integer(time_post_treatment_start_days >= 14)for a regularly-sampled multi-day study. Ratified canonically on 2026-05-08.
CYCLE (canonical for dose-number / treatment-cycle counter)
- Description: Dose-number / treatment-cycle counter (1 = first dose or cycle, 2 = second, …). Integer count, time-varying across a multi-dose / multi-cycle treatment course, incremented at each new administration.
- Units: (count)
- Type: count
- Scope: general
-
Reference category: n/a – used either with a
power-covariate form
CYCLE^Fm(Fm typically negative) to capture cycle-over-cycle decline in a derived quantity such as ADC-to-payload conversion fraction (Li 2017 brentuximab vedotin), or with a piecewise indicatorCYCLE == 1 vs CYCLE >= 2to capture a step change in PK between the first and subsequent administrations (Hong 2025 datopotamab deruxtecan; Huynh 2026 VRC07-523LS). -
Source aliases:
CYCLE– used inLi_2017_brentuximab.R,Hong_2025_datopotamab.R,Lu_2022_patritumab.R, andHuynh_2026_VRC07523LS.Rwith the same canonical name. -
Example models:
-
Li_2017_brentuximab.R(exponent on the fraction of ADC that converts to MMAE by proteolytic degradation, Fm = -0.261, to reflect tumor-burden reduction across successive treatment cycles). -
Hong_2025_datopotamab.R(cycle-1 vs cycle-2+ piecewise scaling Factor1 = 0.696 on the DAR equation that drives DXd formation rate from total Dato-DXd elimination). -
Lu_2022_patritumab.R(cycle-1 vs cycle-2+ piecewise scaling Factor1 = theta = 0.648 on the payload-to-intact-drug ratio PIR that scales DXd release rate from intact ADC).
-
-
Notes: Must be >= 1 throughout
(
CYCLE^Fmis undefined at 0; the piecewise form requiresCYCLEto be a positive integer at every observation row). Distinct fromooc<n>binary-occasion indicators:CYCLEis an integer count, not a mutually-exclusive set of indicator columns. Data-assembly helper: setCYCLE = floor((TIME - TIME_FIRST_DOSE) / cycle_length_days) + 1for a fixed-interval dosing regimen.
T_ENTRY (canonical for per-subject study-entry time on the model integration axis)
-
Description: Per-subject study-entry time,
expressed on the same integration-time axis the model is solved on
(years for AD disease-progression models). The dataset’s
TIMEcolumn carries the integration-time variable (withTIME = 0the integration origin, typically well before any subject’s disease onset);T_ENTRYrecords, per subject, the integration-time value at which the subject’s first observation falls. Used in models whose pre-study and post-study time semantics differ – e.g., a disease-progression activation function defined on global disease time plus a placebo-effect term defined on time-since-study-entry need access to both reference clocks within the samemodel()block. -
Units: year (or whatever time unit the model’s
units$timefield declares) - Type: continuous
- Scope: specific
-
Reference category: n/a –
T_ENTRYis a per-subject time-offset covariate that anchors the time-since-study-entry clock used by post-entry-only model terms (placebo / learning effects, study-design transient drops). - Source aliases: none standardized; this canonical originates with the Delor 2013 AD disease-progression extraction. The source paper’s NONMEM dataset handles the global / study-time split internally (the dataset is staged with TIME = study time and the model uses a fixed offset to align with DOT); when porting to rxode2 / nlmixr2 the per-subject offset is exposed as a covariate so simulation event tables can carry it explicitly.
-
Example models:
Delor_2013_alzheimer.R(placebo-term clock:t_pl_raw <- time - T_ENTRY; post_entry <- t_pl_raw > 0; placebo_term <- pl_indiv * (1 - exp(-kpl_indiv * t_pl_raw * post_entry)) * post_entry). -
Notes: Scope: specific because the variable is
paper-domain-bound – it only makes sense for models that operate on a
global disease-time axis distinct from per-subject study-entry timing.
For a typical-value reproduction the user supplies
T_ENTRYper subject in the simulation event table; a reasonable construction isT_ENTRY = DOT_individual + a few years of established diseaseso the patient is observed during disease progression (see the Delor 2013 vignette for the construction used to reproduce Figures 2-4). Ratified canonically on 2026-05-16 alongside the Delor 2013 extraction. ## Mixture / latent-class indicators
MIX_PDI (canonical for binary mixture-model class indicator: PDI inhibition-of-synthesis vs PDS stimulation-of-elimination thrombocytopenia mechanism)
-
Description: Per-subject latent mixture-model class
indicator from the Tsuji 2017 linezolid platelet PKPD model. 1 = subject
classified to the PDI mechanism (linezolid inhibits platelet synthesis
via a linear effect on the proliferation rate of the formation
compartment); 0 = subject classified to the PDS mechanism (linezolid
stimulates platelet elimination via an Emax effect on the
circulating-compartment first-order loss rate). Not a measured clinical
covariate – the mixture assignment is a posterior class assignment from
the NONMEM
$MIXTUREblock (Tsuji 2017 Equation 8 and Methods, “Mixture model” subsection). - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (PDS = stimulation of elimination; the 3% minority class in the source cohort).
-
Source aliases:
MIXTURE– NONMEM$MIXTUREblock class index in the Tsuji 2017 estimation run (component 1 = PDI, component 2 = PDS); the binary indicator isMIX_PDI = as.integer(MIXTURE == 1). -
Example models:
Tsuji_2017_linezolid.R(gates the drug-effect term that enters the platelet ODE chain:MIX_PDI = 1activatesSLOPE * Ccon the formation rate,MIX_PDI = 0activatesSMAX * Cc / (SC50 + Cc)on the circulating-compartment elimination rate). -
Notes: The population probability of
MIX_PDI = 1is the estimated population mixture fractionFPOP_inhibit = 0.969(Tsuji 2017 Table 2; 95% CI 0.867-1.00, 78/80 patients in the source dataset). For typical-value simulation setMIX_PDI = 1(dominant clinical phenotype, slower 2-week nadir; Figure 5 left panel) orMIX_PDI = 0(rare immune-mediated-like phenotype, faster 2-day nadir; Figure 5 right panel). For population simulation, drawMIX_PDI ~ Bernoulli(0.969)per subject. Scope: specific because the binary semantics are tied to Tsuji 2017’s two-mechanism platelet model; mixture indicators from future papers that share the same biological dichotomy may extend this entry, but mixture indicators from unrelated dichotomies should register a new canonical name (e.g.,MIX_FAST_ELIMfor a fast/slow eliminator mixture).
MIX_LARGE_BASE (canonical for binary mixture-model class indicator: larger-baseline vs smaller-baseline tumor-lesion subpopulation)
-
Description: Per-subject latent mixture-model class
indicator from the Schindler 2017 imatinib-GIST liver-metastases joint
tumor-dynamics model. 1 = subject classified to the larger-baseline
subpopulation (typical lesion 1 baseline 76.6 mm MTD / 161 mL Vactual /
187 mL Vellipsoid; estimated population probability Ppop1 = 0.348 in
Schindler 2017 Table 2 row ‘Ppop1’); 0 = subject classified to the
smaller-baseline subpopulation (typical lesion 1 baseline 20.9 mm MTD /
3.45 mL Vactual / 3.93 mL Vellipsoid). Not a measured clinical covariate
– the mixture assignment is the per-subject latent-class index from a
NONMEM
$MIXTUREblock describing the observed bimodal distribution of baseline tumor sizes (Schindler 2017 Methods, “Maximum transaxial diameter, actual volumes, and ellipsoidal volume models” subsection: “Semiparametric distributions and mixture models were investigated to describe the observed bimodal distribution in baseline MTD, Vactual, and Vellipsoid”). One assignment per subject is shared across the three size models (MTD, Vactual, Vellipsoid) and across the subject’s up-to-two lesions; density (D0) is not mixture-indexed. - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (smaller-baseline subpopulation; majority class at 65.2 % of the source cohort).
-
Source aliases:
MIXTURE– NONMEM$MIXTUREblock class index in the Schindler 2017 estimation run (component 1 = larger-baseline subpop with P = 0.348, component 2 = smaller-baseline subpop); the binary indicator isMIX_LARGE_BASE = as.integer(MIXTURE == 1). -
Example models:
Schindler_2017_imatinib.R(gates the per-lesion typical baselinesS0_<metric>_l<k>_typbetween subpopulation-1 and subpopulation-2 anchors, and selects between subpopulation-specific IIV etasetalS0_<metric>_pop1andetalS0_<metric>_pop2whose log-scale variances differ across mixture classes). -
Notes: Schindler 2017 Methods reports 22 % of
patients with one target lesion vs 39 % of patients with two target
lesions were assigned to subpopulation 1, suggesting the larger-baseline
class is enriched among multi-lesion patients. For typical-value
simulation set
MIX_LARGE_BASE = 1to reproduce the larger-baseline phenotype (dominates Figure 2 right panel typical-individual trajectory) orMIX_LARGE_BASE = 0to reproduce the smaller-baseline phenotype. For population simulation, drawMIX_LARGE_BASE ~ Bernoulli(0.348)per subject. Scope: specific because the binary semantics are tied to Schindler 2017’s two-class baseline-tumor-size mixture; future tumor-burden mixture models that share the same large-vs-small baseline dichotomy may extend this entry, while mixture indicators from unrelated dichotomies (e.g., fast / slow tumor-growth phenotypes) should register a new canonical name (MIX_FAST_GROW, etc.). Ratified canonically on 2026-05-18 alongside the Schindler 2017 imatinib extraction.
Preclinical experimental conditions
SEIZURE_ACUTE (canonical for acute focal-seizure-activity indicator in preclinical brain-distribution studies)
-
Description: 1 = the animal is undergoing acute
focal seizure activity (typically chemoconvulsant-induced, e.g.,
intrahippocampal pilocarpine perfusion in a microdialysis design) during
the modelled observation window; 0 = no acute seizure activity. Captures
the experimental flag used by preclinical BBB / biophase-distribution PK
studies to test whether seizure-driven changes in blood-brain barrier
permeability or brain redistribution alter brain PK parameters relative
to non-seizing control animals. Time-fixed per animal in
single-condition allocations (each rat in one arm); a time-varying form
is permitted when a study induces transient seizures during a sampling
window and the operator chooses to gate the covariate only over the
seizure interval (document in
covariateData[[SEIZURE_ACUTE]]$notes). - Units: (binary)
- Type: binary
- Scope: specific
- Reference category: 0 (no acute seizure activity; non-seizing control animal).
-
Source aliases:
-
A(values inverted: sourceA = 1means non-seizing control andA = 0means seizing, per Clinckers 2008 Methods ‘Both A and B were set to 1 in control animals; to 0 and 1, respectively, in seizing animals’; the canonical encoding flips the polarity viaSEIZURE_ACUTE = 1 - A) – used inClinckers_2008_MHD_rat.R.
-
-
Example models:
Clinckers_2008_MHD_rat.R(mutually-exclusive selection of the seizure-specific biophase volume V3b in place of the control V3a inside the V3 expression of the one-compartment-plus-biophase model for MHD in male Wistar rats;v3 = exp(lv3a + etalv3a) * (1 - SEIZURE_ACUTE) * (1 - EFFLUX_INHIB) + exp(lv3b) * SEIZURE_ACUTE * (1 - EFFLUX_INHIB) + exp(lv3c + etalv3c) * (1 - SEIZURE_ACUTE) * EFFLUX_INHIB). -
Notes: Specific scope because the indicator’s
semantics are tied to acute, induced focal seizure activity in
preclinical BBB / biophase PK designs; clinical-trial canonicals for
epilepsy (
CONMED_AED,CONMED_EIAED,PDVprevious-period seizure count) describe distinct concepts and should not be conflated. Future preclinical studies that test a similar seizure-vs-non-seizure contrast should extend the example list; promote to general scope once a second model legitimately ratifies the name. Mutually exclusive withEFFLUX_INHIBin Clinckers 2008 (each rat is allocated to exactly one of {control, seizure, efflux-inhibition}); the model() encoding uses(1 - SEIZURE_ACUTE) * (1 - EFFLUX_INHIB)as the control multiplier, so any data record that asserts both flags as 1 would zero out the V3a term – data assemblers should preserve mutual exclusivity unless a model is explicitly designed for the cross-condition case. Ratified canonically on 2026-05-16 alongside the Clinckers 2008 extraction.
EFFLUX_INHIB (canonical for local efflux-transporter-inhibitor co-perfusion indicator in preclinical brain-distribution studies)
- Description: 1 = the animal’s brain microdialysis probe is co-perfused with an efflux-transporter inhibitor (e.g., verapamil 5 mM for P-glycoprotein / MDT blockade in Clinckers 2008) at the site of brain sampling, 0 = no local efflux-transporter inhibitor co-perfusion. Captures the experimental flag used by preclinical BBB / biophase-distribution PK studies to test whether local pharmacological blockade of brain efflux transporters alters brain PK parameters relative to non-blocked control animals. Time-fixed per animal in single-condition allocations.
- Units: (binary)
- Type: binary
- Scope: specific
-
Reference category: n/a –
T_ENTRYis a per-subject time-offset covariate that anchors the time-since-study-entry clock used by post-entry-only model terms (placebo / learning effects, study-design transient drops). - Source aliases: none standardized; this canonical originates with the Delor 2013 AD disease-progression extraction. The source paper’s NONMEM dataset handles the global / study-time split internally (the dataset is staged with TIME = study time and the model uses a fixed offset to align with DOT); when porting to rxode2 / nlmixr2 the per-subject offset is exposed as a covariate so simulation event tables can carry it explicitly.
-
Example models:
Delor_2013_alzheimer.R,Clinckers_2008_MHD_rat.R(mutually-exclusive selection of the verapamil-specific biophase volume V3c in place of the control V3a inside the V3 expression; see the V3 expression quoted underSEIZURE_ACUTE). -
Notes: Scope: specific because the variable is
paper-domain-bound – it only makes sense for models that operate on a
global disease-time axis distinct from per-subject study-entry timing.
For a typical-value reproduction the user supplies
T_ENTRYper subject in the simulation event table; a reasonable construction isT_ENTRY = DOT_individual + a few years of established diseaseso the patient is observed during disease progression (see the Delor 2013 vignette for the construction used to reproduce Figures 2-4). Ratified canonically on 2026-05-16 alongside the Delor 2013 extraction.