From Crime Scene to Clinical Trial: Mastering Transfer and Persistence for Activity-Level Predictions

Jaxon Cox Nov 27, 2025

Abstract

This article provides a comprehensive framework for researchers and drug development professionals on the critical principles of transfer and persistence and their application to activity-level evaluations. It explores the foundational science behind how evidence—from DNA on forensic samples to drugs in the human body—transfers, persists, and is recovered. The content delves into methodological applications for designing robust experiments, troubleshooting common barriers to adoption, and validating findings through fit-for-purpose models and comparative analysis. By synthesizing knowledge from forensic science and pharmacology, this guide aims to enhance the credibility and utility of activity-level assessments in both investigative and clinical contexts.

The Core Principles: Understanding Transfer, Persistence, and Activity-Level Propositions

In forensic science, an activity-level proposition is a framework used to evaluate how a specific piece of trace evidence, such as DNA, came to be in a particular location. It addresses questions of activity rather than mere source, focusing on the mechanisms of transfer, persistence, and prevalence of background evidence [1] [2]. This represents a significant evolution in forensic practice, shifting the fundamental question from the traditional "Whose DNA is this?" to the more forensically relevant "How did it get there?" [1]. This shift is particularly crucial in the era of advanced DNA profiling technology capable of producing results from minute quantities of trace material, where the issue of source is often uncontested, but the explanation for its presence is central to the case [1].

The hierarchy of propositions in forensic science exists on a continuum from sub-source to source to activity level. While sub-source propositions concern only the DNA profile itself (e.g., "This profile matches the suspect"), and source propositions concern the biological material (e.g., "The bloodstain came from the suspect"), activity-level propositions directly address the actions and activities that allegedly occurred [2] [3]. For example, suitable activity-level propositions might be: "The suspect punched the victim" versus "The suspect shook hands with the victim the day before" [1]. The core distinction is that activity-level evaluations require specialist knowledge about transfer, persistence, and background levels of DNA, whereas source-level evaluations primarily require data on the rarity of the DNA profile in a population [1] [3].

Theoretical Framework and Formulation

Core Components of Activity-Level Evaluation

The evaluation of biological traces considering activity-level propositions rests on three fundamental scientific principles that extend beyond simple profile rarity [1]:

  • Transfer: The mechanism by which DNA is deposited onto a surface or person during an activity. This includes primary transfer (direct contact) and secondary transfer (indirect contact via an intermediate surface).
  • Persistence: The duration for which deposited DNA remains detectable on a surface despite environmental factors and subsequent contacts.
  • Background: The presence and prevalence of DNA from unknown individuals on surfaces or items in the environment, which establishes the context for evaluating the significance of the recovered DNA.

A formal approach to evaluating findings given activity-level propositions uses a likelihood ratio (LR) framework to assess the probability of the evidence under each of the competing propositions [2]. The scientist must ask: "What are the expectations if each of the propositions is true?" and "What data are available to assist in the evaluation of the results given the propositions?" [2]. The value of evidence calculated for a DNA profile at the sub-source level cannot be carried over to activity-level evaluations, as these calculations are separate and address fundamentally different questions [2].
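Once the conditional probabilities have been assigned, the LR computation itself is simple. A minimal sketch, with all probabilities hypothetical placeholders rather than casework data:

```python
# Likelihood ratio for findings E given two activity-level propositions.
# All probabilities below are hypothetical, not derived from casework data.

def likelihood_ratio(p_e_given_hp: float, p_e_given_hd: float) -> float:
    """LR = P(E | Hp) / P(E | Hd)."""
    return p_e_given_hp / p_e_given_hd

# E:  suspect's DNA recovered from the victim's shirt collar.
# Hp: the suspect punched the victim (direct transfer expected).
# Hd: the person who punched the victim shook hands with the suspect
#     earlier (secondary transfer required).
p_e_hp = 0.80   # transfer + persistence + recovery, given Hp
p_e_hd = 0.05   # secondary transfer + persistence + recovery, given Hd

lr = likelihood_ratio(p_e_hp, p_e_hd)
print(f"LR = {lr:.0f}")  # prints LR = 16: findings support Hp over Hd
```

An LR above 1 means the findings are more probable under the prosecution's proposition; the magnitude, not the DNA profile's rarity, carries the evidential weight at activity level.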

Formulating Propositions

Effective activity-level propositions must be balanced and address the same issue in the case [2]. They should ideally be set before knowledge of the results and avoid using the word 'transfer' in their formulation, as propositions are assessed by the Court, while DNA transfer is a factor scientists use for interpretation [2]. Properly formulated propositions help the court address the fundamental question of "How did an individual's cell material get there?" [2].

Table 1: Examples of Activity-Level Proposition Pairs

| Scenario | Proposition 1 (Prosecution) | Proposition 2 (Defense) |
| --- | --- | --- |
| Alleged Assault | The suspect punched the victim. | The person who punched the victim shook hands with the suspect earlier. |
| Alleged Sexual Activity | The suspect had sex with the complainant. | The suspect and complainant attended the same party and had social contact only. |
| Alleged Burglary | The suspect handled the broken window. | The suspect was present at the scene days before for legitimate reasons. |

Experimental Protocols for Activity-Level Research

General Framework for Transfer and Persistence Studies

Objective: To generate quantitative data on DNA transfer and persistence for specific activities to inform likelihood ratio calculations in casework.

Experimental Design Considerations:

  • Define Activity Parameters: Precisely define the activity to be studied (e.g., grabbing, touching, punching). Document variables such as force, duration, surface type, and environmental conditions [1].
  • Account for Donor Characteristics: Consider the "shedder status" of donors, which can significantly impact the amount of DNA transferred. Incorporate this variability into the experimental design [1].
  • Simulate Realistic Conditions: While controlled laboratory conditions are necessary, experiments should aim to mimic real-life scenarios as closely as possible. The uncertainty from unmeasurable aspects of real activities will be reflected in the spread of the obtained data [1].
  • Incorporate Sensitivity Analysis: If important factors have a considerable impact on the outcome but their actual states are unknown, they can be incorporated into the logical framework by considering all possible states, weighted by probabilities informed by controlled experiments [1].
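The final point, weighting the possible states of an unknown factor by their probabilities, amounts to the law of total probability. A minimal sketch, with hypothetical shedder-status probabilities:

```python
# Marginalize the probability of the findings over an unmeasured factor
# (here, the donor's shedder status). All values are hypothetical.

# P(shedder status), informed by controlled experiments.
p_status = {"high": 0.2, "medium": 0.5, "low": 0.3}

# P(DNA result | activity, shedder status), from transfer experiments.
p_result_given_status = {"high": 0.95, "medium": 0.70, "low": 0.30}

# Law of total probability: sum over the possible states.
p_result = sum(p_status[s] * p_result_given_status[s] for s in p_status)
print(f"P(result | activity) = {p_result:.3f}")  # 0.2*0.95 + 0.5*0.70 + 0.3*0.30
```

The spread of the experimental data thus enters the evaluation directly, rather than forcing the scientist to pick a single "best guess" state for the unknown factor.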

Protocol 1: Direct Transfer Study (Hand Contact)

Purpose: To quantify the amount and quality of DNA transferred through direct hand contact to various surfaces.

Materials:

  • Research Reagent Solutions & Essential Materials
    • DNA-free surfaces (e.g., glass, plastic, fabric): Substrates for DNA deposition.
    • Sterile swabs and collection kits: For sample collection from surfaces.
    • Quantitative PCR (qPCR) kit: For quantifying total human DNA.
    • DNA profiling kits (e.g., STR multiplex kits): For generating DNA profiles.
    • Statistical analysis software: For data analysis and LR calculation.

Methodology:

  • Donor Selection: Recruit donors with varying shedder status (pre-determined through a standardized test).
  • Pre-activity Handwashing: Standardize the hand hygiene protocol before the experiment (e.g., no washing, washing with specific products).
  • Contact Simulation: Participants make firm contact with designated DNA-free surfaces for a predetermined duration (e.g., 5, 10, 30 seconds).
  • Sample Collection: Immediately swab the contacted surface area using a standardized technique and pressure.
  • Analysis:
    • Extract DNA from swabs.
    • Quantify the total human DNA yield using qPCR.
    • Subject samples to DNA profiling and record the percentage of the donor's profile obtained.
  • Data Recording: Record the DNA quantity, profile quality, and donor shedder status for each trial.

Protocol 2: Persistence and Background Study

Purpose: To evaluate the persistence of deposited DNA over time and determine the background levels of DNA on commonly encountered surfaces.

Materials: (As in Protocol 1, with the addition of environmental chambers for controlled aging, if applicable).

Methodology:

  • Initial Deposition: Follow Protocol 1 to deposit DNA on surfaces under controlled conditions.
  • Aging Process: Store the deposited samples under controlled environmental conditions (temperature, humidity) that reflect common casework scenarios.
  • Time-Series Sampling: Collect samples from the surfaces at predetermined time intervals (e.g., 1 hour, 6 hours, 24 hours, 1 week).
  • Background Sampling: Sample identical, unused surfaces from the same environment to establish baseline background DNA levels and profiles.
  • Analysis: Process all samples as in Protocol 1 to track the degradation of the donor's profile and the potential emergence of background profiles over time.

Data Interpretation and Reporting

Quantitative Data from Experimental Studies

The data collected from the experimental protocols can be synthesized to inform the probabilities required for a likelihood ratio calculation. The following table provides a hypothetical example of how such data might be structured.

Table 2: Example Data Structure from Transfer and Persistence Experiments

| Activity | Surface Type | Mean DNA Quantity (ng) | Probability of Full Profile | Probability of Detectable DNA after 24 h | Key Influencing Factors |
| --- | --- | --- | --- | --- | --- |
| Firm Grab (10 s) | Cotton Fabric | 0.15 | 0.85 | 0.45 | Shedder status, fabric weave |
| Firm Grab (10 s) | Smooth Plastic | 0.45 | 0.95 | 0.70 | Shedder status, surface porosity |
| Brief Touch (2 s) | Smooth Plastic | 0.08 | 0.60 | 0.30 | Shedder status dominant |
| Punch | Cotton Fabric | 1.20 | 0.99 | 0.80 | Force, skin abrasion |
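Values of this kind feed directly into a likelihood-ratio assignment. A minimal sketch using two hypothetical entries from Table 2:

```python
# LR for recovering a full donor profile from cotton fabric, using the
# hypothetical probabilities from Table 2 (not real experimental data).
p_full_given_punch = 0.99       # Table 2, "Punch / Cotton Fabric"
p_full_given_firm_grab = 0.85   # Table 2, "Firm Grab (10 s) / Cotton Fabric"

lr = p_full_given_punch / p_full_given_firm_grab
print(f"LR = {lr:.2f}")
# LR close to 1: a full profile barely discriminates between the two
# activities, because both transfer DNA efficiently to cotton.
```

This illustrates why profile quality alone is often uninformative at activity level: the DNA quantity and the probability of detection after a time interval may discriminate far better than profile completeness.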

The Role of Bayesian Networks

Bayesian Networks (BNs) are a powerful tool for evaluating findings given activity-level propositions because they force the consideration of all relevant possibilities and their probabilistic relationships in a logical way [2]. They provide a visual and mathematical framework to combine case-specific information (e.g., the alleged activity) with population-level data (e.g., transfer probabilities) to calculate a likelihood ratio.

[Diagram: Bayesian network for activity evaluation — an Activity node with arcs to Transfer and Persistence nodes; Transfer, Persistence, and a Background node each with arcs into a DNA_Result node.]

Diagram 1: BN for Activity Evaluation

The above diagram illustrates a simplified BN for activity-level evaluation. The Activity node (e.g., "Did the suspect punch the victim?") influences the intermediate Transfer and Persistence nodes. These, along with the Background node, collectively determine the probability of the observed DNA Result. The BN allows for the calculation of the likelihood ratio by computing the probability of the evidence under both the prosecution's and defense's propositions for the Activity node.
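A network of this shape can be evaluated by brute-force enumeration of the hidden nodes. A toy sketch in Python, with all conditional probabilities hypothetical:

```python
from itertools import product

# Toy Bayesian network matching Diagram 1. All probability values are
# hypothetical placeholders, not casework data. Nodes are binary.

P_BACKGROUND = 0.05  # P(matching background DNA present), activity-independent

def p_dna_result(activity: bool) -> float:
    """P(donor DNA detected | Activity) by enumerating the hidden nodes."""
    p_transfer = 0.9 if activity else 0.1   # P(Transfer | Activity)
    p_persist = 0.7 if activity else 0.3    # P(Persistence | Activity)
    total = 0.0
    for transfer, persist, background in product([True, False], repeat=3):
        # The profile is observed if DNA transferred and persisted,
        # or if matching background DNA was already present.
        if not ((transfer and persist) or background):
            continue
        total += ((p_transfer if transfer else 1 - p_transfer)
                  * (p_persist if persist else 1 - p_persist)
                  * (P_BACKGROUND if background else 1 - P_BACKGROUND))
    return total

lr = p_dna_result(True) / p_dna_result(False)
print(f"P(E|Hp) = {p_dna_result(True):.4f}, "
      f"P(E|Hd) = {p_dna_result(False):.4f}, LR = {lr:.1f}")
```

Dedicated BN software handles larger networks efficiently, but the enumeration above makes the logic explicit: the LR is simply the ratio of the marginal probability of the findings under each state of the Activity node.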

Implementation Framework

Building a Knowledge Base

A critical barrier to the global adoption of activity-level reporting is the perceived lack of robust and impartial data to inform probabilities [4]. Overcoming this requires a concerted effort to build community-wide knowledge bases through controlled experiments [1] [3]. Analysts should ideally have and collect data that are relevant to the case in question [2]. The design of experiments to form these knowledge bases should:

  • Be systematic and hypothesis-driven.
  • Explore the impact of multiple variables (e.g., shedder status, activity duration, surface type).
  • Be conducted under controlled but forensically relevant conditions.
  • Be published and shared to create a robust, accessible body of data for the forensic community.

The Scientist's Toolkit for Activity-Level Evaluation

Successfully implementing activity-level evaluation requires both conceptual tools and practical resources. The following table details key components of this toolkit.

Table 3: The Scientist's Toolkit for Activity-Level Evaluation

| Tool Category | Item | Function & Explanation |
| --- | --- | --- |
| Conceptual Framework | Hierarchy of Propositions | Guides the scientist to formulate propositions at the correct level (sub-source, source, activity) based on the questions being asked by the court [2] [3]. |
| Interpretation Method | Likelihood Ratio (LR) | Provides a structured, quantitative method to evaluate the strength of the forensic findings given two competing activity-level propositions [2]. |
| Computational Tool | Bayesian Networks | Allows for complex probabilistic modeling of the relationships between activities, transfer mechanisms, persistence, and the final DNA result [2]. |
| Data Foundation | Experimental Knowledge Base | A repository of data from controlled studies on transfer, persistence, and background levels. This is essential for assigning realistic probabilities within the LR framework [1]. |
| Reporting Guideline | ENFSI/ISFG Guidelines | Provide standards for formulating balanced propositions and reporting evaluative conclusions, ensuring scientific rigor and transparency [2] [3]. |

The adoption of a framework for activity-level propositions is not merely a theoretical improvement but a practical necessity for modern forensic science. It addresses the questions that are truly of interest to the court, particularly in cases involving low-level or transferred DNA, where the source may not be disputed, but the activity is [1] [3]. While challenges exist—including the need for more extensive knowledge bases on transfer and persistence, and the requirement for additional training—the forensic community is encouraged to address these challenges directly [1] [4]. By doing so, scientists can provide more focused, useful, and robust contributions to the criminal justice process, ensuring that their conclusions are both scientifically valid and forensically relevant.

The movement of drugs across biological membranes is a fundamental process in pharmacokinetics and pharmacodynamics. Drug transfer refers to the movement of drug molecules across biological membranes by passive or facilitated diffusion and by carrier-mediated transport [5]. These processes determine drug absorption, distribution, and excretion, ultimately influencing therapeutic efficacy and safety. Based on their primary sequence and mechanism, drug transporters can be divided into the ATP-Binding Cassette (ABC), Solute Carrier (SLC), and Solute Carrier Organic Anion (SLCO) superfamilies [6]. Understanding these transfer mechanisms is particularly important for drug development professionals seeking to optimize drug delivery and overcome challenges such as multidrug resistance [7].

The transport of a drug across a membrane depends on multiple factors, including the physicochemical properties of the drug (molecular size, concentration gradient, pKa) and the properties of the membrane (thickness, surface area, permeability) [8]. This article explores the primary, secondary, and complex pathways of drug transfer, providing detailed experimental protocols and analytical frameworks within the context of transfer and persistence considerations for activity-level research.

Fundamental Mechanisms of Drug Transfer

Passive Transfer Pathways

Passive diffusion represents the most fundamental drug transfer mechanism, in which molecules move across membranes down their concentration gradients without protein assistance [6]. In passive transcellular transport, lipophilic drugs with molecular weights typically under 400-500 Da diffuse directly through the lipid bilayer of cells [5]. This pathway is particularly relevant for drugs of abuse such as alcohol, caffeine, and heroin, though uptake by peripheral tissues can sometimes reduce brain delivery [5]. Passive paracellular transport occurs when hydrophilic molecules pass through aqueous pores or intercellular spaces, though tight interendothelial junctions in systems like the blood-brain barrier significantly restrict this route for peptides, polysaccharides, and proteins [5].

The rate of passive diffusion is governed by Fick's First Law of Diffusion: the diffusive flux (J) is proportional to the concentration gradient and inversely proportional to the membrane thickness, J = −D·(dφ/dx) [8]. Lipid solubility remains the most critical factor for passive diffusion, as the body is predominantly composed of cells enveloped in lipid membranes [8].
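Fick's First Law is straightforward to apply numerically. A sketch with illustrative (not measured) parameter values:

```python
def fick_flux(D: float, c_donor: float, c_receiver: float, dx: float) -> float:
    """Diffusive flux J = -D * dC/dx.

    D: diffusion coefficient (cm^2/s); dx: membrane thickness (cm);
    concentrations in mol/cm^3. Positive J means net movement from
    the donor side to the receiver side.
    """
    return -D * (c_receiver - c_donor) / dx

# Illustrative values only, not measured constants:
D = 1e-6         # cm^2/s, small lipophilic molecule in a lipid bilayer
c_donor = 1e-6   # mol/cm^3 on the donor side
c_rec = 0.0      # sink conditions on the receiver side
dx = 5e-7        # cm (~5 nm bilayer)

J = fick_flux(D, c_donor, c_rec, dx)
print(f"J = {J:.2e} mol cm^-2 s^-1")  # positive: net donor -> receiver flux
```

Halving the membrane thickness or doubling the concentration gradient doubles the flux, which is why sink conditions in permeability assays keep transport in the linear regime.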

Active Transfer Pathways

Primary active transport requires direct ATP hydrolysis to move molecules unidirectionally against concentration gradients [6] [5]. ATP-binding cassette (ABC) transporters represent the primary superfamily in this category, including clinically significant transporters such as P-glycoprotein (P-gp), multidrug resistance-associated protein (MRP), and breast cancer resistance protein (BCRP) [5] [7]. These transporters possess nucleotide-binding domains (NBDs) that hydrolyze ATP to provide energy for substrate translocation [6].

Secondary active transport utilizes electrochemical or ion gradients established by primary active transporters rather than directly hydrolyzing ATP [6] [5]. Solute carrier (SLC) transporters belong to this category and can function as antiporters (moving molecules in opposite directions), symporters (moving molecules in the same direction), or uniporters (moving a single species) [6]. The organic anion transporting polypeptide (OATP) family, now classified under the SLCO superfamily, represents an important group of secondary transporters, though their exact mechanisms remain less understood [6].

Table 1: Classification of Major Drug Transporter Superfamilies

| Superfamily | Energy Source | Transport Type | Key Examples | Primary Localization |
| --- | --- | --- | --- | --- |
| ABC Transporters | ATP hydrolysis | Primary active uniport | P-gp (ABCB1), BCRP (ABCG2), MRP (ABCC1) | Intestine, liver, kidney, BBB |
| SLC Transporters | Ion/concentration gradients | Secondary active symport/antiport | OCTs, OATs, PEPTs | Kidney, liver, intestine |
| SLCO Transporters | Not well characterized | Possibly electroneutral exchange or pH-driven | OATP1A2, OATP1B1, OATP1B3 | Liver, brain capillaries |

Quantitative Analysis of Transfer Mechanisms

Table 2: Comparative Analysis of Drug Transfer Mechanisms

| Transfer Mechanism | Energy Requirement | Directionality | Saturability | Substrate Specificity | Representative Substrates |
| --- | --- | --- | --- | --- | --- |
| Passive Transcellular | None | Down gradient | No | Low | Alcohol, caffeine, heroin |
| Passive Paracellular | None | Down gradient | No | Low | Small hydrophilic molecules |
| Primary Active Transport | ATP hydrolysis | Against gradient | Yes | Moderate-high | Chemotherapeutics, HIV protease inhibitors |
| Secondary Active Transport | Ion gradient | Against gradient | Yes | Moderate-high | Neurotransmitters, statins |
| Facilitated Diffusion | None | Down gradient | Yes | High | Glucose, nucleosides |

Quantitative understanding of drug transfer requires consideration of multiple physicochemical parameters. The diffusion coefficient (D) is influenced by solution temperature, viscosity, and molecular size [8]. The lipid-water partition coefficient, determined by the interplay between drug pKa and solution pH, significantly impacts transfer rates for passive mechanisms [8]. For carrier-mediated transport, Michaelis-Menten kinetics describe the saturation behavior, where transport velocity reaches a maximum (V~max~) at high substrate concentrations [6].
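The Michaelis-Menten saturation behavior can be illustrated directly; the kinetic constants below are hypothetical:

```python
def mm_velocity(c: float, vmax: float, km: float) -> float:
    """Michaelis-Menten transport velocity: v = Vmax * C / (Km + C)."""
    return vmax * c / (km + c)

# Hypothetical kinetic constants for a saturable transporter:
vmax = 100.0  # pmol/min/mg protein
km = 5.0      # uM; concentration at which v = Vmax/2

for c in (1.0, 5.0, 50.0, 500.0):  # substrate concentration, uM
    print(f"C = {c:6.1f} uM -> v = {mm_velocity(c, vmax, km):6.1f}")
# Velocity approaches Vmax at high substrate concentration (saturation),
# in contrast to passive diffusion, which scales linearly with C.
```

Saturability is the practical signature that distinguishes carrier-mediated transport from passive diffusion in uptake experiments.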

Recent advances in structural biology through X-ray crystallography and cryo-electron microscopy have revealed detailed mechanisms of these transport processes [6]. The "alternating access" model proposes that transporters undergo conformational changes that expose substrate binding sites to alternating sides of the membrane [6]. Variations of this model include the "rocker-switch" mechanism for SLC transporters and the "ATP switch" model for ABC transporters [6].

Experimental Protocols for Studying Transfer Mechanisms

Protocol 1: In Vitro Membrane Permeability Assay

Objective: To quantify passive transcellular and paracellular drug transfer across cellular barriers.

Materials:

  • Caco-2 cell monolayers (21-day differentiated)
  • Transwell permeable supports (0.4 μm pore size, 1.12 cm² surface area)
  • Modified Krebs-Ringer bicarbonate buffer (pH 7.4)
  • Test compound (10 mM stock solution in DMSO)
  • LC-MS/MS system for analytical quantification

Calculation: Apparent Permeability (P~app~) = (dQ/dt) × (1/(A × C~0~)), where dQ/dt is the transport rate, A is the membrane surface area, and C~0~ is the initial donor concentration.

Procedure:

  • Culture Caco-2 cells on Transwell inserts for 21 days until full differentiation, monitoring transepithelial electrical resistance (TEER) values >300 Ω·cm².
  • Pre-incubate monolayers with transport buffer for 30 minutes at 37°C.
  • Add test compound to donor compartment (apical for A→B transport, basolateral for B→A transport).
  • Collect samples from receiver compartment at 15, 30, 45, 60, and 90 minutes, replacing with fresh buffer.
  • Analyze samples using LC-MS/MS with appropriate calibration standards.
  • Calculate P~app~ values for both directions and determine efflux ratio (P~app~ B→A / P~app~ A→B).

Interpretation: P~app~ < 1×10⁻⁶ cm/s indicates low permeability; P~app~ > 10×10⁻⁶ cm/s indicates high permeability. Efflux ratio >2 suggests active efflux involvement.
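The P~app~ and efflux-ratio calculations in step 6 can be sketched as follows, using illustrative (not measured) Transwell numbers:

```python
def apparent_permeability(dq_dt: float, area: float, c0: float) -> float:
    """Papp = (dQ/dt) / (A * C0), in cm/s.

    dq_dt: transport rate into the receiver compartment (nmol/s)
    area:  membrane surface area (cm^2)
    c0:    initial donor concentration (nmol/cm^3)
    """
    return dq_dt / (area * c0)

# Illustrative Transwell data, not from a real assay:
area = 1.12          # cm^2, as in the Materials list
c0 = 10.0            # nmol/cm^3 (10 uM donor concentration)
papp_ab = apparent_permeability(2.0e-4, area, c0)  # apical -> basolateral
papp_ba = apparent_permeability(6.0e-4, area, c0)  # basolateral -> apical

efflux_ratio = papp_ba / papp_ab
print(f"Papp(A->B) = {papp_ab:.2e} cm/s, efflux ratio = {efflux_ratio:.1f}")
# Papp(A->B) ~ 1.8e-05 cm/s (high permeability); efflux ratio 3.0 > 2
# would suggest active efflux (e.g. P-gp) involvement.
```

Because both directions share A and C~0~, the efflux ratio reduces to the ratio of the two transport rates; asymmetry beyond ~2-fold is the usual trigger for transporter-inhibition follow-up studies.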

Protocol 2: ATP-Dependent Transport Assay

Objective: To characterize primary active transport mediated by ABC transporters.

Materials:

  • Membrane vesicles expressing recombinant human transporters (e.g., P-gp, BCRP, MRP2)
  • Transport assay buffer (50 mM MOPS-Tris, 70 mM KCl, 7.5 mM MgCl₂, pH 7.0)
  • 5 mM ATP (with an ATP-regenerating system) or 5 mM AMP (negative control)
  • Test compound with radiolabeled or fluorescent tag
  • Rapid filtration apparatus with 0.45 μm nitrocellulose filters
  • Scintillation counter or fluorescence plate reader

Procedure:

  • Thaw membrane vesicles on ice and homogenize gently with a Dounce homogenizer.
  • Prepare reaction mixtures containing membrane vesicles (50 μg protein), test compound (10 μM), and assay buffer with or without 5 mM ATP.
  • Incubate at 37°C with shaking at 300 rpm for predetermined time points (1, 2, 5, 10 minutes).
  • Terminate reactions by adding 2 mL ice-cold stop buffer and immediately filter through nitrocellulose membranes.
  • Wash filters three times with 5 mL ice-cold wash buffer.
  • Quantify accumulated compound using appropriate detection methods.
  • Calculate ATP-dependent transport as the difference between ATP-containing and ATP-free conditions.

Interpretation: Significant ATP-dependent accumulation indicates substrate status for the specific transporter. Kinetic parameters (K~m~, V~max~) can be determined through concentration-dependent studies.
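Step 7, taking the difference between ATP-containing and ATP-free conditions, can be sketched as follows (values illustrative, not real assay data):

```python
# ATP-dependent vesicular uptake = uptake(+ATP) - uptake(AMP control).
# All values are illustrative (pmol/mg protein), not real measurements.
uptake_atp = {1: 12.0, 2: 23.5, 5: 54.0, 10: 95.0}  # time (min) -> uptake
uptake_amp = {1: 3.0, 2: 4.1, 5: 6.0, 10: 8.5}      # passive/background

atp_dependent = {t: uptake_atp[t] - uptake_amp[t] for t in uptake_atp}
for t, v in sorted(atp_dependent.items()):
    print(f"{t:2d} min: {v:5.1f} pmol/mg ATP-dependent")
# Roughly linear accumulation over early time points indicates active,
# ATP-coupled transport into the inside-out vesicles.
```

Subtracting the AMP control removes passive binding and diffusion, so only the transporter-mediated component remains for kinetic analysis.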

Protocol 3: Inhibition Studies for DDI Assessment

Objective: To evaluate drug-drug interactions (DDIs) at the transport level.

Materials:

  • Transporter-expressing cell systems (e.g., MDCKII, HEK293)
  • Known transporter substrates (e.g., digoxin for P-gp, metformin for OCTs)
  • Investigational inhibitor compounds
  • Transport buffers at physiologically relevant pH values

Procedure:

  • Culture transporter-expressing cells to appropriate density on multi-well plates.
  • Pre-incubate cells with investigational inhibitor at multiple concentrations (0.1-100 μM) for 30 minutes.
  • Add known transporter substrate at concentration approximating K~m~ value.
  • Incubate for linear uptake period (typically 2-10 minutes).
  • Terminate uptake with ice-cold buffer and quantify substrate accumulation.
  • Calculate percentage inhibition relative to control (without inhibitor).
  • Determine IC~50~ values using non-linear regression analysis.

Interpretation: IC~50~ values < predicted maximum therapeutic concentration suggest clinically relevant DDI potential.
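In practice, IC~50~ values are obtained by non-linear regression (step 7); as a dependency-free sketch, the 50% crossing can also be estimated by log-linear interpolation between the bracketing data points (data illustrative):

```python
import math

def ic50_by_interpolation(conc, activity):
    """Estimate IC50 by log-linear interpolation at 50% remaining activity.

    A simple stand-in for full non-linear regression. conc in uM (ascending),
    activity as % of uninhibited control at each concentration.
    """
    for i in range(len(activity) - 1):
        hi, lo = activity[i], activity[i + 1]
        if hi >= 50.0 >= lo:
            # Interpolate on log-concentration between the bracketing points.
            frac = (hi - 50.0) / (hi - lo)
            log_ic50 = (math.log10(conc[i])
                        + frac * (math.log10(conc[i + 1]) - math.log10(conc[i])))
            return 10 ** log_ic50
    raise ValueError("50% crossing not bracketed by the data")

# Illustrative inhibition data, not real measurements:
conc = [0.1, 0.3, 1.0, 3.0, 10.0, 30.0, 100.0]       # inhibitor, uM
activity = [98.0, 95.0, 82.0, 55.0, 25.0, 9.0, 3.0]  # % of control

print(f"IC50 ~ {ic50_by_interpolation(conc, activity):.2f} uM")
```

The interpolated estimate is useful for a quick screen; a fitted Hill equation remains the appropriate method for reported IC~50~ values.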

Visualization of Transfer Pathways

[Diagram: drug transfer pathways from the extracellular to the intracellular space. Passive pathways: transcellular (lipophilic drugs, small molecules, driven by the concentration gradient) and paracellular (hydrophilic molecules <500 Da, via aqueous pores). Active pathways: primary active transport (ABC substrates, ATP-dependent, against the gradient) and secondary active transport (SLC substrates, ion gradient-driven).]

Diagram 1: Drug Transfer Pathway Classification. This diagram illustrates the major routes for drug transfer across biological membranes, categorized into passive and active mechanisms with their characteristic features.

[Diagram: ABC transport mechanism — drug binding to TMDs → ATP binding to NBDs → conformational change → drug extrusion → ATP hydrolysis → NBD dissociation. SLC transport mechanism — ion gradient establishment → drug and ion co-binding → conformational change → simultaneous translocation → release and reset.]

Diagram 2: Active Transport Mechanism Workflow. This diagram compares the sequential steps in ABC (primary) and SLC (secondary) active transport mechanisms, highlighting their distinct energy-coupling strategies.

Research Reagent Solutions

Table 3: Essential Research Reagents for Transfer Mechanism Studies

| Reagent/Cell System | Function/Application | Key Characteristics | Example Suppliers |
| --- | --- | --- | --- |
| Caco-2 Cell Line | Model for intestinal permeability | Differentiates into enterocyte-like monolayers | ATCC, ECACC |
| MDCKII Transfected Cells | Transporter-specific flux studies | Stable expression of human transporters | Solvo Biotechnology |
| Membrane Vesicles | ATP-dependent transport assays | Inside-out orientation for ABC studies | GenoMembrane, Corning |
| Transwell Permeable Supports | Polarized cell culture for transport | Porous membrane inserts (0.4-3.0 μm) | Corning, Greiner Bio-One |
| Fluorescent Substrates (e.g., Calcein-AM) | P-gp activity screening | Accumulation inversely proportional to activity | Thermo Fisher, Sigma-Aldrich |
| Radiolabeled Compounds (³H, ¹⁴C) | Quantitative transport studies | High sensitivity detection | American Radiolabeled Chemicals |
| Specific Inhibitors (e.g., Verapamil, Ko143) | Transporter phenotyping | Selective for P-gp (Verapamil) and BCRP (Ko143) | Tocris, Sigma-Aldrich |

Advanced Applications in Drug Development

The continuous occurrence of multi-drug resistance in the clinic has heightened focus on transporter-based drug delivery strategies [7]. Nano-formulations represent promising approaches to modulate transporter interactions, either by evading efflux transporters or by targeting specific uptake transporters [7]. These advanced delivery systems can enhance targeted uptake while preventing drug efflux, particularly for chemotherapeutic agents [7].

Tissue-specific transporter expression profiles enable targeted drug delivery approaches. For instance, OATP1B1 and OATP1B3 expressed exclusively in the liver facilitate hepatic targeting, while OATP1A2 in cerebral capillaries mediates brain transport of opioid peptides [7]. Understanding these distribution patterns allows researchers to design drugs with improved tissue selectivity and reduced off-target effects.

Current research focuses on developing nano-agents that target specific transporters to overcome biological barriers. These include lipid-based nanoparticles to bypass intestinal P-gp efflux, polymeric nanoparticles functionalized with transporter substrates to enhance brain delivery, and inorganic nanoparticles designed for renal targeting through OAT transporters [7]. The integration of transporter science with nanomedicine represents a frontier in precision drug delivery.

In activity-level research on transfer and persistence, understanding the longevity and degradation of biological evidence is paramount for accurate forensic reconstruction. Persistence dynamics refers to the complex interplay of factors that influence how long trace evidence, such as DNA, remains detectable and intact on a surface or substrate after a transfer event. For researchers and drug development professionals, quantifying these dynamics is critical for assessing the temporal relevance of evidence recovered from clinical trial materials, drug manufacturing equipment, or contamination incidents. This section provides application notes and experimental protocols to systematically evaluate evidence persistence, focusing on the degradation kinetics of nucleic acids under controlled conditions.

Quantitative Persistence Factors

The longevity of biological evidence is governed by a multitude of chemical, physical, and environmental variables. The following tables summarize key quantitative data on factors affecting nucleic acid persistence, essential for interpreting activity-level propositions in forensic casework.

Table 1: DNA Degradation Timeline Based on Sample Type and Conditions

| Sample Type / Condition | Timeframe of DNA Persistence | Key Factors Influencing Degradation | STR Profiling Success |
| --- | --- | --- | --- |
| Insects' Feces | Up to 48 hours post-death [9] | Activity of bacterial enzymes [9] | Complete STR profile possible [9] |
| Cotton Fabric on Floor | Not specified | Environmental exposure, background DNA [9] | 79% enabled identification [9] |
| Home Floor Surfaces | Not specified | Surface material, foot traffic [9] | 92% allowed DNA profiling [9] |
| Historical Samples | Decades to centuries [9] | Time, temperature, environmental conditions [9] | Possible with mini-STR/SNP analysis [9] |

Table 2: Impact of Environmental and Chemical Factors on Genetic Material

| Factor | Impact on DNA/RNA | Mechanism of Action |
| --- | --- | --- |
| Temperature | Can preserve or rapidly degrade [9] | Influences enzyme and microbial activity; high temperatures accelerate strand breakage [9] |
| Hydration & pH | RNA: hydrolyzed in acidic and alkaline conditions; DNA: hydrolyzed in acidic conditions [9] | Chemical breakdown of phosphodiester bonds in the nucleic acid backbone [9] |
| UV Radiation | Physical strand breakage, chemical structure changes [9] | Induces DNA lesions (e.g., thymine dimers) and generates reactive oxygen species [9] |
| Microbial Activity | Generation of small oligonucleotides (~80–200 bp) [9] | Bacterial and fungal enzymes digest nucleic acids for nutrients [9] |
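The combined effect of these factors on DNA quantity is often approximated as first-order decay. A sketch with hypothetical half-lives (the conditions and values below are illustrative, not measured):

```python
import math

def remaining_dna(n0_ng: float, half_life_h: float, t_h: float) -> float:
    """First-order decay: N(t) = N0 * exp(-k t), with k = ln 2 / half-life."""
    k = math.log(2) / half_life_h
    return n0_ng * math.exp(-k * t_h)

# Hypothetical half-lives by storage condition (hours); not measured values.
half_lives = {"cool/dry indoor": 240.0, "warm/humid": 48.0, "UV-exposed": 6.0}

n0 = 1.0  # ng deposited at t = 0
for condition, hl in half_lives.items():
    print(f"{condition:16s}: {remaining_dna(n0, hl, 24.0):.3f} ng after 24 h")
```

Fitting such a model to time-series data (as in the protocol below) yields condition-specific rate constants that can inform persistence probabilities in activity-level evaluations.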

Table 3: DNA Recovery Variation Across Laboratories (ReAct Project Data)

Data from the ReAct project, which involved 23 laboratories, highlight the considerable variation in DNA recovery rates, underscoring the need for standardized protocols when assessing persistence for activity-level interpretations [10].

| Experiment Context | Range of Median DNA Recoveries | Implication for Activity-Level Assessment |
| --- | --- | --- |
| Direct Transfer (Screwdriver) | 200 pg – 5 ng [10] | Likelihood ratios are highly dependent on laboratory-specific recovery rates [10] |
| Indirect Transfer | 200 pg – 5 ng [10] | Single or major contributor profiles strongly support the proposition of use [10] |

Experimental Protocols for Assessing Persistence

Protocol: Controlled Persistence Study on Simulated Evidence Surfaces

Objective: To quantify the persistence and degradation kinetics of trace DNA deposited on surfaces relevant to forensic and pharmaceutical contexts under varying environmental conditions.

Materials:

  • Donor samples (e.g., saliva, touch DNA from skin)
  • Substrates (e.g., glass, plastic, stainless steel, cotton cloth)
  • Environmental chambers (for temperature and humidity control)
  • UV light chamber
  • Research Reagent Solutions (See Section 5)

Methodology:

  • Sample Deposition: Apply a standardized volume (e.g., 10 µL) of a quantified DNA solution or use a controlled touch deposition protocol by donors onto sterile substrates. Allow to air-dry in a laminar flow hood.
  • Environmental Exposure: Place the inoculated substrates into environmental chambers with pre-set conditions. Key experimental arms include:
    • Temperature Series: e.g., -20°C, 4°C, 22°C, 37°C, 60°C.
    • Humidity Series: e.g., 20%, 50%, 80% relative humidity.
    • Light Exposure: Expose subsets to controlled UV-A/UV-B radiation for set durations.
    • Time Series: Retrieve replicates at set intervals (e.g., 0, 1, 3, 7, 14, 30 days).
  • Sample Recovery: Using the double-swabbing technique [11]:
    • Moisten a sterile cotton or nylon swab with purified water.
    • Roll the wet swab thoroughly over the entire deposition area with moderate pressure.
    • Follow with a dry swab, rolling over the same area to collect remaining moisture.
    • Air-dry both swabs and store at -20°C until extraction.
  • DNA Extraction and Quantification: Extract DNA using a validated kit (e.g., Qiagen MinElute). Quantify the recovered DNA using a qPCR-based kit (e.g., Quantifiler Trio) to assess both quantity and degradation index (DI).
  • STR Profiling and Analysis: Amplify quantified DNA extracts using a commercial STR kit (e.g., GlobalFiler). Analyze capillary electrophoresis data to determine profile completeness (percentage of full loci detected) and any stochastic effects (e.g., allele drop-out).
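The time-series data generated by this protocol are typically summarized as degradation kinetics. As a minimal sketch, the retrieved quantities can be fitted to a first-order decay model to estimate a half-life; the recovery values below and the log-linear fit are illustrative assumptions, not results from this protocol:

```python
import math

# Hypothetical median recoveries (ng) per retrieval day -- assumed for illustration
recoveries = {0: 5.0, 1: 4.1, 3: 2.8, 7: 1.4, 14: 0.5, 30: 0.06}

def fit_decay(data):
    """Least-squares fit of ln(q) = ln(q0) - k*t (first-order decay)."""
    ts = list(data)
    ys = [math.log(data[t]) for t in ts]
    n = len(ts)
    mean_t, mean_y = sum(ts) / n, sum(ys) / n
    slope = sum((t - mean_t) * (y - mean_y) for t, y in zip(ts, ys)) / sum(
        (t - mean_t) ** 2 for t in ts
    )
    k = -slope                              # decay constant (1/day)
    q0 = math.exp(mean_y - slope * mean_t)  # back-transformed initial quantity
    return q0, k

q0, k = fit_decay(recoveries)
print(f"q0 ~ {q0:.2f} ng, k ~ {k:.3f}/day, half-life ~ {math.log(2) / k:.1f} days")
```

The estimated decay constant can then be compared across the temperature, humidity, and UV arms to quantify how each condition accelerates loss.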

Protocol: Assessing Persistence on Human Skin (Mock Assault Scenario)

Objective: To evaluate the transfer, persistence, and recovery of non-self DNA from skin surfaces following physical contact.

Materials:

  • Volunteer "donors" and "recipients"
  • Sterile gloves, nitrile sleeves
  • Research Reagent Solutions (See Section 5)

Methodology:

  • Controlled Transfer: A donor firmly grips the forearm of a recipient for a standardized duration (e.g., 10 seconds).
  • Persistence Time Course: Sample the contact area on the recipient's forearm at defined post-contact intervals (e.g., immediately, 15, 30, 60, 120 minutes).
  • Optimal Recovery: Employ the double-swabbing technique, which has been shown to recover approximately 13.7% more offender DNA than single-swab methods on skin [11].
    • Use a wet swab followed by a dry swab, both with a combination of rolling and rubbing motions.
  • Genetic Analysis: Extract DNA and quantify. Use qPCR to determine the total human DNA concentration and the proportion of male DNA (if a male donor/female recipient model is used). Perform STR profiling and use probabilistic genotyping software to deconvolute mixed profiles and estimate the likelihood ratio for the donor's contribution.
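The male-fraction estimate in the genetic-analysis step reduces to simple arithmetic on the two qPCR targets. A sketch with assumed readings; both concentrations and the 10% minor-contributor heuristic are illustrative, not validated thresholds:

```python
# Hypothetical qPCR readings from a male-donor / female-recipient pairing
total_ng_per_ul = 0.120  # total human DNA concentration (assumed value)
male_ng_per_ul = 0.018   # male (Y-target) DNA concentration (assumed value)

donor_fraction = male_ng_per_ul / total_ng_per_ul
print(f"Estimated donor (non-self) fraction: {donor_fraction:.1%}")

# Illustrative heuristic only: very minor contributors risk allele drop-out
if donor_fraction < 0.10:
    print("Minor contributor: expect stochastic effects in the STR profile")
else:
    print("Donor likely detectable as a distinguishable mixture contributor")
```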

Signaling Pathways and Workflow Visualizations

DNA Degradation Pathways

External Factors (UV Radiation, Microbial Enzymes, Temperature, Aquatic Environments) → Strand Breaks & Base Modifications and Oligonucleotide Fragments (80–200 bp); Internal Factors (Endogenous Nucleases, Apoptosis/Necrosis, Cellular pH Changes) → Uncontrolled Enzyme Release → DNA Fragmentation; all pathways converge on Degraded DNA.

DNA Degradation Pathways Map

Evidence Persistence Workflow

Evidence Persistence Assessment Workflow

The Scientist's Toolkit: Research Reagent Solutions

Table 4: Essential Reagents and Kits for Persistence Studies

Reagent / Kit Name Function / Application Key Features
Quantifiler Trio DNA Quantification Kit Quantitative PCR (qPCR) for human DNA quantification and degradation assessment Detects human DNA as low as 32 pg/μL; provides a Degradation Index (DI) to gauge sample quality [9]
GlobalFiler PCR Amplification Kit Short Tandem Repeat (STR) multiplex profiling Analyzes multiple loci simultaneously; optimized for challenging, low-template, or degraded DNA samples [9]
IbpA-msfGFP Biosensor Detection and visualization of protein aggregation in bacterial persistence studies A fluorescent fusion protein that localizes to protein aggregates without triggering aggregation itself; marks early-stage aggregates [12]
SYTOX Green Stain Cell viability staining in dormancy and VBNC state studies Membrane-impermeable dye that stains only dead cells with compromised membranes; used in flow cytometry [12]
Puritan Cap-Shure Cotton Swabs Biological sample collection from surfaces and skin Commonly used for the double-swabbing technique; effective for non-self DNA recovery in skin contact studies [11]
Sodium Hypochlorite / Trigene / Virkon Work environment decontamination Effective removal of cell-free DNA to prevent contamination of tested samples and ensure analysis accuracy [9]

Within the framework of activity-level research in forensic genetics, evaluating how deoxyribonucleic acid (DNA) is transferred, persists, and is recovered from physical evidence is paramount for interpreting the significance of a DNA profile. A principal challenge lies in distinguishing whether a recovered DNA profile originates from direct handling of an item or from indirect transfer via an intermediary surface or environment. This distinction is especially critical in the investigation of illicit drug offenses, where the presence of a suspect's DNA on drug packaging does not, in itself, elucidate the specific activities that led to its deposition.

This application note addresses the critical recovery consideration of optimizing sampling protocols to maximize data integrity for subsequent activity-level interpretation. Data integrity, in this context, refers to the accuracy and consistency of the genetic data generated from forensic samples, which can be compromised by inefficient recovery techniques, background DNA contamination, or suboptimal sampling location selection. We present experimental data and detailed protocols from simulated drug distribution scenarios, focusing on how strategic sampling from specific areas of drug packages can enhance the reliability of downstream genetic analysis and support more robust evaluative opinions on the activity-level propositions.

Quantitative Data on DNA Transfer and Recovery

The following tables summarize quantitative findings from simulation studies, which are crucial for formulating sampling strategies that optimize data integrity.

Table 1: DNA Transfer Frequencies in Different Scenarios [13]

Experimental Scenario Sample Location on Mock Drug Package DNA Transfer Frequency Key Interpretation
Direct Handling Handle of Carrier Bag High Primary location for DNA deposition from the person packing/carrying the package.
Exterior Body of Carrier Bag Moderate Receives DNA from handling, but typically less than primary grip areas.
Interior Surfaces Low Protected from direct contact; DNA recovery suggests alternative transfer mechanisms.
Indirect Transfer (Vehicle/Household) Handle of Carrier Bag Low to Moderate Can acquire DNA from pre-existing environmental background on contaminated surfaces.
Exterior Body of Carrier Bag Low to Moderate Similar recovery to handles in indirect scenarios, suggesting passive, non-specific transfer.
Interior Surfaces Minimal Very low probability of DNA transfer via indirect means, making it a high-integrity sampling target.

Table 2: DNA Profile Complexity on Different Surfaces of Ziplock Bags [14]

Surface Sampled Typical Number of Contributors Profile Complexity Suitability for Activity Inference
Outside of Ziplock Bag Multiple Complex Mixtures High complexity can complicate attribution; may reflect recent handlers or transporters.
Inside of Ziplock Bag 1-2 Simple Mixtures / Single Source Higher integrity; more likely to reflect the individual(s) involved in the packing stage.
Exterior of Capsules 1-2 Simple Mixtures / Single Source High integrity; can identify the individual(s) who handled the capsules during manufacture.

Experimental Protocols for Key Scenarios

Protocol 1: Simulating Direct Handling and Packing

This protocol models the activity of an individual directly packing illicit drugs into packaging materials [13].

  • Objective: To assess the amount, distribution, and profile quality of DNA directly deposited during the packing process.
  • Materials: Resealable plastic bags (e.g., ~16cm x 9cm), larger plastic carrier bags (e.g., ~52cm x 31cm), inert mock drug substance (e.g., 200g of salt), sterile swabs, and DNA-free gloves.
  • Procedure:
    • Participant Preparation: Participants wash hands and resume normal activities for at least one hour prior to the experiment to allow for the natural accumulation of skin cells and DNA.
    • Packing Simulation: Without wearing personal protective equipment (PPE), the participant portions the mock drug substance into the resealable bag, places it inside the carrier bag, and then carries the bag by the handle for a defined period (e.g., 5 minutes).
    • Sample Collection: Using a double-swabbing technique (sterile wet swab followed by a dry swab), collect DNA from four distinct areas:
      • Handle of the carrier bag.
      • Exterior body of the carrier bag.
      • Interior of the carrier bag.
      • Exterior of the resealable bag.
  • Data Integrity Consideration: Sampling multiple, functionally distinct areas provides a spatial distribution map of DNA, which is critical for inferring the mode of contact (e.g., forceful grip vs. light brush).

Protocol 2: Assessing Indirect Transfer in Vehicular Environments

This protocol models the passive transfer of DNA onto drug packages stored in a vehicle linked to a suspect [13].

  • Objective: To evaluate the potential for DNA-free drug packages to acquire DNA profiles from background environmental DNA during storage.
  • Materials: DNA-free mock drug packages (prepared in a clean state with full PPE), participant vehicles used for >6 months.
  • Procedure:
    • Package Preparation: Prepare DNA-free mock drug packages in a controlled environment while wearing full PPE (lab coat, gloves, face mask, hair cap) to prevent pre-contamination.
    • Placement: Place one DNA-free package at each of several predefined locations within a vehicle (e.g., rear passenger seat, footwell, glove compartment, boot). The bag handle should remain untied.
    • Storage Duration: Store packages for both short (e.g., up to 3 hours) and long (e.g., 2 days) durations to study the effect of time. Participants resume normal vehicle usage.
    • Sample Collection:
      • Package Samples: Collect DNA from the handle, exterior body, and interior (carrier + resealable bag) of each package.
      • Background Samples: At the end of the storage period, collect a control sample from a 10cm x 10cm area adjacent to where the package was stored.
  • Data Integrity Consideration: The inclusion of background environmental samples and the use of enclosed vs. open storage locations allows for a refined assessment of whether a recovered DNA profile is more likely from direct handling or from incidental environmental transfer.
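The background-sample comparison in this protocol can be reduced to a per-location quantity check against the environmental control. A minimal sketch, with all recoveries assumed for illustration:

```python
# Hypothetical DNA quantities (ng) recovered from one stored package
package_samples = {
    "handle": 0.45,
    "exterior_body": 0.20,
    "interior": 0.02,
}
background_ng = 0.18  # adjacent 10 cm x 10 cm control area (assumed)

# Flag locations whose recovery exceeds the local environmental background
for location, qty in package_samples.items():
    verdict = "above background" if qty > background_ng else "within background range"
    print(f"{location}: {qty:.2f} ng -> {verdict}")
```

Recoveries that sit within the background range are consistent with incidental environmental transfer rather than direct handling.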

Protocol 3: Multi-Person Drug Distribution Chain

This protocol models a sequential chain of handling involving different individuals in the roles of maker, packer, and transporter [14].

  • Objective: To trace DNA transfer pathways through a multi-stage drug distribution process and identify which surfaces best retain DNA from specific actors.
  • Materials: Capsules, ziplock bags (ZLB), storage containers.
  • Procedure:
    • Two-Person Chain: Participant A makes and packs capsules into ZLBs. Participant C then carries the bags for four days.
    • Three-Person Chain: Participant A makes the capsules. Participant B places the capsules into ZLBs. Participant C carries the bags.
    • Sample Collection: Sample the exterior of capsules, the inside surface of the ZLB, the inner semi-protected portion of the ZLB opening, and the outside surface of the ZLB.
  • Data Integrity Consideration: This protocol highlights that the interior of bags and the exterior of capsules, which are less exposed to subsequent handling, often yield simpler, more interpretable DNA profiles that are more attributable to the initial packers and makers, thus preserving the integrity of the activity-level information.

Workflow Visualization for Evidence Interpretation

The following diagram outlines the logical workflow for optimizing sampling and interpreting DNA evidence based on activity-level propositions.

Start: DNA evidence recovered from a drug package → two activity-level propositions are formulated. Under Hp (Direct Handling), the expected findings are high DNA quantity on handles, a profile matching the suspect, and simple mixtures on the bag interior. Under Hd (Indirect Transfer), the expected findings are low-to-moderate DNA quantity, complex mixtures on exteriors, and profiles attributable to the environment. The expected findings under each proposition are compared with the actual data, and the evaluation concludes that the findings support Hp or Hd according to which expectation set the data align with.

Diagram 1: Activity-Level Evaluation Workflow for DNA Evidence

The Scientist's Toolkit: Essential Research Reagent Solutions

Table 3: Key Materials and Reagents for DNA Transfer and Recovery Studies [13]

Item Function / Application
Resealable Plastic Bags (Ziplock) Serve as the primary inner container for mock illicit substances, simulating common drug packaging.
Plastic Carrier Bags Act as the outer packaging; their handles and large surfaces are key areas for DNA deposition and recovery.
Inert Mock Substances (e.g., Salt) A safe and DNA-free substitute for illicit powders, allowing for realistic packing simulations without hazard.
Sterile Cotton Swabs The primary tool for sample collection; the "double swabbing" technique increases cellular material recovery.
DNA IQ Casework Kits Optimized for DNA extraction from challenging forensic samples, improving yield and inhibitor removal.
Personal Protective Equipment (PPE) Critical for preparing DNA-free control packages and preventing contamination from researchers during experiments.
STRmix / Verifiler Plus Probabilistic genotyping software and PCR amplification kit used for deconvoluting complex DNA mixtures from packaging.

Transfer, Persistence, Prevalence, and Recovery (TPPR) parameters form the cornerstone of forensic evidence interpretation, particularly when addressing activity-level propositions. The evaluation of findings must extend beyond mere identification to understanding how evidence arrived at a location, how long it remained, what background materials were already present, and how effectively it was collected. This framework is indispensable for reconstructing events and providing meaningful context to DNA and fiber evidence in criminal investigations. This article details practical applications and standardized protocols for integrating TPPR considerations into forensic casework, providing researchers and forensic professionals with the tools to bridge the gap between theoretical models and real-world evidentiary challenges.

TPPR Fundamentals and Quantitative Data

The core TPPR parameters are interdependent, each playing a vital role in the transfer and detection of trace evidence. Transfer refers to the movement of material from a source to a recipient surface during contact. Persistence describes how long the transferred material remains on the surface post-contact, considering factors like environmental conditions and surface properties. Prevalence (or Background) concerns the presence and quantity of similar material on the surface prior to the activity under investigation. Finally, Recovery encompasses the efficiency of the methods used to collect and analyze the evidence from the surface [15] [11].

The following tables consolidate quantitative data from empirical studies on DNA TPPR, providing a reference for evidence assessment.

Table 1: DNA Recovery Efficiencies from Skin Using Different Collection Methods

Sampling Technique Body Parts Commonly Sampled Key Finding Reference
Double-Swabbing (wet then dry) External skin (arms, neck, face), internal cavities Recovered ~13.7% more offender DNA than other methods; considered most effective. [11]
Single Swabbing (various movements) External skin Efficacy varied with movement (rubbing vs. rolling) and swab moisture. [11]
Tape Lifting External skin Applicable but less commonly used for body surfaces compared to swabbing. [11]

Table 2: Factors Influencing DNA-TPPR on Human Skin

TPPR Parameter Influencing Factor Impact on Evidence
Transfer Shedder status of the individual High shedders deposit more DNA [11].
Nature and pressure of contact Higher pressure and longer duration increase transfer [11].
Body area contacted Variations in skin texture, moisture, and oil affect pickup and deposition.
Persistence Time since contact DNA quantity generally decreases over time [11].
Post-contact activities (rubbing, washing) Can significantly accelerate the loss of DNA [11].
Prevalence Background DNA on recipient skin The presence of non-self DNA prior to contact can complicate analysis [11].
Self-DNA to non-self-DNA ratio Impacts the detection of a minor contributor's profile [11].
Recovery Swab type (e.g., cotton, nylon FLOQ) Different materials have varying collection efficiencies [11].
Swabbing technique (rolling, rubbing) Standardized protocols are critical for consistent recovery [11].

Experimental Protocols for TPPR Assessment

Protocol 1: Standardized Double-Swab Technique for DNA Recovery from Skin

This protocol is optimized for recovering touch DNA from skin surfaces following a mock assault scenario or other contact events [11].

  • Materials: Sterile water or recommended buffer (e.g., 0.01% PBS), Two sterile cotton or viscose swabs (e.g., Cap-Shure Puritan or nylon FLOQ swabs), Paper swab boxes, Evidence labels and chain-of-custody forms.
  • Procedure:
    • Moisten the tip of the first swab with the sterile solution. Avoid excessive saturation.
    • Roll the wet swab thoroughly over the entire target skin area while applying gentle, consistent pressure. Ensure the entire tip surface is used.
    • Immediately follow with a second, dry swab, re-rolling over the same skin area to collect residual moisture and cellular material.
    • Air-dry both swabs completely (approximately 60 minutes at room temperature).
    • Place both swabs from the same sample into a single swab box. Secure and label the evidence with case details, sample location, date, and collector's initials.
    • Store samples at -20°C prior to DNA extraction and profiling.

Protocol 2: Experimental Workflow for Assessing Fibre Persistence on Fabrics

This methodology outlines a controlled approach to study the persistence of transferred fibres on clothing substrates, critical for evaluating activity-level propositions [15].

  • Materials: Donor garments (source of fibres), Recipient garments (substrate), Force-standardized rubbing apparatus or mannequin, Tape lifts (e.g., SceneSafe FAST minitape), Sterile tweezers and evidence bags, Microscope with fluorescence capability.
  • Procedure:
    • Pre-Conditioning: Clean the recipient garments using a standardized method to remove extraneous fibres. Document the background fibre population.
    • Fibre Transfer: Mount donor and recipient fabrics on the apparatus. Execute a controlled contact with predefined pressure, duration, and direction.
    • Persistence Timeline: Following transfer, subject the recipient garment to controlled movement in a simulated environment (e.g., on a mannequin in a wind tunnel).
    • Sampling: At predetermined time intervals (e.g., 0, 1, 3, 6, 12 hours), collect fibre samples from standardized locations on the recipient garment using tape-lifting methods.
    • Analysis: Analyze tape lifts under a microscope. Count and characterize the transferred fibres (type, color) against the background. Record the decay rate of the fibre population over time.
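The decay rate recorded in the final step is often first approximated by a constant hourly retention fraction (exponential loss). A sketch of the expected counts at the protocol's sampling intervals; the initial count and retention fraction are assumptions for illustration:

```python
# Constant-retention (exponential) model of fibre loss during wear
initial_count = 200        # fibres transferred at t = 0 (assumed)
retention_per_hour = 0.55  # fraction retained each hour of wear (assumed)

for t in (0, 1, 3, 6, 12):  # sampling intervals from the protocol (hours)
    expected = initial_count * retention_per_hour ** t
    print(f"t={t:>2} h: ~{expected:7.2f} fibres ({expected / initial_count:.1%} remaining)")
```

Comparing the fitted retention fraction across fabric pairings quantifies how substrate texture influences persistence.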

Visualization of TPPR Concepts and Workflows

TPPR Activity Evaluation Logic

Activity Occurs (e.g., Physical Contact) → Transfer: does transfer occur under Proposition 1 / Proposition 2? → Persistence: does the evidence persist until recovery? → Prevalence: is the evidence explained by background? → Recovery: is the evidence recovered by the method used? A "no" at any stage, or the answers accumulated across all stages, feeds into the final step: Evaluate Findings Against Propositions.

Forensic DNA Recovery Workflow

Case Context & Propositions Defined → Body Area Selection & Examination → Evidence Collection (Double-Swab) → DNA Extraction & Profile Generation → TPPR-Informed Statistical Evaluation

The Scientist's Toolkit: Essential Research Reagents and Materials

Table 3: Key Materials for TPPR Research and Casework

Item Function/Application
Puritan Cap-Shure Cotton Swabs Standard swab for single and double-swab techniques for DNA recovery from skin and surfaces [11].
Nylon FLOQ Swabs Alternative swab type with specialized structure designed to improve cell collection and DNA release.
SceneSafe FAST Minitape Adhesive tape used for the non-destructive lifting of trace evidence like fibres and hair from clothing [11].
Sterile Water / PBS Buffer Liquid medium used to moisten swabs, facilitating the absorption of cellular material during collection.
Force-Standardized Contact Apparatus Equipment to apply uniform pressure and duration during simulated contact events for reproducible transfer studies.
Fluorescence Microscope Essential for the visualization, counting, and characterization of fibres and other microscopic trace materials [15].
Multiplex PCR Kits High-sensitivity kits enabling the generation of DNA profiles from low-template and trace DNA samples [11].

From Theory to Practice: Methodologies for Modeling and Data Generation

Designing Universal Experimental Protocols for TPPR Studies

Within the framework of a broader thesis on transfer and persistence considerations for activity-level research, the development of robust, universal experimental protocols is paramount. Transfer, Persistence, Prevalence, and Recovery (TPPR) studies provide the foundational data required to evaluate forensic findings given activity-level propositions, moving beyond mere source attribution to reconstruct specific events [11]. The heightened sensitivity of modern DNA profiling techniques, capable of generating profiles from minute quantities of biological material, necessitates a rigorous understanding of how DNA is transferred to and from various surfaces, its background prevalence, its persistence over time, and the efficiency of its recovery [11]. This document outlines standardized application notes and protocols designed to generate reliable, reproducible, and forensically relevant TPPR data, with a particular focus on skin surfaces and touched objects.

Core Experimental Design Principles

The design of TPPR experiments must control for numerous variables to ensure data validity and inter-laboratory comparability. The ReAct project, a large-scale initiative by the European Network of Forensic Science Institutes (ENFSI), highlighted considerable variation in DNA quantity recovery rates between different laboratories, which directly impacts the likelihood ratios derived for activity-level propositions [10]. The following principles are foundational:

  • Proposition Formulation: Experiments must be designed around clear, casework-relevant activity-level propositions. For example, a direct transfer scenario ("The defendant used the tool") versus an indirect transfer scenario ("An unknown person used the defendant's stolen tool") [10].
  • Standardization: Protocols for sample collection, processing, and analysis must be meticulously standardized to minimize inter-laboratory variation, a key finding of the ReAct project [10].
  • Context Relevance: The selection of surfaces (e.g., skin types, tool handles), contact types (e.g., grip, touch), and environmental conditions should reflect realistic forensic scenarios [11].

Experimental Workflow

The generic workflow for a TPPR study involves several sequential stages, from design to data interpretation. The diagram below visualizes this process.

Define Activity-Level Propositions → Design Mock Crime Scenario → Participant & Surface Preparation → Controlled Contact Event → Persistence Phase (Time/Environment) → Sample Collection (e.g., Double-Swabbing) → Laboratory Analysis (DNA Quantitation/Profiling) → Data Interpretation (LR Calculation via BN)

Detailed Methodologies for Key TPPR Experiments

Protocol 1: Direct and Indirect DNA Transfer

This protocol simulates transfer events to address questions about whether a person handled an object directly or if DNA was transferred indirectly.

  • Objective: To model direct and indirect DNA transfer scenarios and evaluate the resulting DNA profiles in the context of activity-level propositions.
  • Background: The ReAct project successfully employed such designs, simulating a robbery where a screwdriver was used to force a door. In the direct transfer experiment, the defendant used the screwdriver. In the indirect transfer experiment, an unknown person used the defendant's stolen screwdriver after the defendant had handled it, transferring DNA indirectly [10].

Experimental Procedure:

  • Surface Preparation: Use pre-cleaned objects relevant to casework (e.g., screwdriver handles). For direct transfer, the object should be pristine. For indirect transfer, the object should be pre-loaded with DNA from a "first individual" by having them hold it for a standardized duration (e.g., 2 minutes).
  • Contact Event:
    • Direct Transfer: A "second individual" (the mock offender) directly uses the object (e.g., grips the screwdriver) to perform a task.
    • Indirect Transfer: An "unknown person" uses the object that was previously handled by the "first individual," without the first individual directly contacting the final surface.
  • Sampling: Sample the object using the double-swabbing technique [11].
  • Controls: Include negative controls (swabs from unused, cleaned surfaces) and positive controls (swabs from surfaces with known DNA deposits).

Protocol 2: DNA Persistence on Skin Following Contact

This protocol assesses how long foreign DNA persists on skin surfaces under different environmental conditions.

  • Objective: To quantify the degradation of transferred DNA on skin over time and under varying environmental conditions.
  • Background: The persistence of DNA is a critical factor between the alleged activity and the sampling of a victim or suspect. Factors such as body area, sweat, and friction can affect persistence rates [11].

Experimental Procedure:

  • Standardized Contact: A "donor" makes a controlled, bare-handed contact (e.g., a 30-second grip) on the forearm of a "recipient."
  • Persistence Phase: Recipients are exposed to different environmental conditions (e.g., room temperature vs. elevated heat and humidity) for varying time intervals (e.g., 0, 15, 30, 60, 120 minutes).
  • Sampling: At each time interval, sample the contact area on the recipient's skin using the double-swabbing technique [11].
  • Data Collection: Record the total human DNA quantity and the percentage of the donor's DNA profile obtained from each sample.

Protocol 3: Prevalence of Background DNA on Skin

This protocol maps the baseline level of "self" and "non-self" DNA on various body areas prior to experimental contact.

  • Objective: To characterize the background DNA profile on different skin surfaces of individuals under different activities of daily living.
  • Background: The likelihood of detecting DNA transferred during a specific contact is heavily influenced by the pre-existing background DNA on that surface [11].

Experimental Procedure:

  • Cohort Selection: Recruit participants from different occupational and lifestyle backgrounds.
  • Sampling Regimen: Swab predefined body areas (e.g., neck, hands, forearms) of participants at different times of the day, using a double-swab technique.
  • Analysis: Generate DNA profiles from each sample and interpret the mixtures to determine the relative contributions of "self" (the individual) and "non-self" (unknown individuals) DNA.

Data Interpretation and Statistical Analysis

The data generated from TPPR experiments are interpreted using Bayesian frameworks to compute a Likelihood Ratio (LR). The LR assesses the strength of evidence given two competing activity-level propositions posed by the prosecution and defense [10].

  • Bayesian Networks (BNs): The ReAct project utilized BNs implemented via an R Shiny platform (Shiny_React()) to calculate LRs. These networks incorporate probabilities of DNA transfer, persistence, and recovery based on experimental data [10].
  • Qualitative Categorization: Profiles can also be categorized qualitatively as "absent," "single contributor," "major contributor," or "other" for comparison with BN outputs [10].
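The way TPPR probabilities combine into an LR can be sketched without a full Bayesian network. In this minimal model, the transfer-persistence-recovery chain either succeeds or the observed profile is explained by background prevalence; all probability values are illustrative assumptions, not ReAct estimates:

```python
def p_evidence(p_transfer, p_persist, p_recover, p_background):
    """P(matching DNA detected): the transfer-persistence-recovery chain
    succeeds, or the profile is explained by background prevalence."""
    p_chain = p_transfer * p_persist * p_recover
    return p_chain + (1 - p_chain) * p_background

# Hp: direct handling (high transfer probability) -- assumed values
p_e_hp = p_evidence(0.85, 0.70, 0.60, 0.05)
# Hd: indirect transfer only (low transfer probability) -- assumed values
p_e_hd = p_evidence(0.10, 0.70, 0.60, 0.05)

lr = p_e_hp / p_e_hd
print(f"P(E|Hp)={p_e_hp:.3f}  P(E|Hd)={p_e_hd:.3f}  LR={lr:.1f}")
```

A full BN, as used in Shiny_React(), conditions each node on experimental data rather than fixed point probabilities, but the LR arithmetic follows the same logic.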

The following diagram illustrates the logical relationship between TPPR factors and the resulting Likelihood Ratio within a Bayesian network.

Transfer, Prevalence, Persistence, and Recovery each feed into the DNA Evidence node, which in turn determines the Likelihood Ratio.

The Scientist's Toolkit: Research Reagent Solutions

The table below details essential materials and reagents for conducting the TPPR experiments described in this protocol.

Table 1: Key Research Reagent Solutions and Materials for TPPR Studies

Item Function/Description Application in TPPR
Cotton or Nylon FLOQ Swabs The primary tool for sample collection from surfaces. The double-swabbing technique (wet swab followed by dry swab) has been shown to recover approximately 13.7% more DNA than single-swab methods [11]. Recovery
Sterile Deionized Water The moistening agent for the first swab in the double-swabbing technique. It facilitates the release of cellular material from the surface without introducing inhibitors. Recovery
DNA Extraction Kits Commercial kits optimized for maximum DNA yield from complex and low-template samples, including those collected from skin. Recovery
Quantitative PCR (qPCR) Assays For determining the total human DNA concentration and the presence of PCR inhibitors in a sample. This quantitative data is crucial for input into Bayesian Networks. Analysis
Multiplex PCR STR Kits High-sensitivity amplification kits for generating DNA profiles from low-level DNA and complex mixtures. Analysis
Bayesian Network Software (e.g., R Shiny) Software platform for implementing probabilistic models to calculate Likelihood Ratios given activity-level propositions, as demonstrated by the ReAct project's Shiny_React() application [10]. Data Interpretation

The following tables summarize typical quantitative findings from TPPR research, which should be used as a benchmark for data generated using these protocols.

Table 2: DNA Recovery from Different Sampling Methods in a Mock Assault Scenario (Skin-to-Skin Contact)

| Sampling Method | Relative Recovery of Offender DNA (%) | Key Findings |
| --- | --- | --- |
| Double-Swabbing (Cotton) | 113.7% | Recovered significantly more offender DNA than single-swab methods [11]. |
| Single Swab (Wet, Rolling) | 100% (baseline) | Used as the comparison baseline in the study. |
| Nylon FLOQ Swab (Dry, Rubbing) | ~90% | Variation highlights the impact of swab material and technique. |
| Tape Lifting | ~85% | Less effective for skin surfaces than double-swabbing. |

Table 3: Inter-Laboratory Variation in DNA Quantity Recovery (ReAct Project Data)

| Parameter | Finding | Implication for Protocol Design |
| --- | --- | --- |
| Range of Median DNA Recovery | 200 pg to 5 ng | For the same experiment, different labs recovered quantities differing by more than an order of magnitude, emphasizing the need for standardized protocols [10]. |
| Impact on Likelihood Ratios | The LRs were affected by the recovered DNA quantity. | Standardization is critical for generating reliable, comparable data for activity-level assessment. |

Leveraging Model-Informed Drug Development (MIDD) for Pharmacokinetic Predictions

Model-Informed Drug Development (MIDD) is a quantitative framework that integrates computational modeling and simulation (M&S) based on preclinical and clinical data to inform drug development and regulatory decision-making [16]. According to the International Council for Harmonisation (ICH) M15 guidelines, MIDD is defined as "the strategic use of computational modeling and simulation methods that integrate nonclinical and clinical data, prior information, and knowledge to generate evidence" [16]. In the specific context of pharmacokinetics (PK), which describes the time course of drug absorption, distribution, metabolism, and excretion (ADME) in the body, MIDD approaches provide powerful tools for predicting drug exposure, optimizing dosing regimens, and supporting regulatory submissions [17] [16].

The application of MIDD for PK predictions transforms drug development by enabling researchers to extrapolate knowledge, simulate various scenarios, and quantify uncertainty. This approach is particularly valuable for addressing challenges in developing complex therapies, such as gene therapies and cell therapies, and for optimizing doses in special populations where clinical trials are difficult or unethical to conduct [17] [16]. The framework fosters a structured consultative process between drug sponsors and regulatory agencies, ensuring early alignment on modeling approaches to inform decision-making [16].

Regulatory Framework and MIDD Adoption

The regulatory evolution of MIDD began in the mid-1990s, culminating in recent structured programs and international harmonization. The U.S. Food and Drug Administration's (FDA) MIDD Paired Meeting Program, established under the Prescription Drug User Fee Act (PDUFA VII), provides a formal pathway for sponsors to discuss MIDD approaches with regulatory agencies during fiscal years 2023-2027 [18]. This program grants selected sponsors the opportunity for initial and follow-up meetings to discuss the application of MIDD approaches for specific drug development programs, focusing on priority areas such as dose selection, clinical trial simulation, and predictive safety evaluation [18].

Internationally, the ICH M15 guideline, released as a draft in 2024, aims to harmonize expectations between regulators and sponsors, support consistent regulatory decisions, and minimize errors in the acceptance of M&S to inform drug labels [16]. The guideline establishes a standardized credibility framework for computational models and provides a comprehensive taxonomy of MIDD terms, including Question of Interest (QOI), Context of Use (COU), and Model Risk assessment [16]. This harmonization is crucial as regulatory agencies worldwide, including the European Medicines Agency and Japan's Pharmaceuticals and Medical Devices Agency, increasingly incorporate MIDD approaches into their review processes [19].

Table 1: Key Regulatory Initiatives Supporting MIDD Adoption

| Initiative | Lead Organization | Key Focus Areas | Status/Timeline |
| --- | --- | --- | --- |
| MIDD Paired Meeting Program | FDA CDER/CBER | Dose selection, clinical trial simulation, predictive safety evaluation [18] | Fiscal years 2023-2027 [18] |
| ICH M15 Guideline | International Council for Harmonisation | General principles for MIDD planning, model assessment, and documentation [16] | Draft released November 2024; implementation expected 2025/2026 [16] [19] |
| ISTAND Pilot Program | FDA | Qualification of novel drug development tools, including non-animal M&S methodologies [19] | Ongoing pilot [19] |

MIDD Approaches for PK Prediction

Traditional and Emerging Modeling Techniques

MIDD encompasses a suite of quantitative modeling approaches that support PK prediction throughout the drug development continuum. Population PK (PopPK) modeling, typically using nonlinear mixed-effects modeling, remains a preeminent methodology for characterizing drug exposure and variability in patient populations [16]. Physiologically-Based Pharmacokinetic (PBPK) models simulate how a drug moves through the body using virtual populations, helping to predict responses in special populations like children, the elderly, or patients with organ impairment [19]. A frequent application of PBPK models in regulatory submissions is the prediction of drug-drug interactions [16].

Emerging approaches include Quantitative Systems Pharmacology (QSP) models, which integrate drug-specific data with detailed biological pathway information to simulate a drug's effects on a disease system over time, thereby predicting both efficacy and safety [19]. The adoption of QSP in regulatory submissions has more than doubled since 2021, supporting applications for both small molecules and biologics across multiple therapeutic areas [19].

Artificial Intelligence and Machine Learning in MIDD

Artificial Intelligence (AI) and Machine Learning (ML) present transformative opportunities for enhancing PK predictions within the MIDD paradigm. A 2025 comparative analysis of traditional NONMEM software versus AI-based models for population PK prediction demonstrated that AI/ML models often outperform traditional nonlinear mixed-effects models, with variations in performance depending on model type and data characteristics [20].

Specifically, neural Ordinary Differential Equation (ODE) models showed strong performance and explainability, particularly with large datasets [20]. Another novel framework uses chemical structure data to directly predict rat PK profiles, eliminating the need for animal experimentation in early discovery phases [21]. This ML framework uses molecular structure represented as SMILES strings to predict critical ADME properties (clearance and volume of distribution), which then serve as inputs for a separate ML model that predicts the full concentration-time profile [21]. For compounds with sufficient structural similarity to training data, this approach achieved reasonable predictive accuracy, offering potential for virtual PK screening of novel molecules [21].

Table 2: Comparison of Traditional and AI-Based PK Modeling Approaches

| Characteristic | Traditional NLME (e.g., NONMEM) | AI/ML Approaches (e.g., Neural ODEs) |
| --- | --- | --- |
| Foundation | Statistical, parametric models [20] | Data-driven, flexible algorithms [20] [21] |
| Data Requirements | Well-structured, often smaller datasets [20] | Larger datasets for optimal performance [20] |
| Interpretability | High, with established diagnostic tools [20] | Varies; neural ODEs offer better explainability [20] |
| Computational Efficiency | Can be computationally intensive for complex models [20] | Potentially faster prediction once trained [20] |
| Key Strengths | Robust methodology, regulatory familiarity [20] [16] | Handling complex patterns, potential performance gains [20] |

Protocols for MIDD Implementation in PK Studies

Protocol: Developing a Population PK Model for Dose Optimization

Objective: To develop a PopPK model characterizing the relationship between drug exposure, patient covariates, and dosing regimens to optimize therapy for specific subpopulations.

Materials and Software:

  • NONMEM, R, or Python for model development
  • Patient PK data (drug concentrations, dosing history, timing)
  • Patient covariate data (demographics, organ function, comorbidities)

Methodology:

  • Data Preparation: Compile a structured dataset incorporating all PK observations, exact dosing histories, and relevant patient covariates. Ensure data quality through validation checks.
  • Base Model Development: Identify the structural PK model (e.g., one- or two-compartment) that best describes the drug's disposition. Estimate inter-individual variability and residual error.
  • Covariate Model Building: Systematically test the influence of patient covariates (e.g., body weight, renal function) on key PK parameters using stepwise forward addition and backward elimination.
  • Model Validation: Evaluate the final model using diagnostic plots, visual predictive checks, and bootstrap analysis to ensure robust and unbiased parameter estimates.
  • Clinical Trial Simulations: Use the validated model to simulate various dosing scenarios in virtual patient populations to identify regimens that maximize therapeutic efficacy while minimizing toxicity [16].
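The final simulation step can be sketched in plain Python. This is a toy forward simulation of a one-compartment oral-absorption model for a virtual population, not a NONMEM workflow; every parameter value and variability magnitude below is hypothetical.

```python
import math
import random

def conc(t, dose, ka, cl, v):
    """One-compartment model with first-order absorption (Bateman equation)."""
    ke = cl / v
    return (dose * ka / (v * (ka - ke))) * (math.exp(-ke * t) - math.exp(-ka * t))

random.seed(1)
TV_CL, TV_V, KA, DOSE = 5.0, 50.0, 1.0, 100.0  # L/h, L, 1/h, mg (hypothetical)

cmax = []
for _ in range(200):                                # 200 virtual subjects
    cl = TV_CL * math.exp(random.gauss(0, 0.3))     # ~30% IIV on clearance
    v = TV_V * math.exp(random.gauss(0, 0.2))       # ~20% IIV on volume
    cmax.append(max(conc(t, DOSE, KA, cl, v) for t in range(1, 25)))

cmax.sort()
print(f"median Cmax: {cmax[100]:.2f} mg/L")
```

In a real analysis, the typical values and variability terms would come from the validated covariate model, and the simulated exposures would be compared against a therapeutic target window.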
Protocol: Applying a PBPK Model for Special Population Dosing

Objective: To leverage a verified PBPK model for predicting PK in populations where clinical trials are not feasible, such as pediatric patients or those with hepatic impairment.

Materials and Software:

  • PBPK software platform (e.g., Simcyp, PK-Sim)
  • Drug-specific parameters (e.g., lipophilicity, protein binding, metabolic pathways)
  • System data for virtual populations (anthropometric, physiological, enzymatic)

Methodology:

  • Model Verification: First, develop and verify a PBPK model against observed clinical PK data in healthy adults to establish predictive credibility.
  • Virtual Population Construction: Generate age- or disease-specific virtual populations using built-in demographic and pathophysiological databases within the software.
  • Extrapolation and Simulation: Simulate drug exposure following standard or adjusted dosing regimens in the target special population.
  • Dose Recommendation: Based on the simulation results (e.g., achieving target exposure metrics comparable to those in adults), propose optimized dosing regimens for the special population [19].
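As one concrete example of the extrapolation step, pediatric clearance is often scaled allometrically from the verified adult value. The sketch below uses the common 0.75 exponent as a generic convention; it is not the method of any particular PBPK platform, and the numbers are illustrative.

```python
# Allometric scaling of clearance to body weight (hypothetical values).
# Exponent 0.75 is a widely used default for clearance.

def scale_clearance(cl_adult, weight_kg, adult_weight_kg=70.0, exponent=0.75):
    return cl_adult * (weight_kg / adult_weight_kg) ** exponent

cl_child = scale_clearance(cl_adult=10.0, weight_kg=20.0)  # 20 kg child
print(f"{cl_child:.2f} L/h")  # → 3.91 L/h
```

A full PBPK extrapolation would additionally account for enzyme ontogeny and age-dependent physiology rather than body size alone.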
Protocol: Implementing an AI/ML Framework for Early PK Prediction

Objective: To predict in vivo PK profiles in rats for new chemical entities using only molecular structure information, thereby reducing reliance on early animal testing.

Materials and Software:

  • Python with RDKit package for molecular descriptor calculation
  • Dataset of historical PK profiles and structures for model training
  • ML libraries (e.g., scikit-learn, XGBoost, PyTorch for neural ODEs)

Methodology:

  • Data Curation: Compile a dataset of SMILES strings and corresponding in vivo PK parameters (CL, Vdss) and/or concentration-time profiles.
  • Feature Generation: Convert SMILES strings into molecular fingerprints or descriptors using RDKit to create numerical representations of the chemical structures.
  • Model Training:
    • Train ML models (e.g., Random Forest, XGBoost) to predict CL and Vdss from the molecular features [21].
    • Train a separate ML model (e.g., Neural ODE) that uses the predicted CL and Vdss, along with time points, to generate the full concentration-time profile [21].
  • Validation: Assess model performance on a held-out test set of compounds using metrics such as mean absolute percentage error (MAPE) and R² for the predicted PK profiles [21].
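The validation metrics named in the final step can be computed directly. A self-contained sketch on made-up observed/predicted concentration values:

```python
# MAPE and R^2 for predicted vs. observed PK points (hypothetical data).

def mape(observed, predicted):
    """Mean absolute percentage error, in percent."""
    return 100 * sum(abs((o - p) / o) for o, p in zip(observed, predicted)) / len(observed)

def r_squared(observed, predicted):
    """Coefficient of determination."""
    mean_o = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_o) ** 2 for o in observed)
    return 1 - ss_res / ss_tot

obs = [12.0, 8.5, 5.1, 2.9, 1.6]    # observed concentrations (mg/L)
pred = [11.1, 9.0, 4.8, 3.2, 1.4]   # model predictions

print(f"MAPE = {mape(obs, pred):.1f}%  R2 = {r_squared(obs, pred):.3f}")
# → MAPE = 8.4%  R2 = 0.982
```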

Visualization of MIDD Workflows and Relationships

MIDD for PK Prediction Workflow

Workflow: Define Question of Interest (QOI) → Data Integration (preclinical and clinical PK) → Model Selection (PopPK, PBPK, QSP, AI/ML) → Model Development & Validation → Simulation of Scenarios → Informed Decision (dosing, trial design) → Regulatory Interaction/Submission, with iterative refinement feeding back to the QOI.

AI/ML vs. Traditional PK Modeling

Diagram: traditional PK modeling (mechanistic/theory-driven, high interpretability, regulatory familiarity) and AI/ML PK modeling (data-driven/pattern-based, handles complex data, emerging regulatory path) both converge on enhanced PK prediction for MIDD.

Table 3: Key Resources for MIDD-based Pharmacokinetic Research

| Tool/Resource | Type | Primary Function in PK Prediction |
| --- | --- | --- |
| NONMEM | Software | Industry standard for nonlinear mixed-effects (population) PK/PD modeling [20] |
| PBPK Platforms (Simcyp, PK-Sim) | Software | Simulate ADME processes in virtual human populations to predict PK in untested scenarios [19] |
| R/Python with ML libraries (scikit-learn, XGBoost, PyTorch) | Software/Programming Environment | Develop custom AI/ML models for predicting PK parameters and profiles from structure or in vitro data [20] [21] |
| RDKit | Software/Chemoinformatics | Generate molecular descriptors and fingerprints from chemical structures for QSAR and ML models [21] |
| PKTC Corpus | Data Resource | An expert-annotated corpus of 2,640 tables from PK literature for training ML models to extract PK parameters [22] |
| EPA TK Database | Data Resource | An open database of toxicokinetic time-series data and parameters for environmental chemicals, useful for modeling [23] |
| Model Analysis Plan (MAP) | Document | Pre-specified plan detailing objectives, data, methods, and evaluation criteria for a MIDD analysis, as recommended by ICH M15 [16] |

Model-Informed Drug Development (MIDD) is an essential framework for advancing drug development and supporting regulatory decision-making [24]. The fit-for-purpose (FFP) approach strategically aligns MIDD tools with key Questions of Interest (QOI) and Context of Use (COU) across all development stages [24]. This methodology ensures that modeling techniques are appropriately matched to specific scientific and clinical questions, optimizing resource utilization and enhancing decision-making confidence.

The U.S. Food and Drug Administration (FDA) has established the Fit-for-Purpose Initiative, providing a pathway for regulatory acceptance of dynamic tools for use in drug development programs [25]. This initiative acknowledges that a Drug Development Tool (DDT) is deemed FFP based on the acceptance of the proposed tool following thorough evaluation of the submitted information [25].

The Strategic Framework: Aligning Tools with Development Stages

The Five-Stage Drug Development Process

Drug development follows a structured process with five main stages, each presenting distinct challenges and QOIs [24]:

  • Discovery: Identifying disease targets and testing compounds for potential drug candidates
  • Preclinical Research: Evaluating biological activity, potential benefits, and safety in laboratory and animal studies
  • Clinical Research: Three-phase human testing (safety, effectiveness, side effects, confirmation of benefits)
  • Regulatory Review: FDA evaluation of all submitted data for approval decisions
  • Post-Market Monitoring: Ongoing safety surveillance in real-world use

Stage-Appropriate MIDD Tool Selection

Table 1: MIDD Tools Aligned with Development Stages and Key QOIs

| Development Stage | Primary QOIs | Recommended MIDD Tools | Key Outputs |
| --- | --- | --- | --- |
| Discovery | Target validation, compound screening | QSAR, AI/ML approaches | Predictive activity models, lead compound identification |
| Preclinical Research | FIH dose prediction, toxicity assessment | PBPK, QSP/T, FIH dose algorithm | Safe starting doses, mechanistic understanding |
| Clinical Development | Population variability, dose optimization | PPK/ER, semi-mechanistic PK/PD, Bayesian inference | Exposure-response relationships, dosing regimens |
| Regulatory Submission | Label claims, comparative effectiveness | Model-integrated evidence, MBMA | Evidence for labeling, support for claims |
| Post-Market | Lifecycle management, new indications | PBPK, virtual population simulation | Support for label updates, new patient populations |

Core MIDD Methodologies and Experimental Protocols

Quantitative Structure-Activity Relationship (QSAR) Modeling

Protocol 3.1.1: QSAR Model Development for Lead Compound Optimization

Purpose: To predict biological activity of compounds based on chemical structure to prioritize synthesis and testing [24].

Materials and Reagents:

  • Chemical structure database (e.g., ChEMBL, PubChem)
  • Molecular descriptor calculation software (e.g., Dragon, MOE)
  • Biological activity data (IC50, Ki, or EC50 values)
  • Statistical analysis environment (e.g., R, Python with scikit-learn)

Procedure:

  • Data Curation: Collect and standardize chemical structures and associated biological activity data
  • Descriptor Calculation: Generate molecular descriptors (2D, 3D) and fingerprint representations
  • Model Training: Apply machine learning algorithms (random forest, support vector machines, neural networks)
  • Validation: Perform internal (cross-validation) and external validation using hold-out test sets
  • Application: Screen virtual compound libraries to identify promising candidates

Acceptance Criteria: Q² > 0.6 for internal validation, R² > 0.7 for external test set prediction
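For the virtual screening step, candidates are commonly ranked by fingerprint similarity to known actives. A minimal sketch using Tanimoto similarity on made-up bit sets; in practice the fingerprints would be generated by chemoinformatics software such as RDKit or MOE, and the bit positions below are pure illustration.

```python
# Rank hypothetical candidates by Tanimoto similarity to an active compound.

def tanimoto(fp_a: set, fp_b: set) -> float:
    """Tanimoto coefficient: shared bits over total bits set."""
    return len(fp_a & fp_b) / len(fp_a | fp_b)

active = {1, 4, 7, 9, 12, 15}                       # fingerprint of known active
candidates = {
    "cmpd_A": {1, 4, 7, 9, 13},                     # shares most bits
    "cmpd_B": {2, 5, 8, 11},                        # shares none
}

ranked = sorted(candidates, key=lambda c: tanimoto(active, candidates[c]),
                reverse=True)
print(ranked)  # → ['cmpd_A', 'cmpd_B']
```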

Physiologically Based Pharmacokinetic (PBPK) Modeling

Protocol 3.2.1: PBPK Model Development for First-in-Human Dose Prediction

Purpose: To mechanistically understand the interplay between physiology and drug product quality for FIH dose prediction [24].

Materials:

  • Physiological parameters database (tissue volumes, blood flows)
  • Drug-specific parameters (log P, pKa, blood-to-plasma ratio)
  • In vitro metabolism and transport data
  • PBPK software platform (e.g., GastroPlus, Simcyp, PK-Sim)

Procedure:

  • System Configuration: Define physiological characteristics of virtual population
  • Compound Configuration: Incorporate drug-specific physicochemical and in vitro data
  • Model Verification: Compare predictions with available in vivo data (if any)
  • Sensitivity Analysis: Identify critical parameters influencing PK predictions
  • FIH Dose Prediction: Simulate exposure profiles to determine safe starting dose and escalation scheme

Acceptance Criteria: Predicted PK parameters within 2-fold of observed values in verification dataset
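The 2-fold acceptance criterion is mechanical to check: a prediction passes when the predicted/observed ratio falls between 0.5 and 2. A small sketch with hypothetical predicted and observed clearance pairs:

```python
# 2-fold acceptance check for PBPK verification (hypothetical values, L/h).

def within_twofold(predicted, observed):
    ratio = predicted / observed
    return 0.5 <= ratio <= 2.0

pairs = [(4.2, 5.0), (11.0, 5.0), (6.0, 9.5)]  # (predicted, observed)
print([within_twofold(p, o) for p, o in pairs])  # → [True, False, True]
```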

Population PK/Exposure-Response (PPK/ER) Modeling

Protocol 3.3.1: Population PK Model Development

Purpose: To characterize sources of variability in drug exposure among individuals in the target population [24].

Materials:

  • Rich or sparse PK sampling data from clinical trials
  • Patient demographic and covariate data
  • NONMEM, Monolix, or R with nlmixr software
  • Diagnostic visualization tools

Procedure:

  • Base Model Development: Identify structural model (1, 2, or 3-compartment)
  • Stochastic Model Building: Characterize interindividual and residual variability
  • Covariate Model Development: Identify significant demographic, pathophysiological factors
  • Model Validation: Perform bootstrap, visual predictive check, cross-validation
  • Simulations: Conduct simulations to support dosing recommendations in special populations

Acceptance Criteria: Successful covariate model with physiological plausibility, adequate diagnostic plots, successful validation
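The bootstrap named in the validation step resamples subjects with replacement and re-estimates the model per replicate. The toy illustration below, on a hypothetical sample of individual clearances, only re-estimates a mean; a real PopPK bootstrap refits the entire nonlinear mixed-effects model for each replicate.

```python
import random

# Nonparametric bootstrap of a summary parameter (hypothetical CL values, L/h).
random.seed(7)
observed_cl = [4.1, 5.3, 4.8, 6.0, 5.5, 4.4, 5.1, 4.9, 5.7, 5.2]

estimates = []
for _ in range(1000):
    sample = [random.choice(observed_cl) for _ in observed_cl]  # resample n of n
    estimates.append(sum(sample) / len(sample))

estimates.sort()
lo, hi = estimates[24], estimates[974]          # empirical 95% interval
print(f"bootstrap 95% CI for mean CL: {lo:.2f} - {hi:.2f} L/h")
```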

Research Reagent Solutions and Essential Materials

Table 2: Essential Research Reagents and Materials for MIDD Implementation

| Category | Specific Tools/Platforms | Function | Application Context |
| --- | --- | --- | --- |
| Software Platforms | NONMEM, Monolix, Phoenix NLME | Population PK/PD modeling | PPK, ER analysis, covariate testing |
| PBPK Platforms | GastroPlus, Simcyp, PK-Sim | Mechanistic PK prediction | FIH dosing, DDI prediction, formulation optimization |
| Systems Pharmacology | DILIsym, GI-sym, QSP toolboxes | Mechanism-based efficacy/toxicity prediction | Target validation, preclinical safety |
| Statistical Environments | R, Python, SAS, MATLAB | Data analysis, machine learning, visualization | QSAR, MBMA, Bayesian analysis |
| Clinical Trial Tools | East, FACTS | Adaptive trial design simulation | Dose-finding, Bayesian optimal interval design |
| Data Management | CDISC standards, SQL databases | Data standardization and management | Regulatory submission, analysis readiness |

Visualization of MIDD Workflows and Strategic Alignment

Fit-for-Purpose MIDD Tool Selection Algorithm

Workflow: Define the Question of Interest (QOI) and specify the Context of Use (COU), then identify the development stage. Each stage maps to its tools (Discovery: QSAR, AI/ML approaches; Preclinical: PBPK modeling, FIH algorithm; Clinical Development: PPK modeling, ER analysis; Regulatory Review: model-integrated evidence; Post-Market: virtual simulation). All tool outputs flow into model evaluation and validation, which informs the development decision.

Diagram Title: FFP MIDD Tool Selection Workflow

MIDD Tool Progression Across Development Stages

Diagram: Discovery (QSAR, AI/ML) leads through lead optimization to the Preclinical stage (PBPK, QSP/T, FIH algorithm), then via FIH dosing to Early Clinical (semi-mechanistic PK/PD, Bayesian inference), via dose selection to Late Clinical (PPK/ER, adaptive design), via label claims to Regulatory Review (MBMA, model-integrated evidence), and via lifecycle management to Post-Market (virtual population, trial simulation). Throughout, the QOI evolves: target → safety → efficacy → label → expansion.

Diagram Title: MIDD Tool and QOI Evolution Across Development

Regulatory Considerations and Implementation Framework

FDA Fit-for-Purpose Initiative Applications

Table 3: FDA FFP Initiative Examples and Applications

| Disease Area | Tool Submitter | FFP Tool | Trial Component | Issuance Date |
| --- | --- | --- | --- | --- |
| Alzheimer's disease | Coalition Against Major Diseases (CAMD) | Disease model: placebo/disease progression | Demographics, drop-out | June 12, 2013 |
| Multiple | Janssen & Novartis | Statistical method: MCP-Mod | Dose-finding | May 26, 2016 |
| Multiple | Ying Yuan, PhD (MD Anderson) | Bayesian Optimal Interval (BOIN) design | Dose-finding | December 10, 2021 |
| Multiple | Pfizer | Empirically based Bayesian Emax models | Dose-finding | August 5, 2022 |

Model Credibility and Validation Standards

Protocol 6.2.1: Model Evaluation for Regulatory Submission

Purpose: To establish model credibility and ensure fitness for proposed context of use in regulatory decision-making [24].

Materials:

  • Complete model development documentation
  • Dataset for model validation
  • Regulatory guidance documents (FDA, EMA, ICH M15)
  • Validation criteria checklist

Procedure:

  • Context of Use Definition: Clearly specify the proposed regulatory use and decision context
  • Model Verification: Ensure computer implementation matches mathematical description
  • Model Calibration: Assess model performance against training/calibration data
  • Model Validation: Evaluate predictive performance using external data
  • Uncertainty Quantification: Characterize uncertainty in model predictions
  • Documentation: Prepare comprehensive model summary following regulatory standards

Acceptance Criteria: Adequate model performance for specified COU, complete documentation, appropriate uncertainty characterization
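The model verification step, confirming that the computer implementation matches the mathematical description, can be demonstrated on the simplest possible case: a numerical integration of first-order elimination checked against its analytic solution. This is an illustrative sketch, not a regulatory verification procedure.

```python
import math

# Verify a numerical implementation against the known analytic solution of
# dC/dt = -ke * C, i.e. C(t) = C0 * exp(-ke * t).

def euler_decay(c0, ke, t_end, dt=1e-4):
    """Forward-Euler integration of first-order elimination."""
    c, t = c0, 0.0
    while t < t_end:
        c += -ke * c * dt
        t += dt
    return c

c_numeric = euler_decay(c0=10.0, ke=0.2, t_end=5.0)
c_analytic = 10.0 * math.exp(-0.2 * 5.0)
print(abs(c_numeric - c_analytic) < 1e-2)  # → True
```

The same match-against-known-solution pattern scales up: each module of a complex model is verified against cases with known behavior before the full model is calibrated and validated.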

Emerging Technologies and Future Directions

The integration of artificial intelligence (AI) and machine learning (ML) approaches represents the evolving frontier of MIDD [24]. These technologies enhance traditional modeling through improved pattern recognition in large-scale biological, chemical, and clinical datasets, particularly for target identification and ADME property prediction [24].

The FFP approach continues to evolve through regulatory harmonization efforts, including the ICH M15 general guidance, which promises to improve consistency in applying MIDD across global regulatory jurisdictions [24]. This harmonization facilitates more efficient worldwide drug development while maintaining appropriate standards for model credibility and validation.

Successful FFP implementation requires experienced multidisciplinary teams with collective insights to choose and apply the right modeling tools at the right time to support development decisions and improve outcomes for patients [24].

Building Open-Access Data Repositories for Collaborative Research

In modern forensic science, particularly for research on DNA transfer, persistence, prevalence, and recovery (TPPR), evaluating findings given activity-level propositions is crucial for interpreting evidence within a case context [26] [27]. Such evaluations rely on robust, accessible experimental data to assign probabilities for transfer events and the presence of background DNA. The forensic community recognizes that sharing TPPR data is essential to advance understanding and standardize methodologies [26] [11]. Open-access data repositories serve as the cornerstone for this collaborative effort, providing centralized platforms for preserving, sharing, and discovering datasets. These repositories facilitate the transparency and reproducibility required to strengthen the evidentiary value of forensic biology and support research that informs activity-level interpretations [28] [29]. This protocol outlines the steps for constructing and managing such repositories to maximize their utility for researchers, scientists, and drug development professionals engaged in collaborative science.

Protocol: Establishing an Open-Access Data Repository

Phase 1: Requirements Analysis and Planning

Objective: Define the repository's purpose, scope, and stakeholder needs to ensure it effectively serves the research community.

  • Step 1: Identify User Needs and Functional Requirements

    • Conduct stakeholder interviews with researchers, data curators, and end-users to understand their workflows and data sharing challenges [30].
    • Define functional requirements: core activities the repository must support (e.g., data upload, persistent identifier assignment, granular access controls, and programmatic data access via an API) [30] [28].
    • Define non-functional requirements: constraints and qualities such as data security, scalability, uptime reliability, and long-term preservation plans [30].
  • Step 2: Define Data and Usability Requirements

    • Data Requirements: Specify accepted data types, formats, volatility, size/amount, persistence, and accuracy. For TPPR research, this includes sequencing data, quantitative DNA profiles, environmental metadata, and standardized forensic reports [30] [11].
    • User Requirements: Document characteristics of the intended user group, including their technical expertise and disciplinary background [30].
    • Usability Requirements: Establish specific, measurable usability goals (e.g., time to complete a data deposit, success rate of dataset discovery) [30].
Phase 2: Repository Software Selection

Objective: Choose a software platform that aligns with the requirements identified in Phase 1, considering available resources and technical expertise.

A repository can be established using three primary models: commercial hosting, self-hosted open-source software, or free hosted platforms [30]. The table below compares common platforms used in scientific research.

Table 1: Comparison of General-Purpose Data Repository Platforms

| Platform | Type | Cost | Key Features | Best For |
| --- | --- | --- | --- | --- |
| Harvard Dataverse [28] [31] | Open source (self-hosted or via hosted service) | Free for users (up to 1 TB on Harvard's service) | Assigns dataset and file DOIs; tiered access controls; data analysis tools; 2.5 GB/file browser upload limit | Institutions seeking a feature-rich, citable, and highly customizable repository |
| Dryad [28] [31] | Curated, general-purpose repository | $120 data publishing charge [31] | Assigns DOIs; CC0 waiver required; journal-integrated peer review access | Researchers publishing data linked to journal articles, particularly in the biosciences |
| figshare [28] [31] | Commercial, hosted service | Free tier (20 GB private data); fees for large datasets (Figshare+) | Assigns DOIs to individual files; private sharing links; renders many file types in-browser | Individual researchers and institutions needing a user-friendly platform with strong visualization |
| Zenodo [31] | Open access, hosted by CERN | Free | Accepts all file types; 50 GB/dataset; integrated with OpenAIRE; allows embargoes | EU-funded researchers and those needing to capture the entire research lifecycle |
| Open Science Framework (OSF) [31] | Free, open-source research management tool | Free | Manages projects, components, and files; assigns DOIs; promotes collaboration and attribution | Research teams needing to document the entire project lifecycle, not just archive final data |

Selection Criteria:

  • Technical Resources: Organizations with in-house expertise may opt for self-hosted open-source solutions like CKAN or DKAN. Institutions lacking technical staff may prefer commercial or freely hosted services [30].
  • Community Standards: For TPPR research, also consider field-specific repositories listed in global registries like re3data.org and FAIRsharing, or those recommended by journals such as PLOS ONE and Nature [32] [29] [33].
Phase 3: Implementation and Data Curation Workflow

Objective: Configure the repository and establish a standardized workflow for data ingestion, curation, and publication.

  • Step 1: Repository Configuration

    • Install and configure the chosen software on institutional servers or establish an account with a service provider.
    • Customize the metadata schema to include fields critical for TPPR data (e.g., substrate type, contact duration, shedder status, sampling method, PCR kit used) [11].
    • Implement user authentication and define roles (e.g., administrator, curator, contributor, public user) [28].
  • Step 2: Establish a Data Curation Pipeline

    • Ingestion: Provide a template for researchers to submit data and metadata. Support both browser-based uploads and API-based submissions for large datasets [28].
    • Curation: A dedicated curator reviews submissions for completeness, checks metadata accuracy, verifies that data files are not corrupted, and ensures compliance with repository policies (e.g., no personally identifiable information) [30].
    • Publication: Upon acceptance, the repository assigns a persistent identifier (e.g., DOI), makes the dataset publicly available according to specified license terms (e.g., CC0, CC BY), and creates a permanent landing page [32] [28] [31].
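The curator's completeness check lends itself to automation before any DOI is assigned. An illustrative sketch: the required field names mirror the customized TPPR metadata schema described above but are assumptions for this example, not a real repository API.

```python
# Automated completeness check for a TPPR dataset submission.
# Field names are hypothetical, modeled on the schema in Step 1.

REQUIRED_FIELDS = {"substrate_type", "contact_duration", "shedder_status",
                   "sampling_method", "pcr_kit"}

def curate(submission: dict):
    """Return ('accepted', None) or ('returned', [missing fields])."""
    missing = REQUIRED_FIELDS - submission.keys()
    return ("accepted", None) if not missing else ("returned", sorted(missing))

record = {"substrate_type": "cotton", "sampling_method": "double-swab",
          "pcr_kit": "GlobalFiler"}
print(curate(record))  # → ('returned', ['contact_duration', 'shedder_status'])
```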

The following diagram illustrates the logical flow and decision points in the data curation workflow.

Data curation workflow: a researcher submits the dataset and metadata, which pass through technical checks (file integrity, format, size), metadata review (completeness against standards, including TPPR fields), and a policy check (licenses, ethical compliance). If any criterion fails, the dataset is returned to the researcher for revision and resubmission; if all criteria are met, the curator performs data enhancement and standardization, a DOI is assigned, and the dataset is published as publicly accessible and citable.

Application: Repository-Enabled TPPR Research

Experimental Protocol for DNA Transfer Studies

Objective: Generate quantitative data on the probability of DNA transfer from a donor to an object or body surface during a specific activity, suitable for deposition in a repository to support activity-level assessments [11].

  • Step 1: Experimental Design

    • Define activity-level propositions (e.g., "DNA was transferred through direct handling" vs. "DNA was transferred via indirect environmental contact") [27].
    • Recruit participants with varying shedder statuses, following ethical approval and informed consent procedures.
    • Standardize donor pre-treatment (e.g., hand washing) to minimize pre-existing background DNA variability [11].
  • Step 2: Controlled DNA Transfer

    • Skin-to-Skin Contact: A "mock offender" grips the forearm of a "mock victim" with consistent pressure and duration [11].
    • Skin-to-Object Contact: Participants handle standardized objects (e.g., plastic bottles, cable ties) following a predefined protocol [27].
  • Step 3: Sample Collection and Recovery

    • From Skin: Use the double-swabbing technique. Moisten a cotton swab with distilled water, rub the skin area thoroughly, then follow with a dry swab to collect residual moisture [11].
    • From Objects: Swab a defined surface area using a single wet or double-swab technique, depending on the substrate [11].
  • Step 4: Genetic Analysis and Profile Interpretation

    • Extract DNA using a validated method (e.g., silica-based extraction).
    • Amplify DNA using a standard STR multiplex kit and analyze on a capillary electrophoresis instrument.
    • Interpret profiles using probabilistic genotyping software to determine the likelihood ratio for a specific donor's contribution, considering mixture complexity and background DNA [27].
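
As a numerical illustration of the final step, the sketch below combines transfer, persistence, and background probabilities into an activity-level likelihood ratio. The function name and all probability values are illustrative placeholders, not figures from the cited studies or the output of probabilistic genotyping software.

```python
def activity_level_lr(p_transfer_hp, p_persist_hp,
                      p_transfer_hd, p_persist_hd, p_background):
    """Toy likelihood ratio for two activity-level propositions.

    P(E|Hp): DNA present via the disputed activity (transfer AND persistence),
             or already present as background.
    P(E|Hd): DNA present via the alternative activity, or background.
    All probability values are illustrative placeholders.
    """
    p_e_hp = p_transfer_hp * p_persist_hp \
        + (1 - p_transfer_hp * p_persist_hp) * p_background
    p_e_hd = p_transfer_hd * p_persist_hd \
        + (1 - p_transfer_hd * p_persist_hd) * p_background
    return p_e_hp / p_e_hd

# Hypothetical values: direct handling transfers readily, indirect contact rarely.
lr = activity_level_lr(p_transfer_hp=0.8, p_persist_hp=0.6,
                       p_transfer_hd=0.05, p_persist_hd=0.6,
                       p_background=0.02)
print(round(lr, 1))
```

In casework, these probabilities would be drawn from published TPPR datasets and evaluated in a full Bayesian network rather than chosen by hand.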
The Scientist's Toolkit: Research Reagent Solutions

Table 2: Essential Materials for DNA TPPR Experiments

Item Function Application in TPPR
Cotton or Nylon Swabs Physical collection of cellular material from surfaces. The primary tool for sample recovery from skin (double-swab technique) and objects [11].
Sterile Distilled Water Hydration medium for swabs. Dampens the first swab to enhance cell recovery during the double-swabbing process [11].
STR Multiplex Kits Simultaneous amplification of multiple Short Tandem Repeat (STR) loci. Generates DNA profiles from trace samples for donor identification and mixture deconvolution [11].
Probabilistic Genotyping Software (PGS) Statistical evaluation of complex DNA mixtures. Calculates likelihood ratios for activity-level propositions by weighing the probability of the evidence under different transfer scenarios [27].
Positive Control DNA Verified human genomic DNA. Ensures the PCR amplification and analytical processes are functioning correctly in each run.
Data Management and Repository Integration

Objective: Prepare and deposit TPPR data in a repository to ensure FAIR principles (Findable, Accessible, Interoperable, Reusable) are upheld [29].

  • Step 1: Data and Metadata Preparation

    • Data: Include raw data (electropherograms), interpreted DNA profiles, and quantitative metrics (e.g., peak heights, mixture ratios).
    • Metadata: Document all experimental conditions using a standardized schema. Essential metadata for TPPR includes:
      • Donor and recipient shedder status
      • Contact type, duration, and pressure
      • Substrate material and surface area
      • Sample collection method (e.g., swab type, elution volume)
      • DNA extraction, quantification, and amplification kits
      • PCR cycle number
      • Analysis parameters and software version [11]
  • Step 2: Data Deposition and Citation

    • Upload the dataset and metadata to the chosen repository (e.g., a discipline-specific repository or a generalist repository like Dryad or Figshare).
    • Apply an appropriate open license (e.g., CC0 or CC BY) [32].
    • Once published, the repository will issue a DOI. Cite this DOI in related research publications to create a permanent link between the article and its underlying evidence [29] [31].
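
To make the metadata requirements concrete, the sketch below encodes the Step 1 fields as a minimal completeness check before deposition. The field names and the `validate_metadata` helper are our own illustrative choices, not a repository standard.

```python
# Required TPPR metadata fields (names illustrative, mirroring Step 1 above).
REQUIRED_FIELDS = {
    "shedder_status", "contact_type", "contact_duration_s", "substrate",
    "sampling_method", "extraction_kit", "amplification_kit",
    "pcr_cycles", "software_version", "license",
}

def validate_metadata(record: dict) -> list:
    """Return the required TPPR fields missing from a submission, sorted."""
    return sorted(REQUIRED_FIELDS - record.keys())

record = {
    "shedder_status": "high",
    "contact_type": "grip",
    "contact_duration_s": 15,
    "substrate": "plastic bottle",
    "sampling_method": "double swab",
    "extraction_kit": "silica-based",
    "amplification_kit": "STR multiplex",
    "pcr_cycles": 30,
    "software_version": "1.0",
    "license": "CC0",
}
missing = validate_metadata(record)
print(missing)  # an empty list means the record is deposit-ready
```

A curator could run such a check during the metadata review stage of the workflow, before a DOI is assigned.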

The following diagram maps the lifecycle of a TPPR dataset from generation to reuse, highlighting the repository's role.

[Diagram: TPPR data lifecycle] Experimental design → data generation and analysis → data curation and metadata preparation → repository deposition → DOI assignment and public release → data discovery and reuse.

Discussion

The establishment of open-access data repositories is a critical enabler for progress in activity-level forensic research. By providing a secure, citable, and permanent archive for TPPR datasets, repositories directly address the challenges of data harmonization and standardization highlighted by the forensic community [26]. The structured protocols for repository creation and data management outlined here ensure that shared data adheres to the FAIR principles, maximizing its utility for future evidence evaluation.

The integration of detailed TPPR data into public repositories allows for the refinement of Bayesian Network models used to evaluate activity-level propositions [27]. When casework involves findings such as a common unknown DNA profile on multiple items, analysts can query repository data to assess the probability of such an event under different transfer scenarios (e.g., primary transfer from an alternate offender versus background contamination from a cohabitant) [27]. This data-driven approach moves beyond subjective opinion, providing a scientifically robust framework for interpreting complex evidence and strengthening the foundation of expert testimony in judicial proceedings.

Application Note: Transfer, Persistence, and Recovery of Textile Fibres

The evaluation of forensic fibre findings requires interpretation within the framework of activity-level propositions, where transfer, persistence, and recovery (TPR) parameters are indispensable for assigning probabilities during casework assessment [15]. Contemporary research has quantitatively characterized how fibre evidence persists on garments under realistic conditions, providing empirical data to support expert interpretation. This application note synthesizes current TPR knowledge and presents a structured protocol for applying quantitative persistence data to forensic fibre evaluation.

Quantitative Persistence Data from Recent Studies

A 2025 study examined the persistence of 175,948 fibres recovered from cotton T-shirts and polyester/cotton hoodies following controlled assault re-enactments and subsequent physical activity [34]. The data characterize how fibre count, type, colour, length, and spatial distribution change over time intervals up to 240 minutes across three activity intensity levels.

Table 1: Fibre Persistence Based on Activity Intensity and Time [34]

Activity Intensity Description of Activities 10 min 30 min 60 min 120 min 240 min
Low Sedentary activities, easy walking, sitting in transport High persistence Moderate persistence Decreasing persistence Low persistence Minimal persistence
Moderate Brisk walking, housework, gentle cycling High persistence Moderate persistence Decreasing persistence Low persistence Minimal persistence
High Running, sprinting, jogging, high exertion Moderate persistence Decreasing persistence Low persistence Minimal persistence Minimal persistence

Table 2: Fibre Characteristics Recorded in Persistence Dataset [34]

Property Categories Recorded
Fibre colour Black/grey, blue, dark blue, green, orange/brown, purple, red, turquoise, yellow, other
Fibre type Cotton, linen/flax, other vegetable, wool, hair, other animal, man-made, miscellaneous, unclassified
Continuous length Numerical value (mm)
Categorical length 1 (≤ 1.0 mm), 2 (1.1–3.0 mm), 3 (3.1–5.0 mm), 4 (> 5.0 mm)
Garment type T-shirt, hoody
Location Front/back surface, lateral position (left/middle/right), axial position (lower/middle/upper/sleeve/hood)
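
The categorical length scheme in Table 2 can be encoded directly for data recording; a minimal sketch (function name ours):

```python
def length_category(length_mm: float) -> int:
    """Map a continuous fibre length in mm to the Table 2 categorical scheme."""
    if length_mm <= 1.0:
        return 1
    if length_mm <= 3.0:
        return 2
    if length_mm <= 5.0:
        return 3
    return 4

print([length_category(x) for x in (0.5, 2.0, 4.2, 7.9)])  # [1, 2, 3, 4]
```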

Experimental Protocol: Fibre Persistence Study

Materials and Equipment
  • Garments: Yellow 100% cotton T-shirts (180 gsm) and red 80% cotton/20% polyester hoodies (310 gsm) [34]
  • Recovery materials: 48 mm adhesive tape (Scotch Tough Grip Moving Tape) and clear acetate sheets [34]
  • Examination equipment: Leica EZ4D stereoscopic zoom macroscope with integrated digital camera and Leica DM4M-FSCB comparison microscope [34]
  • Software: Leica Application Suite (LAS) with manual measurement add-on module [34]
Methodology
  • Garment Preparation: Launder and blank (clean) all garments using a lint remover and adhesive tape prior to each experimental session [34].
  • Assault Simulation: Pairs of participants (jiu-jitsu practitioners) enact a choreographed frontal assault routine, with one participant wearing a yellow T-shirt ("victim") and the other wearing a red hoody ("assailant") to facilitate fibre transfer [34].
  • Activity Phase: Following fibre transfer, participants maintain a specified activity intensity (low, moderate, or high) for predetermined durations (10, 30, 60, 120, or 240 minutes) [34].
  • Fibre Recovery: Remove and package garments after the activity period. Recover fibres using the tape-lifting method, applying adhesive tape to ten 1×1 cm grids per sampling zone [34].
  • Fibre Analysis: Examine all recovered fibres using stereoscopic microscopy. Characterize each fibre by colour, generic type, and continuous length. Categorize fibres according to the classification system in Table 2 [34].
  • Data Recording: Record fibre parameters along with experimental conditions (activity intensity, duration, garment type, and spatial location of recovery) [34].

Research Reagent Solutions: Forensic Fibre Analysis

Table 3: Essential Materials for Fibre Persistence Studies

Item Function/Application
Cotton T-shirts (100% cotton) Donor/recipient garments for transfer and persistence studies [34]
Polyester/cotton hoodies Donor/recipient garments with synthetic blend composition [34]
Adhesive tape (48 mm) Fibre recovery from garment surfaces using the tape-lifting method [34]
Acetate sheets Mounting and preservation of tape lifts for microscopic examination [34]
Leica EZ4D stereoscopic macroscope Initial examination and morphological characterization of recovered fibres [34]
Leica DM4M-FSCB comparison microscope Higher magnification analysis and detailed fibre comparison [34]
LAS software with measurement module Digital micrograph acquisition and fibre length measurement [34]

Application Note: DNA Replication Stress Assay Using Nanopore Sequencing

DNA replication stress represents a cancer hallmark exploited by chemotherapeutic agents. Current methods for assessing replication stress suffer from low throughput and poor resolution, limiting their utility in drug development [35]. A novel nanopore-based artificial intelligence assay now enables high-resolution mapping of replication fork movement, stalling, and rates genome-wide, providing distinct "replication stress signatures" for different cancer therapies [35].

Quantitative Replication Stress Parameters

The DNAscent method detects thymidine analogues (BrdU and EdU) incorporated into nascent DNA strands during replication, allowing precise measurement of fork dynamics at single-molecule resolution [35].

Table 4: Replication Fork Metrics Measured by DNAscent Assay [35]

Parameter Definition Application in Therapy Assessment
Fork speed Length of base analogue tracks divided by pulse duration Measures replication rate under therapeutic intervention
Stall score Proportional decrease in BrdU call frequency (0-1 scale) Quantifies fork stalling versus slowing; differentiates mechanism of action
False positive rate <0.004% for fork calls Ensures assay reliability and reproducibility
Throughput ~150,000 reads >20 kb, N50 ~90 kb, ~2,050 fork calls per flow cell Enables genome-wide assessment of replication dynamics
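
The fork speed and stall score definitions in Table 4 reduce to simple arithmetic; the sketch below illustrates them on hypothetical values and is not the DNAscent implementation.

```python
def fork_speed(track_length_bp: int, pulse_min: float) -> float:
    """Fork speed in bp/min: analogue track length divided by pulse duration."""
    return track_length_bp / pulse_min

def stall_score(expected_brdu_calls: float, observed_brdu_calls: float) -> float:
    """Proportional decrease in BrdU call frequency, clipped to the 0-1 scale."""
    score = 1.0 - observed_brdu_calls / expected_brdu_calls
    return max(0.0, min(1.0, score))

# Hypothetical read: a 20 kb BrdU track from a 10-minute pulse.
print(fork_speed(20_000, 10.0))   # 2000.0 bp/min
print(stall_score(100.0, 25.0))   # 0.75 — a marked stall
```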

Table 5: Replication Stress Signatures of Cancer Therapeutics [35]

Therapeutic Agent Target Effect on Fork Speed Effect on Stall Score Replication Stress Signature
Hydroxyurea (HU) dNTP synthesis Slowing Similar to untreated Fork slowing without increased stalling
ATR inhibitor (VE-821) ATR kinase Moderate slowing Marked increase Frequent fork stalling
WEE1 inhibitor (MK1775) WEE1 kinase Slowing Similar to untreated Fork slowing without increased stalling
PARP inhibitor (Olaparib) PARP1 30% increase Similar to untreated Accelerated fork progression

Experimental Protocol: DNAscent Assay for Replication Stress

Materials and Equipment
  • Cell lines: A2058 human melanoma cells and HCT116 colon cancer cells [35]
  • Thymidine analogues: EdU and BrdU for sequential pulse labeling [35]
  • Therapeutics: Hydroxyurea, ATR inhibitors (VE-821), WEE1 inhibitors (MK1775), PARP inhibitors (Olaparib) [35]
  • Sequencing platform: Oxford Nanopore MinION with R9.4.1 or R10.4.1 flow cells [35]
  • Software: DNAscent for detecting base analogues and calling replication forks [35]
Methodology
  • Cell Culture and Labeling: Pulse A2058 melanoma cells or other cancer cell lines sequentially with EdU, then BrdU, followed by thymidine chase [35].
  • Therapeutic Treatment: Expose cells to replication stress-inducing agents (HU, ATRi, WEE1i, or PARPi) at specified doses and durations [35].
  • Cell Sorting: Enrich for S-phase cells using fluorescence-activated cell sorting (FACS) [35].
  • DNA Extraction: Isolate ultra-high-molecular weight DNA while preserving epigenetic modifications [35].
  • Nanopore Sequencing: Prepare libraries and sequence on Oxford Nanopore MinION platform, targeting >150,000 reads longer than 20 kb with N50 ~90 kb [35].
  • Data Analysis: Use DNAscent software to detect BrdU/EdU incorporation, calculate fork speeds, and assign stall scores. Apply Uniform Manifold Approximation and Projection (UMAP) to visualize replication stress signatures in multidimensional space [35].

Research Reagent Solutions: DNA Replication Stress Analysis

Table 6: Essential Reagents for DNAscent Replication Stress Assay

Item Function/Application
Thymidine analogues (EdU, BrdU) Sequential pulse labeling of replicating DNA for fork tracking [35]
Hydroxyurea (HU) dNTP depletion agent inducing replication stress [35]
ATR inhibitor (VE-821) Checkpoint kinase inhibitor causing fork stalling [35]
WEE1 inhibitor (MK1775) Kinase inhibitor causing fork slowing [35]
PARP inhibitor (Olaparib) PARP1 inhibitor leading to fork acceleration [35]
Oxford Nanopore flow cells (R9.4.1/R10.4.1) Long-read sequencing platform for detecting base analogues [35]
DNAscent software AI-based analysis of replication fork dynamics and stall scores [35]

Visualization of Experimental Workflows

Forensic Fibre Analysis Workflow

DNA Replication Stress Assay Workflow

Replication Stress Signature Differentiation

Navigating Challenges: Barriers and Solutions in Global Adoption

Application Notes: Quantifying Global Research and Development Barriers

Advancing activity-level research, particularly in fields such as forensic science and therapeutic development, is hampered by interconnected global barriers. These challenges range from foundational data deficiencies to systemic methodological inconsistencies, which impede the development of robust, transferable protocols. The tables below summarize key quantitative findings from recent assessments across multiple domains.

Table 1: Systemic and Data-Related Barriers in Global Health and Development

Barrier Category Specific Finding Quantitative Measure Source / Context
Diagnostic Preparedness Concentrated manufacturing capacity Limited regional capacity, particularly in low and middle-income countries [36] Pandemic Preparedness
Diagnostic Preparedness Suboptimal financing model Unpredictable demand & lack of financial incentives [36] Pandemic Preparedness
SDG Data Timeliness Outdated indicators delay policy responses >40% of SDG indicators in OECD countries rely on outdated data [37] Sustainable Development
SDG Data Coverage Insufficient data for trend analysis >30% of targets in the "Planet" dimension lack sufficient data [37] Sustainable Development
Data Granularity Limited intersectional data Data largely absent for environmental and digital inclusion indicators disaggregated by gender [37] Sustainable Development
Data Existence Comprehensive data is unavailable For ~50% of identified data gaps, no data exists at all [38] Systems Change Monitoring

Table 2: Methodological and Evidence Barriers in Research and Development

Barrier Category Specific Finding Quantitative Measure Source / Context
Clinical Evidence Recommendations based on lower-level evidence 54.5% of recommendations in ESC clinical guidelines are based on Level of Evidence C (expert opinion) [39] Cardiovascular Medicine
Clinical Evidence Uncertainty in recommended interventions 42.6% of clinical recommendations are in Class II (area of uncertainty) [39] Cardiovascular Medicine
Digital Transformation Failure of data-driven initiatives 85% of big data projects fail to meet their objectives [40] Technology & Analytics
Data Quality Organizational data quality challenges 77% of organizations rate their data quality as "average" or worse [40] Technology & Analytics
Skills Gap Workforce unprepared for technological change 87% of organizations report existing or anticipated skills gaps within five years [40] Technology & Analytics

Experimental Protocols

The following protocols provide detailed methodologies for conducting critical experiments in activity-level research, focusing on transfer and persistence studies. These are essential for generating high-quality, reproducible data to address existing gaps.

Protocol: Evaluating DNA Transfer and Persistence on Routinely Handled Items

This protocol is designed to address activity-level questions by quantifying DNA deposition from habitual versus one-time users on forensically relevant surfaces [41].

Key Research Reagent Solutions
Item Function in Experiment
Cotton or Nylon Swabs For sample collection from item surfaces using standardized pressure and moistening techniques [41].
PowerPlex Fusion 6C System For DNA amplification and STR profiling to identify contributors and their relative proportions in a mixture [41].
Probabilistic Genotyping Software For deconvoluting complex mixed DNA profiles and calculating likelihood ratios for sub-source propositions [41].
Bayesian Network Software A computational framework for evaluating evidence given activity-level propositions, incorporating probabilities of transfer, persistence, and prevalence [42].
Detailed Methodology
  • Participant Recruitment and Ethical Compliance:

    • Recruit a cohort of participants (e.g., 25 individuals) who share a common environment but are not cohabiting or intimate partners to model realistic secondary transfer scenarios [41].
    • Obtain informed consent and collect buccal samples from all participants for reference DNA profiles.
  • Item Selection and Pre-Conditioning:

    • Select a range of porous (e.g., gloves, pencil) and non-porous (e.g., phone, tablet, knife handle) items [41].
    • Prior to the experiment, decontaminate all items using a validated cleaning method (e.g., 1% Virkon solution) to eliminate pre-existing DNA [41].
  • Habitual Use Simulation Phase:

    • Assign a "habitual user" to handle a specific item for a defined, extended period (e.g., 2 hours for a mobile phone). This establishes a baseline DNA profile [41].
  • One-Time Use Simulation Phase:

    • Following the habitual use phase, a "one-time user" handles the same item for a short, defined duration (e.g., 15 seconds) [41].
  • Sample Collection and Storage:

    • Collect samples from pre-defined areas of the item after the habitual use phase and again after the one-time use phase.
    • Use a standardized double-swab technique (moistened swab followed by a dry swab) for optimal DNA recovery [41] [11].
    • Air-dry swabs and store at room temperature until extraction.
  • DNA Profiling and Data Analysis:

    • Extract DNA using a co-extraction protocol suitable for subsequent DNA and mRNA analysis if body fluid identification is required [42].
    • Amplify DNA using a commercial STR profiling kit like the PowerPlex Fusion 6C System [41].
    • Analyze profiles to determine the total quantity of DNA recovered and the relative percentage contribution from the habitual user and the one-time user. Calculate sub-source likelihood ratios (LR) to statistically evaluate the strength of the evidence [41].

Protocol: Optimizing DNA Recovery from Human Skin Surfaces

This protocol addresses the challenge of recovering foreign DNA following skin-to-skin contact, which is critical for investigating assaults and other contact crimes [11].

Key Research Reagent Solutions
Item Function in Experiment
Viscose or Cotton Swabs The primary tool for sample collection from skin. The double-swab technique is recommended [11].
Sterile Deionized Water Used to moisten the first swab to enhance the recovery of cellular material and DNA from the skin surface [11].
mRNA Body Fluid Primers For endpoint PCR to confirm the presence and origin of specific body fluids (e.g., vaginal mucosa) alongside DNA profiling [42].
Detailed Methodology
  • Mock Scenario Design:

    • Design a controlled, ethically approved mock assault scenario. For example, a male "offender" grips the forearms of a female "victim" with bare hands for a predetermined time [11].
  • Pre-Contact Background Sampling:

    • Prior to contact, sample the specific area of the "victim's" skin that will be contacted. This controls for the prevalence of background DNA not related to the activity under investigation [11].
  • Post-Contact Sample Collection:

    • After the contact event, sample the contacted skin area after a defined persistence period (e.g., 0, 6, 12, 24 hours).
    • Apply the double-swab technique: first, vigorously rub the area with a swab moistened with sterile deionized water; second, rub the same area with a dry swab to collect any residual moisture and cells [11].
    • Air-dry and store swabs at -20°C until processing.
  • DNA and RNA Co-Extraction and Analysis:

    • Co-extract DNA and RNA from the collected samples to allow for both STR profiling and body fluid identification [42].
    • Perform DNA profiling via capillary electrophoresis and mRNA analysis via endpoint PCR using a multiplexed primer mix (e.g., for saliva, skin, and vaginal mucosa markers) [42].
    • Data interpretation should use a Bayesian network framework to evaluate the evidence given activity-level propositions, considering transfer, persistence, prevalence, and recovery (DNA-TPPR) factors [42] [11].
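
When fitting recovery data across the persistence periods sampled above, a simple decay curve is one way to interpolate persistence probabilities between time points. The sketch below assumes an exponential form with a placeholder half-life; the functional form and parameter are illustrative, not values from the cited studies.

```python
def persistence_prob(hours: float, half_life_h: float = 12.0) -> float:
    """Illustrative exponential persistence model (half-life is a placeholder)."""
    return 0.5 ** (hours / half_life_h)

# Probability that transferred DNA is still recoverable at each sampling point.
for t in (0, 6, 12, 24):
    print(t, round(persistence_prob(t), 3))
```

Fitted curves of this kind can feed the persistence nodes of a Bayesian network used for activity-level evaluation.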

Workflow Visualizations

DNA Transfer & Persistence Research Workflow

[Diagram: research workflow] Study design and participant recruitment → item pre-conditioning (decontamination) → habitual use phase (extended handling) → sample collection 1 (double-swab technique) → one-time use phase (short handling) → sample collection 2 (double-swab technique) → laboratory analysis (DNA/RNA co-extraction, STR/mRNA profiling) → data interpretation (Bayesian network for activity-level propositions).

Interconnected Global Research Barriers

[Diagram: barrier map] Core methodological reticence links four barrier clusters: evidence gaps (>50% of guideline recommendations at Level of Evidence C), data deficiencies (timeliness, granularity, existence), technical and skills gaps (85% big data project failure), and systemic and funding barriers (fragmented regulation, unpredictable demand). Evidence gaps exacerbate data deficiencies, which in turn prevent the resolution of those gaps; skills gaps leave data deficiencies unaddressed; and systemic barriers perpetuate the skills gaps.

In activity-level research, particularly in forensic science and drug development, assigning robust probabilities often hinges on interpreting complex trace evidence from scenarios where empirical data are inherently scarce. Whether evaluating the likelihood of DNA transfer and persistence or predicting drug-induced liver injury, a fundamental challenge is building reliable probabilistic models when failure or positive-event instances are rare. Data scarcity and the resulting severe class imbalance can produce models that fail to generalize, compromising their utility for critical decision-making. This document outlines application notes and protocols for overcoming these hurdles, focusing on synthetic data generation and specialized modeling techniques that yield the robust probability assignments essential for transfer and persistence considerations.

The following table summarizes the core strategies, their underlying principles, and reported performance gains for addressing data scarcity in probabilistic modeling.

Table 1: Strategies for Overcoming Data Scarcity in Probabilistic Modeling

Strategy Core Principle Reported Efficacy/Performance Primary Application Context
Synthetic Data Generation (GANs) [43] [44] A Generator creates synthetic data to fool a Discriminator; adversarial training produces realistic data. ML models trained on GAN-generated data achieved accuracies of: ANN (88.98%), RF (74.15%), DT (73.82%), KNN (74.02%), XGBoost (73.93%) [43]. Predictive maintenance, image data, and any domain with complex, high-dimensional data distributions [43] [44].
Failure Horizon Creation [43] Labels the last 'n' observations before a failure event as "failure" to ameliorate class imbalance in run-to-failure data. Converts a single failure point per run into 'n' failure instances, significantly increasing the minority class size for model training [43]. Time-series and temporal data where events are preceded by a degradation signal (e.g., condition monitoring, RUL prediction) [43].
Transfer Learning [44] Leverages knowledge (e.g., features, model weights) from a data-rich source domain to a data-poor target domain. Enables high performance on small-data tasks by utilizing features learned from large, open-source datasets (e.g., in computer vision, speech) [44]. Computer vision, natural language processing, and audio classification where pre-trained models are available [44].
Problem Reduction [44] Transforms a problem from a data-scarce domain into an equivalent problem in a data-rich domain. Example: Converting an audio classification problem (small data) into an image classification problem (larger models available) via spectrograms [44]. Cross-domain applications where data formats can be converted (e.g., audio to image, text to numerical vectors).
Probabilistic Programming [45] [46] A declarative paradigm for specifying probabilistic models, separating model definition from inference. Allows for the construction of complex Bayesian models that explicitly incorporate prior knowledge, mitigating the need for vast datasets [45]. Bayesian cognitive science, pharmaceutical safety prediction (e.g., drug-induced liver injury), economics [45].

Experimental Protocols

Protocol 1: Generating Synthetic Data Using Generative Adversarial Networks (GANs)

Application: This protocol is designed to augment a scarce dataset, particularly for building probabilistic classifiers in activity-level research where real failure events are rare [43].

Materials:

  • Hardware: A machine with a powerful GPU (e.g., NVIDIA RTX series) is highly recommended for accelerated training.
  • Software: Python with deep learning frameworks such as TensorFlow/Keras or PyTorch.
  • Data: The original, scarce dataset (e.g., a condition monitoring dataset with 228,416 healthy observations and only 8 failure observations [43]).

Methodology:

  • Data Preprocessing: Clean and normalize the source data. For sensor data, apply min-max scaling to maintain consistency [43]. Structure the data into the format required for the model (e.g., time-series windows, flattened feature vectors).
  • Model Architecture Definition:
    • Generator (G): Design a neural network that takes a random noise vector as input and outputs a synthetic data sample. The architecture typically consists of dense and/or convolutional transpose layers.
    • Discriminator (D): Design a binary classifier network (real vs. fake) that takes a data sample as input. The architecture typically uses dense and/or convolutional layers, ending with a single sigmoid output.
  • Adversarial Training Loop: Train the G and D concurrently in a mini-max game [43].
    • In each training iteration:
      a. Train the Discriminator with a batch of real data (label: 1) and a batch of fake data from the Generator (label: 0).
      b. Train the Generator to fool the Discriminator: freeze D's weights and pass a batch of noise through G; the loss is based on D's incorrect classification of the fake data as real (label: 1).
  • Synthetic Data Generation: Once training converges (i.e., the Discriminator can no longer reliably distinguish real from fake data), use the trained Generator to produce the required volume of synthetic data.
  • Validation: Assess the quality of the synthetic data by training a downstream machine learning model (e.g., an Artificial Neural Network) on a combined dataset of real and synthetic data and evaluating its performance on a held-out test set of real data [43].
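
The adversarial loop above can be sketched at its smallest possible scale: a 1-D linear generator against a logistic discriminator, with the gradients derived by hand. This illustrates the mini-max training dynamic only; it is not the network architecture from [43], and the data distribution and hyperparameters are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Real data: a scarce 1-D "healthy sensor reading" distribution (illustrative).
def sample_real(n):
    return rng.normal(4.0, 1.25, n)

a, b = 1.0, 0.0   # generator G(z) = a*z + b
w, c = 0.1, 0.0   # discriminator D(x) = sigmoid(w*x + c)
lr, batch = 0.01, 64

for step in range(2000):
    # --- Train D on real (label 1) and fake (label 0) samples ---
    real = sample_real(batch)
    z = rng.uniform(-1, 1, batch)
    fake = a * z + b
    s_real, s_fake = sigmoid(w * real + c), sigmoid(w * fake + c)
    grad_w = np.mean(-(1 - s_real) * real + s_fake * fake)
    grad_c = np.mean(-(1 - s_real) + s_fake)
    w -= lr * grad_w
    c -= lr * grad_c
    # --- Train G to fool D (push D(fake) toward 1), with D frozen ---
    z = rng.uniform(-1, 1, batch)
    fake = a * z + b
    s_fake = sigmoid(w * fake + c)
    dx = -(1 - s_fake) * w           # dLoss/dfake for loss -log D(fake)
    a -= lr * np.mean(dx * z)
    b -= lr * np.mean(dx)

# After training, the generator supplies synthetic observations.
synthetic = a * rng.uniform(-1, 1, 1000) + b
print(round(float(np.mean(synthetic)), 2))
```

In practice the generator and discriminator are deep networks trained with a framework's autograd, and convergence is judged by the Validation step above rather than by inspecting the synthetic mean.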

Protocol 2: Creating Failure Horizons for Imbalanced Temporal Data

Application: This protocol addresses extreme class imbalance in run-to-failure or temporal datasets, where a component is healthy until its final time point [43].

Materials:

  • A run-to-failure dataset with a temporal index and a binary health status label (e.g., healthy, failure).

Methodology:

  • Data Labeling: Identify the precise time point of failure for each unique run in the dataset.
  • Horizon Definition: Define the horizon length n, which represents the number of time steps prior to a failure that are indicative of an impending failure.
  • Relabeling: For each run, reclassify the last n observations before the failure event from "healthy" to "failure".
  • Model Training: Use the newly labeled dataset, which now has a substantially larger number of "failure" instances, to train a probabilistic classifier. This helps the model learn the pre-failure patterns rather than just the final failure state [43].
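
The relabeling step reduces to a few lines; a minimal sketch (function name ours), shown on a single run:

```python
def apply_failure_horizon(labels, n):
    """Relabel the last n pre-failure observations of a run as 'failure'.

    `labels` is one run's time-ordered label sequence ending at the
    failure event.
    """
    labels = list(labels)
    failure_idx = labels.index("failure")   # the single failure point
    start = max(0, failure_idx - n)
    for i in range(start, failure_idx):
        labels[i] = "failure"
    return labels

run = ["healthy"] * 8 + ["failure"]
print(apply_failure_horizon(run, 3))  # last 3 healthy points become failures
```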

Protocol 3: Bayesian Modeling via Probabilistic Programming

Application: To build robust probability assignments by explicitly incorporating prior knowledge and domain expertise into a model, thus reducing dependence on large volumes of data alone [45].

Materials:

  • Software: A Probabilistic Programming Language (PPL) such as Turing.jl (Julia), PyMC (Python), or Stan (C++/R/Python) [45].
  • Data: The available (scarce) empirical data.
  • Prior Information: Domain knowledge from literature, expert opinion, or related datasets to inform prior distributions.

Methodology:

  • Model Specification: Declaratively define the probabilistic model in the PPL. This includes:
    • Specifying the prior distributions for model parameters based on domain knowledge.
    • Defining the likelihood function that describes how the observed data was generated.
    • Establishing the relationship between latent variables and observed data.
  • Automated Inference: Instruct the PPL to perform Bayesian inference on the model. The PPL automatically computes the posterior distribution of the model parameters given the data. Common inference methods include Markov Chain Monte Carlo (MCMC) or Hamiltonian Monte Carlo (HMC) [45] [46].
  • Model Evaluation and Use: Check inference diagnostics (e.g., convergence statistics) and validate the model on held-out data. The resulting posterior distributions can be used for robust probability assignment for new, unseen data.
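
As a stand-in for the full PPL workflow, the sketch below uses a conjugate Beta-Binomial model, where the posterior has a closed form and no MCMC is needed; the prior and data values are illustrative. A real PPL automates inference for models without closed-form posteriors.

```python
# Conjugate Beta-Binomial stand-in for the PPL workflow: the prior encodes
# domain knowledge, the likelihood the scarce data.
prior_alpha, prior_beta = 8, 2   # illustrative prior: event believed likely
successes, trials = 3, 5         # the scarce empirical data

post_alpha = prior_alpha + successes
post_beta = prior_beta + (trials - successes)
post_mean = post_alpha / (post_alpha + post_beta)

print(post_alpha, post_beta, round(post_mean, 3))  # 11 4 0.733
```

Note how the posterior mean sits between the prior mean (0.8) and the observed rate (0.6): with scarce data, the prior keeps the estimate from swinging to the sample frequency alone.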

The Scientist's Toolkit: Key Research Reagent Solutions

Table 2: Essential Tools for Data Scarcity and Probabilistic Modeling Research

| Item / Technology | Function / Explanation |
| --- | --- |
| Generative Adversarial Network (GAN) [43] [44] | A framework for generating high-quality synthetic data that mirrors the statistical properties of the original, scarce dataset. |
| Probabilistic Programming Language (PPL) [45] | A programming language (e.g., Turing.jl, Stan) that allows scientists to specify complex probabilistic models declaratively, with inference performed automatically. |
| Long Short-Term Memory (LSTM) Network [43] | A type of recurrent neural network specialized for learning from temporal sequences, used to extract features from time-series data before final classification. |
| SMOTE / M-SMOTE [44] | A synthetic oversampling technique that generates new examples for the minority class in the feature space, as an alternative to GANs for tabular data. |
| Pre-trained Models (for Transfer Learning) [44] | Models (e.g., BERT, ResNet) previously trained on large benchmark datasets, which can be fine-tuned on a small, specific target dataset. |
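The core SMOTE idea listed in Table 2, generating a synthetic minority example by interpolating between a minority point and one of its nearest neighbors, can be sketched as follows. Neighbor selection is omitted for brevity, and the points are illustrative, so this is a sketch of the technique rather than a drop-in implementation.

```python
import random

def smote_sample(x, neighbor, rng=random):
    """Generate one synthetic minority example on the line segment
    between a minority point x and one of its nearest neighbors."""
    gap = rng.random()  # single uniform draw in [0, 1), shared by all features
    return [xi + gap * (ni - xi) for xi, ni in zip(x, neighbor)]

rng = random.Random(0)
synthetic = smote_sample([1.0, 2.0], [3.0, 6.0], rng)
print(synthetic)  # lies between the two points on every coordinate
```

Because one shared interpolation factor is used for all features, the synthetic point lies on the straight line between the two originals, which is what keeps SMOTE samples inside the minority region of feature space.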

Workflow Visualizations

GAN Synthetic Data Generation

(Diagram) Random noise is fed to the Generator (G), which produces fake data; the Discriminator (D) receives both real samples and fake samples and outputs a real/fake judgment for each.

Failure Horizon Creation

(Diagram) A run begins with healthy operation (labeled Healthy); the final n observations before the failure event fall inside the failure horizon and are relabeled as Failure.
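The failure-horizon relabeling used in the protocol above can be sketched in a few lines; the horizon length n and the label strings are illustrative assumptions.

```python
def apply_failure_horizon(labels, n):
    """Relabel the n records immediately preceding each 'failure'
    record as 'failure', so a classifier learns pre-failure patterns
    rather than only the final failure state."""
    out = list(labels)
    for i, lab in enumerate(labels):
        if lab == "failure":
            for j in range(max(0, i - n), i):
                out[j] = "failure"
    return out

run = ["healthy"] * 8 + ["failure"]
print(apply_failure_horizon(run, 3))  # last 3 healthy records relabeled
```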

Probabilistic Programming Inference

(Diagram) Prior distributions encoding domain knowledge feed the model specification in the PPL; together with the observed data, automated inference (e.g., MCMC, HMC) produces posterior distributions used for robust probability assignment.

Addressing Regional Differences in Regulatory Frameworks and Training

In the evolving landscape of global drug development and forensic investigation, understanding and addressing regional differences in regulatory frameworks and training is paramount. This document provides detailed application notes and protocols framed within the broader context of transfer and persistence considerations in activity-level research. For drug development professionals, navigating divergent regulatory pathways across regions is essential for efficient global market access, while forensic scientists must account for how these frameworks influence the interpretation of transfer evidence in legal contexts. The following sections synthesize current regulatory trends and experimental approaches to facilitate standardized methodologies across international boundaries, enabling professionals to operate effectively within this complex global environment.

Global Regulatory Landscape: Quantitative Analysis

Table 1: Regional Adoption Rates of Key Sustainability Reporting Frameworks (2022-2025) [47]

| Region | Framework | 2022 Adoption | 2024 Adoption | 2025 Adoption | Key Regulatory Influences |
| --- | --- | --- | --- | --- | --- |
| Americas | TCFD | 27% | - | 35% | SEC climate rule withdrawal; California mandates taking effect 2026 |
| Americas | SASB | 37% | - | 41% | Industry-specific focus driven by investor expectations |
| Americas | GRI | 27% | - | 29% | Comprehensive coverage for broad stakeholder disclosure |
| EMEA | TCFD | - | - | 56% | ESRS E1 fully incorporates TCFD's 11 recommended disclosures |
| EMEA | SASB | 19% | - | 15% | ESRS addresses many SASB topics, reducing its incremental value |
| EMEA | GRI | - | 55% | 37% | ESRS comprehensive scope makes GRI redundant for many companies |
| Asia Pacific | TCFD | - | - | 63% | Embedded in mandatory disclosure rules (Japan, Hong Kong, Australia) |
| Asia Pacific | SASB | 18% | - | 22% | Supported by integration into ISSB framework |
| Asia Pacific | GRI | - | 54% | 53% | Complements climate-focused frameworks for broader stakeholders |

Table 2: Regulatory Policy Implementation Across OECD Countries [48]

| Regulatory Policy Aspect | Implementation Rate | Key Challenges |
| --- | --- | --- |
| Systematic stakeholder engagement when making regulations | 82% | Providing feedback post-consultation remains challenging (only one-third provide feedback) |
| Consideration of agile and flexible design options | 41% | Keeping pace with technological innovation |
| Systematic consideration of cross-country regulatory impacts | 30% | Insufficient international coordination |
| Risk-based approaches to monitoring and enforcement | - | Saving time and resources while improving outcomes |

Regional Regulatory Frameworks in Pharmaceutical Development

Comparative Analysis of Major Regions

Table 3: Innovative Drug Classification Across Major Regulatory Jurisdictions [49] [50]

| Jurisdiction | Regulatory Body | Definition of Innovative Drugs | Classification System | Key Regulatory Initiatives |
| --- | --- | --- | --- | --- |
| China | NMPA | "Drugs not yet introduced to the global market" (shift from "novel to China" to "novel to the world") | Chemical drugs: 5 categories; biologics: 3 classes; TCM: 4 classes (Category 1 = innovative) | "Major New Drug Development" Project; ICH guideline adoption; special approval for innovative medical devices |
| United States | FDA | New Molecular Entities (NMEs) + Biologics License Application (BLA) products | NDA pathway for NMEs; BLA for biologics | Breakthrough Therapy Designation; Accelerated Approval; Project Orbis for collaborative reviews |
| European Union | EMA | "Medicine containing an active substance or combination not previously authorized" | Assessed through potential therapeutic benefits and clinical significance | Regulatory sandboxes; EMA harmonization across member states; focus on unmet medical needs |

Regulatory Harmonization Initiatives

Globally, regulatory harmonization is increasingly critical for addressing regional differences. Three key mechanisms have emerged as tools for creating more aligned approaches: [49]

  • Regulatory Reliance: Where a national regulatory authority gives significant weight to assessments performed by another trusted authority
  • Harmonization: Process of developing uniform technical guidelines across participating authorities
  • Convergence: Process whereby regulatory requirements across countries become more aligned over time

The International Council for Harmonisation of Technical Requirements for Pharmaceuticals for Human Use (ICH) and the International Coalition of Medicines Regulatory Authorities (ICMRA) serve as primary vehicles for these alignment initiatives. Regional harmonization efforts such as the East African Community Medicines Registration Harmonization (EAC-MRH) initiative and the ECOWAS Medicines Regulatory Harmonization initiative in West Africa demonstrate the global scope of these efforts. [49]

Regulatory Sandboxes for Innovative Therapies

Regulatory sandboxes have emerged as innovative mechanisms to facilitate development and approval of new technologies where established pathways might not be fit for purpose. These controlled environments allow firms to test innovations under regulatory supervision, particularly promising for rare disease therapies and advanced therapeutic medicinal products. [49] This approach represents a significant shift from traditional "set and forget" rule-making toward adaptive regulatory frameworks that can accommodate rapid technological changes while maintaining safety standards. [48]

Experimental Protocols for Transfer Persistence Research

Handshake Transfer and Persistence Protocol

Objective: To determine the detectability and persistence of non-self DNA acquired during handshake events and transferred to surfaces after various activities and time intervals. [51]

Materials and Equipment:

  • DNA-free glass plates (140 × 220 mm)
  • Quantitative PCR systems for DNA quantification
  • Standard DNA profiling kits and reagents
  • Sterile swabs for sample collection
  • Personal protective equipment to prevent contamination

Experimental Procedure: [51]

  • Participant Pairing: On twelve separate occasions, pair different male and/or female volunteers who do not share living or working spaces
  • Handshake Event: Perform a firm handshake with right hands for 10 seconds
  • Group A - Immediate Transfer:
    • Each participant immediately deposits the right hand onto a separate DNA-free glass plate (P1)
    • Apply pressure across the entire hand for 10 seconds
  • Group B - Delayed Transfer:
    • Participants return to daily activities for 15 minutes post-handshake
    • Then deposit the right hand onto a separate DNA-free glass plate (P1)
  • Sequential Sampling: All participants subsequently touch additional glass plates (P2-5) at regular intervals
  • Sample Processing:
    • Collect samples using standard forensic swabbing techniques
    • Extract DNA according to standardized protocols
    • Quantify DNA yield using quantitative PCR
    • Generate DNA profiles using standard amplification and electrophoresis procedures

Data Analysis:

  • Assess proportions of contributing components to mixture profiles
  • Calculate likelihood ratios of known contributors
  • Analyze unique allele and peak height contributions for each donor
  • Evaluate impact of activities performed prior to and post-handshake
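The likelihood-ratio step in the data analysis above follows directly from its defining formula, LR = P(evidence | Hp) / P(evidence | Hd). The sketch below illustrates only that arithmetic; the probabilities are invented placeholders, not casework values.

```python
def likelihood_ratio(p_given_hp, p_given_hd):
    """LR = P(evidence | Hp) / P(evidence | Hd): how much more probable
    the findings are under the prosecution proposition than the defense
    proposition."""
    return p_given_hp / p_given_hd

# Illustrative values only: probability of observing the mixture profile
# if the known donor contributed (Hp) vs. did not contribute (Hd).
lr = likelihood_ratio(0.8, 0.001)
print(lr)  # LR > 1 lends support to Hp over Hd
```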

(Diagram) Participant pairing is followed by a 10-second handshake event. Group A (immediate transfer) deposits the right hand on glass plate P1 straight away; Group B (delayed transfer) performs 15 minutes of daily activities before depositing on P1. Both groups then touch plates P2-P5 in sequence, after which DNA analysis and profile generation take place.

Shedder Status Classification Protocol

Objective: To classify individuals as high, intermediate, or low shedders based on their propensity to deposit DNA through touch. [52]

Materials and Equipment:

  • DNA-free plastic tubes or similar substrates
  • Quantitative PCR systems
  • Standard DNA extraction and profiling kits
  • Sterile swabs and collection materials
  • Environmental controls to monitor contamination

Experimental Procedure: [52]

  • Participant Selection: Recruit participants representing both sexes and various age groups
  • Control Sample Collection: Collect reference DNA samples from all participants
  • Direct Deposit Test:
    • Participants handle DNA-free plastic tubes for standardized duration (e.g., 10 seconds)
    • Collect samples from handled surfaces using standardized swabbing techniques
  • DNA Analysis:
    • Extract DNA using standardized forensic methods
    • Quantify total DNA yield
    • Generate DNA profiles and calculate proportion of self-DNA deposited
  • Classification Criteria:
    • High Shedder: Deposits sufficient DNA for full profile in majority of tests
    • Intermediate Shedder: Deposits partial DNA profiles inconsistently
    • Low Shedder: Rarely deposits detectable DNA or only minimal amounts

Variables to Consider:

  • Time since last hand washing
  • Skin condition and moisture
  • Environmental factors (temperature, humidity)
  • Recent activities affecting skin cell turnover
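The three-way classification criteria above can be expressed as a simple decision rule. The numeric thresholds below are hypothetical illustrations, not values taken from the cited protocol [52].

```python
def classify_shedder(full_profile_rate, partial_profile_rate):
    """Classify shedder status from the fraction of repeat touch tests
    yielding full vs. partial DNA profiles. Thresholds are illustrative
    assumptions, not published cut-offs."""
    if full_profile_rate > 0.5:          # full profile in majority of tests
        return "high"
    if full_profile_rate + partial_profile_rate > 0.5:  # inconsistent partials
        return "intermediate"
    return "low"                          # rarely deposits detectable DNA

print(classify_shedder(0.7, 0.2))  # "high"
print(classify_shedder(0.2, 0.5))  # "intermediate"
print(classify_shedder(0.1, 0.1))  # "low"
```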

The Scientist's Toolkit: Research Reagent Solutions

Table 4: Essential Research Materials for Transfer Persistence Studies [51] [52]

| Item | Function | Application Notes |
| --- | --- | --- |
| DNA-free glass plates | Standardized substrate for touch deposits | 140 × 220 mm size; ensures consistent surface area for comparison between experiments |
| Quantitative PCR systems | DNA quantification | Essential for determining total DNA yield from collected samples; enables comparison between shedders |
| Standard forensic swabs | Sample collection from surfaces | Maintain consistency in collection pressure, pattern, and moisture application across all samples |
| DNA extraction kits | Isolation of DNA from collected samples | Use standardized protocols to minimize variability in extraction efficiency |
| STR amplification kits | DNA profile generation | Standard commercial kits ensure results are comparable to forensic casework |
| Personal protective equipment | Contamination prevention | Critical for maintaining integrity of low-template DNA samples in transfer studies |
| Environmental monitoring controls | Detection of background DNA | Assess cleanliness of experimental environment and equipment |

Signaling Pathways and Conceptual Framework

(Diagram) Two linked clusters are shown. In the regulatory context, regional regulatory frameworks govern research standards and drive harmonization initiatives toward global alignment. In the transfer-and-persistence cluster, DNA transfer mechanisms influence persistence factors, which in turn inform activity-level interpretation; harmonization initiatives support standardized interpretation of those findings.

Implementation Considerations for Cross-Regional Research

Addressing Regional Regulatory Divergence

The research highlights significant regional variations in regulatory frameworks that directly impact transfer persistence research. In Europe, the European Sustainability Reporting Standards (ESRS) are reshaping disclosure practices, while in the Americas, voluntary frameworks continue to gain ground despite the SEC's withdrawal of federal climate disclosure rules. [47] Asia Pacific demonstrates the highest TCFD adoption rates at 63%, reflecting regional regulatory mandates. [47] These differences necessitate tailored approaches when designing multi-jurisdictional research protocols.

For drug development professionals, understanding these regional frameworks is essential for designing global clinical trials and regulatory submissions. The emergence of regulatory sandboxes presents particular promise for innovative therapies that may not fit conventional approval pathways. [49] Similarly, forensic scientists must recognize how regional differences in legal standards of evidence may affect the admissibility of transfer persistence research in different jurisdictions.

Standardization Protocols for Multi-Center Studies

Based on the experimental protocols and regulatory analysis, the following standardization approaches are recommended:

  • Implement Cross-Jurisdictional Calibration: Regular inter-laboratory comparisons using standardized reference materials across research centers in different regions
  • Adopt Harmonized Terminology: Develop consistent definitions for key concepts such as "shedder status," "transfer level," and "persistence threshold" across jurisdictions
  • Establish Regional Reference Databases: Create region-specific baseline data accounting for environmental, genetic, and behavioral factors influencing transfer mechanisms
  • Integrate Regulatory Science Principles: Apply regulatory science methodologies to validate new transfer persistence assays across different regulatory environments

These approaches will enhance the reliability and admissibility of transfer persistence research across different regulatory frameworks while maintaining scientific rigor.

The efficacy of analytical results in both forensic science and pharmaceutical manufacturing is fundamentally dependent on the initial sample recovery process. Inadequate recovery techniques can detrimentally impact the entire analytical chain, leading to potential false negatives, inaccurate quantification, or misinterpretation of evidence. Within the context of activity level research, which evaluates how DNA or chemical residues are transferred, persist, and are recovered from surfaces, the choice of recovery method is not merely a preliminary step but a critical variable that shapes all subsequent interpretations [53]. This application note provides a detailed comparison of swabbing, tape-lifting, and other recovery techniques, supported by quantitative data and standardized protocols, to guide researchers and scientists in selecting and optimizing recovery methods for robust and reliable results.

Comparative Efficacy of Recovery Techniques

The selection of a recovery method must be tailored to the specific experimental context, including the nature of the residue (e.g., DNA, active pharmaceutical ingredients), the surface properties, and the analytical goals. The following sections and comparative tables summarize key findings from forensic and pharmaceutical studies.

Table 1: Comparison of DNA Recovery Methods from Different Surfaces

| Recovery Method | Surface Type | Relative Efficacy | Key Findings and Considerations |
| --- | --- | --- | --- |
| Single Swab | Various (porous & non-porous) | High | Demonstrates higher efficiency in recovering DNA across a wide variety of experimental settings [54]. |
| Double-Swab | Various (porous & non-porous) | Variable | Does not consistently improve recovery rates compared to the single-swab method [54]. |
| Cutting-Out | Cotton, paper | Higher | Results in higher DNA recovery from cotton and paper surfaces [55]. |
| Cutting-Out | Cardboard | Lower | Less efficient for cardboard surfaces compared to other methods [55]. |
| Tape-Lifting | Non-porous (e.g., glass, plastic) | Moderate | Quick and straightforward; tapes with better adhesion can produce higher yields than swabbing, but rigidity and size can complicate analysis [54]. |

Table 2: Key Considerations in Pharmaceutical Cleaning Validation Swab Recovery

| Parameter | Best Practice | Common Pitfalls |
| --- | --- | --- |
| Spike Level | 125%, 100%, and 50% of the Acceptable Residue Limit (ARL), extending down to the Limit of Quantitation (LOQ) [56]. | Using a range that is too narrow or not centered on the ARL. |
| Recovery Factor | Use the average of the triplicate recovery data set. A minimum recovery of 70% is typically targeted [56]. | Using the single lowest recovery value, which can lead to unnecessary cleaning failures. |
| Swab Area | A standard 5 cm × 5 cm (25 cm²) area is recommended as representative and practical [56]. | Using a sample size that is too large (e.g., 100 cm²) for confined or complex equipment geometries. |
| Material of Construction (MOC) | Perform recovery studies on all product-contact MOCs. Stainless steel can be used for initial studies, with porous materials requiring special attention [56]. | Neglecting to test "worst-case" materials like gaskets, which can trap residues. |

Detailed Experimental Protocols

Protocol 1: Swab Recovery for Cleaning Validation

This protocol outlines the procedure for determining swab recovery efficiency of an Active Pharmaceutical Ingredient (API) from equipment surfaces, a cornerstone of cleaning validation in pharmaceutical quality control [56] [57].

  • Objective: To validate that the swabbing technique can recover a specified percentage (typically ≥70%) of a target residue from a specific surface material.
  • Materials:
    • Coupons: Material of construction (MOC) coupons (e.g., 316L stainless steel, glass, PTFE).
    • Swabs: Pre-defined swab type (e.g., polyester, cotton).
    • Solvent: Appropriate solvent for the residue (e.g., acetonitrile, acetone, buffer).
    • API Solution: Standardized solution of the target analyte.
    • Micropipettes: Calibrated micropipettes for accurate solution application.
    • Sample Vials: Clean vials for swab extraction.
    • Analytical Instrumentation: HPLC-UV, LC-MS/MS, or other validated method for quantification.
  • Procedure:
    • Coupon Preparation: Clean all coupons thoroughly with solvent and verify they are free of interference.
    • Spiking: Apply a known volume of the API solution (e.g., 100 µL) onto the coupon surface to achieve target concentrations (e.g., 50%, 100%, 125% of ARL). Allow the solvent to evaporate completely under controlled conditions.
    • Swabbing: a. Moisten the swab with the designated solvent. b. Firmly swab the entire spiked area (e.g., 25 cm²) using horizontal strokes, rolling the swab to use all sides. c. Repeat with a second dry swab if using a double-swab technique.
    • Extraction: Place the swab head into a sample vial containing a precise volume of extraction solvent. Shake or sonicate for a defined period (e.g., 10-15 minutes) to extract the residue.
    • Analysis: Analyze the extract using the validated analytical method. Also, analyze blank swabs and standard solutions of known concentration.
    • Calculation:
      • % Recovery = (Amount of API recovered from coupon / Theoretical amount of API spiked on coupon) × 100
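The recovery calculation in the final step is a one-line formula; the sketch below applies it with invented example masses to show how the ≥70% acceptance target from this protocol would be checked.

```python
def percent_recovery(recovered_amount, spiked_amount):
    """% Recovery = (amount of API recovered from coupon /
    theoretical amount of API spiked on coupon) x 100."""
    return recovered_amount / spiked_amount * 100.0

# Illustrative masses (same units, e.g., micrograms): 7.5 recovered
# from a 10.0 spike.
result = percent_recovery(7.5, 10.0)
print(result)            # 75.0
print(result >= 70.0)    # True: meets the typical >=70% recovery target
```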

Protocol 2: Touch DNA Collection Using Tape-Lifting

This protocol describes the use of adhesive tapes for collecting trace DNA evidence from non-porous surfaces at crime scenes [54].

  • Objective: To collect and recover cellular material from a defined surface area for subsequent DNA profiling.
  • Materials:
    • Adhesive Tape: Forensic-grade tape of a specified size and adhesion.
    • Scalpel or Scissors: For cutting the tape.
    • DNA-Free Paper or Plastic Sheets: For mounting the tape after collection.
    • Personal Protective Equipment (PPE): Gloves, mask, and hairnet to prevent contamination.
  • Procedure:
    • Surface Examination: Document and photograph the surface before sampling.
    • Tape Preparation: Cut a piece of tape large enough to cover the area of interest.
    • Collection: a. Firmly press the adhesive side of the tape onto the surface. b. Apply even pressure across the entire tape surface. c. Slowly peel the tape back from one corner, avoiding lateral movement.
    • Mounting: Carefully place the tape, adhesive-side down, onto a clean paper or plastic sheet. Avoid creating air bubbles.
    • Storage and Transport: Seal the sheet in an appropriate evidence bag and store under appropriate conditions until DNA extraction.
    • DNA Extraction: In the laboratory, the tape can be placed directly into a tube for digestion, or the residue can be washed off for processing. Standard DNA extraction, quantification, and amplification protocols follow.

The Scientist's Toolkit: Essential Research Reagents and Materials

Table 3: Key Materials for Recovery Studies

| Item | Function / Application |
| --- | --- |
| Polyester Swabs | Surface sampling for both pharmaceutical residues and forensic DNA; selected for strength and low residue retention [57]. |
| Adhesive Tape (Forensic Grade) | Lifting of cellular material and micro-debris from non-porous surfaces for DNA analysis [54]. |
| Acetonitrile & Acetone | Organic solvents used for dissolving and recovering poorly water-soluble APIs (e.g., Oxcarbazepine) from equipment surfaces during cleaning validation [57]. |
| Stainless Steel Coupons | Representative surfaces for recovery studies in pharmaceutical manufacturing; used to simulate equipment contact surfaces [56]. |
| Phosphate-Free Alkaline Detergent | Used in manual and automated cleaning processes to remove residues without introducing interfering phosphates [57]. |

Workflow and Decision Pathways

Sample Recovery Selection Workflow

The following diagram illustrates the logical decision process for selecting an appropriate sample recovery method based on the sample context.

(Diagram) The decision starts by defining the recovery goal, then branches on sample type. For forensic DNA on a surface: if the surface is porous (e.g., cotton, paper), use the cutting-out method; if non-porous and the primary goal is DNA profiling, use tape-lifting; otherwise (e.g., for quantification), use the single-swab method. For pharmaceutical residue: if the equipment geometry is complex (e.g., pipes), consider rinse sampling; otherwise use the single-swab method.

Optimizing recovery is a multifaceted challenge that requires a scientific and evidence-based approach. No single method is universally superior; the choice between swabbing, tape-lifting, cutting-out, or rinsing depends on a clear understanding of the residue, surface, and analytical endpoint. The quantitative data and standardized protocols provided here serve as a foundation for researchers to make informed decisions, thereby enhancing the accuracy and reliability of their work in activity level research, cleaning validation, and forensic investigation. Consistent and well-validated recovery processes are indispensable for generating data that can withstand scientific and regulatory scrutiny.

Mitigating Uncertainty in Interpretation of Mixed and Trace Evidence

Quantitative Data on Factors Influencing DNA-TPPR

The interpretation of mixed and trace DNA evidence is governed by the principles of DNA transfer, persistence, prevalence, and recovery (DNA-TPPR). The following tables summarize key quantitative factors that introduce uncertainty and must be considered during analysis [11] [58] [53].

Table 1: Factors Affecting the Complexity of DNA Mixture Interpretation

| Factor | Description | Impact on Interpretation Uncertainty |
| --- | --- | --- |
| Number of Contributors | The total number of individuals whose DNA is present in the sample. | Increases exponentially with each additional contributor; mixtures from >3 individuals are often too complex for reliable interpretation [58]. |
| DNA Quantity | The proportion of DNA each contributor adds to the mixture. | Low-level contributors (minor components) are susceptible to allelic drop-out, making their profiles difficult to detect or resolve [58]. |
| DNA Degradation | The breakdown of DNA strands over time or due to environmental exposure. | Results in a loss of genetic information, manifesting as allelic drop-out and an imbalanced peak profile, complicating contributor assignment [58]. |
| Shedder Status | The propensity of an individual to deposit DNA via touch. | Introduces variability in the amount of DNA transferred during contact, affecting which contributors are detected and in what proportion [11]. |
| Background DNA | The pre-existing non-self DNA on a surface or body area prior to the event of interest. | Can mask or dilute DNA transferred during a criminal action, leading to complex mixtures with irrelevant contributors [11]. |

Table 2: Factors Influencing DNA Transfer, Persistence, Prevalence, and Recovery (TPPR) on Body Surfaces

| Factor | Consideration | Effect on Evidence Interpretation |
| --- | --- | --- |
| Body Area Sampled | Areas such as neck, arms, breasts (external) vs. vaginal, anal, oral (internal). | Influences the prevalence of background DNA, persistence time of foreign DNA, and efficiency of recovery methods [11]. |
| Type of Contact | Skin-to-skin touch vs. contact with bodily fluids (semen, saliva). | Affects the initial amount of DNA transferred and its subsequent persistence on the substrate [11]. |
| Time to Sampling | The delay between the alleged contact and the collection of evidence. | Longer delays increase the degradation and loss of deposited DNA (reduced persistence), lowering the quality and quantity of the profile [11]. |
| Sampling Method | Double-swab technique, single wet/dry swab, or tape-lifting. | The double-swab technique has been shown to recover approximately 13.7% more foreign DNA than other methods from skin [11]. |

Experimental Protocols for TPPR Studies

The following protocols provide a standardized methodology for conducting research into DNA Transfer, Persistence, Prevalence, and Recovery, which forms the foundation for evaluative opinions at the activity level [11] [53].

Protocol 1: DNA Recovery from Skin via Double-Swabbing

This protocol details the optimal method for collecting touch DNA evidence from skin surfaces following a contact event [11].

1. Pre-Sampling Preparation

  • Materials: Sterile water, two consecutive cotton or viscose swabs per area, swab packaging (e.g., paper envelopes or plastic tubes), tamper-evident seals, personal protective equipment (PPE).
  • Documentation: Pre-label swab packaging with unique sample identifiers. Record the body area to be sampled, donor details, and time since contact.

2. Sampling Procedure

  • Moisten the First Swab: Slightly moisten the tip of the first swab with sterile water. Avoid oversaturation.
  • Sample the Area: Firmly roll the moistened swab over the entire target skin surface. Apply sufficient pressure to dislodge epithelial cells. Alternatively, a rubbing motion may be used, though rolling is preferred for controlled pressure.
  • Air Dry: Place the first swab in its packaging and allow it to air dry completely at room temperature to prevent microbial growth.
  • Second Dry Swab: Using a second, dry swab, repeat the rolling/rubbing procedure over the same skin area. This collects cellular material not recovered by the wet swab.
  • Packaging: Place the second, now-dry swab in its separate packaging. Seal and store both swabs at room temperature or frozen until extraction.

3. Post-Sampling

  • Controls: Collect control samples from adjacent, non-contacted skin areas of the same individual using the same method.
  • Chain of Custody: Maintain rigorous chain-of-custody documentation for all samples.

Protocol 2: Simulating DNA Transfer for Activity-Level Research

This protocol outlines a controlled experiment to study DNA transfer in a mock assault scenario, evaluating variables such as shedder status and contact duration [11].

1. Experimental Setup

  • Ethics Approval: Obtain approval from the relevant institutional ethics committee before beginning.
  • Participant Recruitment & Consent: Recruit adult volunteer "donors" and "recipients". Obtain written, informed consent that outlines the study's purpose, procedures, and data handling policies.
  • Baseline Sampling: Swab the palms of all donors and the forearms of all recipients prior to contact to establish background DNA levels.

2. Controlled Contact Simulation

  • Standardize Contact: The donor firmly grips the recipient's forearm for a predetermined duration (e.g., 10, 30, or 60 seconds). The pressure and grip area should be consistent across experiments.
  • Variable Manipulation: Systematically vary one factor per experiment (e.g., contact duration, use of gloves by donors, or application of moisturizer on skin) to isolate its effect.

3. Post-Contact Sample Recovery

  • Immediate Sampling: Immediately after contact, sample the recipient's contacted forearm using the double-swabbing technique described in Protocol 1.
  • Persistence Time Series: To study persistence, sample the contacted area at multiple time intervals post-contact (e.g., 15 min, 1 hour, 6 hours, 24 hours).

4. Analysis and Data Interpretation

  • DNA Profiling: Extract and profile DNA from all samples using standard laboratory protocols and sensitive amplification kits.
  • Profile Analysis: Use probabilistic genotyping software (PGS) to deconvolute mixed profiles and calculate the relative contribution of donor DNA.
  • Statistical Evaluation: Analyze the data to determine the impact of the manipulated variable (e.g., contact duration) on the success of detecting and attributing the donor's DNA.
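One common way to summarize a persistence time series like the one above is a first-order decay model, N(t) = N0·exp(-kt). This model is an assumption introduced here for illustration (the protocol does not prescribe it), and all quantities below are invented.

```python
import math

def decay_constant(n0, nt, t):
    """Estimate k in N(t) = N0 * exp(-k * t) from quantities at time 0 and t."""
    return math.log(n0 / nt) / t

def predict(n0, k, t):
    """Predicted DNA quantity remaining at time t under first-order decay."""
    return n0 * math.exp(-k * t)

# Illustrative donor-DNA quantities (ng): 2.0 at deposit, 0.5 at 6 h.
k = decay_constant(2.0, 0.5, 6.0)
print(round(predict(2.0, k, 24.0), 4))  # extrapolated quantity at 24 h
```

Fitting across all sampled intervals (rather than two points) and checking the residuals would indicate whether first-order decay is actually a reasonable description of persistence for a given body area and contact type.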

Workflow Visualizations

Activity-Level Interpretation Workflow

(Diagram) Activity-level case question → case assessment and information gathering → design and conduct of TPPR experiments (refer to protocols) → DNA evidence collection (double-swab technique) → laboratory analysis and profile generation → probabilistic genotyping (mixture deconvolution) → evaluation of findings given activity-level propositions → reporting of the likelihood ratio (LR) and its limitations.

Mitigating Mixture Uncertainty with PGS

(Diagram) Complex DNA mixture evidence carries three sources of uncertainty: allele drop-out, allele drop-in, and contributor assignment. Probabilistic genotyping software (PGS) feeds all three into a statistical model that accounts for stochastic effects, allele frequencies, and potential drop-out/drop-in, and outputs a likelihood ratio (LR) for proposed contributors.

Research Reagent and Material Solutions

Table 3: Essential Materials for DNA TPPR and Mixture Interpretation Research

Item | Function/Application
Cotton or Viscose Swabs | The primary tool for sample collection from surfaces and skin. The double-swab technique is recommended for optimal recovery [11].
Probabilistic Genotyping Software (PGS) | Computer programs that use statistical models to calculate the probability of observing a complex DNA mixture given different contributor scenarios, accounting for drop-in, drop-out, and allele frequencies [58].
High-Sensitivity DNA Amplification Kits | Modern polymerase chain reaction (PCR) kits capable of generating DNA profiles from minute quantities of DNA (trace evidence), which often results in the detection of complex mixtures [58].
Sterile Water | Used to moisten the first swab in the double-swab technique to enhance cell collection from dry surfaces [11].
Reference DNA Samples | Buccal (cheek) swabs or blood samples collected from all volunteer donors and recipients in a controlled study to provide known reference profiles for comparison against evidence samples [11].
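To illustrate what PGS does at its core, the following is a deliberately minimal, single-locus, single-contributor sketch of a semi-continuous likelihood-ratio model. All parameter values (drop-out and drop-in rates, allele frequencies) are hypothetical, and operational PGS tools additionally model peak heights, multiple contributors, homozygote drop-out, and many loci.

```python
from itertools import combinations_with_replacement

# Illustrative sketch only: a semi-continuous model for ONE locus and ONE
# contributor. All parameter values below are hypothetical.
DROP_OUT = 0.2   # probability a true allele fails to be detected
DROP_IN = 0.05   # probability of a single spurious allele appearing

# Hypothetical allele frequencies at the locus
FREQS = {"12": 0.30, "13": 0.45, "14": 0.25}

def p_evidence(evidence, genotype):
    """P(observed allele set | contributor genotype) under independent
    drop-out events and a simple one-event drop-in model."""
    p = 1.0
    true_alleles = set(genotype)
    for allele in true_alleles:
        p *= (1 - DROP_OUT) if allele in evidence else DROP_OUT
    extra = evidence - true_alleles          # alleles needing drop-in
    p *= DROP_IN ** len(extra) if extra else (1 - DROP_IN)
    return p

def genotype_prior(genotype):
    """Hardy-Weinberg genotype probability."""
    a, b = genotype
    return FREQS[a] ** 2 if a == b else 2 * FREQS[a] * FREQS[b]

def likelihood_ratio(evidence, suspect_genotype):
    """LR = P(E | suspect is the source) / P(E | an unknown person is)."""
    numerator = p_evidence(evidence, suspect_genotype)
    denominator = sum(
        genotype_prior(g) * p_evidence(evidence, g)
        for g in combinations_with_replacement(sorted(FREQS), 2)
    )
    return numerator / denominator
```

With evidence alleles {12, 13} and suspect genotype (12, 13), the LR favours the suspect-as-source proposition; with only {12} observed, the support drops sharply, showing why drop-out must be modelled rather than ignored.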

Ensuring Credibility: Validation, Fit-for-Purpose Models, and Comparative Analysis

Establishing Validation Frameworks for Activity-Level Evaluations

Activity-level evaluations represent a critical advancement in forensic science, moving beyond source identification to address how forensic evidence was transferred during the commission of a crime. This framework assesses findings given activity-level propositions, often answering 'how' and 'when' evidence was deposited [4]. Such evaluations are particularly valuable for explaining the significance of trace evidence, including fibres and DNA, within the specific context of a case. The evaluation relies on understanding the dynamics of transfer, persistence, prevalence, and recovery (TPPR) of materials [15] [59].

Despite their importance, global adoption faces significant barriers. Recent research indicates that 53% of practitioners in Australia and New Zealand felt their laboratory processes were insufficient for implementing activity-level reporting, viewing it as an initiative for the next 5-10 years [60]. Challenges include methodological concerns, a lack of robust data for informing probabilities, regional differences in regulatory frameworks, and insufficient training resources [4]. This protocol establishes a validation framework to overcome these barriers, providing forensic scientists with structured approaches for implementing scientifically rigorous activity-level evaluations.

Quantitative Data on Evidence Transfer and Persistence

Robust activity-level evaluation requires quantitative data on the behaviour of evidence under various conditions. The following tables summarize key empirical data essential for validating transfer and persistence assumptions.

Table 1: Fibre Persistence Based on Physical Activity Intensity (4-hour observation period) [34]

Activity Intensity | Approximate Fibre Retention | Key Influencing Factors
Low Intensity | Higher persistence | Minimal garment movement, reduced environmental contact
Moderate Intensity | Moderate persistence | Intermittent movement, varied contact surfaces
High Intensity | Rapid initial loss, lower final persistence | Significant garment movement, increased environmental friction

Table 2: Forensic Fibre Characteristics and Categorization [34]

Property | Categorization | Forensic Significance
Colour | 10 categories (e.g., black/grey, blue, red, yellow) | Primary characteristic for visual discrimination
Fibre Type | 9 categories (e.g., cotton, wool, man-made) | Indicates source material and transfer potential
Length | 4 categorical ranges (≤1.0 mm, 1.0–3.0 mm, 3.1–5.0 mm, >5.0 mm) | Influences persistence; shorter fibres shed more easily

Table 3: Operational Barriers to Implementing Activity-Level Reporting [60] [4]

Barrier Category | Description | Reported Impact
Resource Limitations | Lack of robust data, training, and funding | Prevents 53% of labs from implementing ALR
Methodological Concerns | Reticence toward proposed evaluation methodologies | Hinders standardized adoption across jurisdictions
Regulatory Differences | Regional variations in legal frameworks and standards | Complicates international consensus and practice

Experimental Protocols for Activity-Level Evaluation

Protocol for Fibre Transfer and Persistence Simulation

This protocol quantifies fibre transfer and persistence under controlled physical activities, generating data for evaluating evidence significance [34].

  • Objective: To simulate realistic transfer scenarios and quantify the persistence of textile fibres on recipient garments under varying intensities of physical activity.
  • Materials and Reagents:
    • Donor and recipient garments (e.g., 100% cotton T-shirts, polyester/cotton hoodies)
    • 48 mm adhesive tape (e.g., Scotch Tough Grip) and clear acetate sheets for fibre recovery
    • Stereoscopic zoom macroscope with digital camera (e.g., Leica EZ4D) and comparison microscope (e.g., Leica DM4M-FSCB)
  • Procedure:
    • Garment Preparation: Launder all garments separately. Blank (clean) each garment using a lint remover and adhesive tape to remove extraneous fibres prior to each experimental session.
    • Simulated Assault: Conduct a choreographed upper body-dominant assault re-enactment between a 'victim' and 'assailant,' each wearing specific garments to facilitate initial fibre transfer.
    • Persistence Phase: Direct participants to engage in a defined physical activity (low, moderate, or high intensity) for specified durations (10, 30, 60, 120, or 240 minutes).
    • Fibre Recovery: Systematically tapelift the entire garment surface using a standardized grid pattern (e.g., 1x1 cm grids). Apply tape to clear acetate sheets for preservation and analysis.
    • Microscopic Examination: Examine all fibres within ten predetermined grids per tapelift. Count and categorize each fibre by colour and generic fibre type.
    • Fibre Measurement: Measure the continuous length of a randomized subset of target fibres using microscope software (e.g., Leica Application Suite).
  • Data Analysis: Calculate the number and proportion of target fibres retained per unit area over time. Analyse the relationship between activity intensity, time elapsed, and fibre retention rate. Categorize fibre length data to determine if persistence is size-dependent.
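As a numerical sketch of this analysis, the snippet below fits a single-exponential retention model to hypothetical grid counts (the counts are illustrative, not data from [34], and the rapid initial loss reported for high-intensity activity would arguably call for a bi-exponential model):

```python
import math

# Hypothetical target-fibre counts per tapelift area (illustrative only):
# minutes elapsed -> fibres retained
observations = {10: 180, 30: 120, 60: 75, 120: 30, 240: 8}

def fit_exponential_decay(obs):
    """Least-squares fit of N(t) = N0 * exp(-k*t) via log-linear regression."""
    ts = list(obs)
    ys = [math.log(obs[t]) for t in ts]
    n = len(ts)
    mean_t = sum(ts) / n
    mean_y = sum(ys) / n
    slope = sum((t - mean_t) * (y - mean_y) for t, y in zip(ts, ys)) / \
            sum((t - mean_t) ** 2 for t in ts)
    intercept = mean_y - slope * mean_t
    return math.exp(intercept), -slope   # initial count N0, decay constant k

n0, k = fit_exponential_decay(observations)
half_life = math.log(2) / k  # minutes until half the fibres are lost
```

The fitted decay constant then lets activity intensity and time elapsed be compared on a common scale (e.g., via per-condition half-lives).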

Protocol for DNA Transfer Simulation to Address Activity-Level Propositions

This protocol provides a methodology for generating empirical data to distinguish between direct and indirect DNA transfer scenarios, a common activity-level question [59].

  • Objective: To simulate direct and indirect DNA transfer activities and quantify recovery data to inform the evaluation of competing activity-level propositions.
  • Materials and Reagents:
    • DNA-free objects for handling (e.g., knife sheath, tools, clothing items)
    • Sterile swabs and DNA collection kits
    • DNA extraction and quantification kits
    • PCR amplification kits and genetic analyzers
  • Procedure:
    • Direct Transfer Simulation: Participant A handles the object directly (e.g., touches the snap of a knife sheath) for a defined period and pressure.
    • Indirect Transfer Simulation: Participant A shakes hands with Participant B, who then handles the target object with the same contact parameters.
    • Sample Collection: Swab the identical contact area on the object from both simulations using a standardized swabbing technique and solution.
    • DNA Analysis: Extract, quantify, and generate DNA profiles from all recovered samples following standard laboratory protocols.
  • Data Analysis: Compare the quantity of DNA recovered and the quality of DNA profiles obtained from direct versus indirect transfer simulations. Statistically analyse the data to determine the evidential strength for distinguishing between these competing activity-level propositions.
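The statistical comparison could be sketched as below, using a hand-rolled Mann-Whitney U statistic on hypothetical DNA yields; in practice a statistics package (e.g., scipy.stats.mannwhitneyu) would also supply a p-value:

```python
# Hypothetical DNA yields (ng) recovered from direct vs. indirect swabs
direct = [2.1, 1.8, 2.6, 1.2, 3.0, 2.4]
indirect = [0.3, 0.1, 0.5, 0.2, 0.8, 0.4]

def mann_whitney_u(a, b):
    """U statistic for sample `a`: the number of (x, y) pairs with
    x > y, counting ties as 0.5. Ranges from 0 to len(a) * len(b)."""
    return sum(
        1.0 if x > y else 0.5 if x == y else 0.0
        for x in a for y in b
    )

u = mann_whitney_u(direct, indirect)
# In this fabricated example every direct-transfer yield exceeds every
# indirect one, so U attains its maximum of 36 (complete separation).
```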

Framework Implementation and Workflow

The following diagram illustrates the logical workflow for implementing a validation framework for activity-level evaluation, from case context to final interpretation.

Workflow: Define Case Context and Activity-Level Propositions → Review Relevant TPPR Data from Validated Studies → Select Appropriate Probabilistic Model → Calculate Likelihood Ratio (LR) for Competing Propositions → Interpret LR within Case Context → Formulate Evaluation Report

Validation Framework Workflow

The structured workflow ensures evaluations are logical, robust, balanced, and transparent [59]. This process integrates TPPR parameters and case context to calculate a likelihood ratio, providing a scientifically valid method for addressing activity-level questions.

The Scientist's Toolkit

Implementing activity-level evaluations requires specific reagents, materials, and instrumentation. The following table details essential components for conducting transfer and persistence studies.

Table 4: Research Reagent Solutions and Essential Materials [34]

Item | Function/Application | Specification Example
Textile Garments | Serve as donor/recipient substrates in transfer studies | 100% cotton T-shirts (180 gsm); polyester/cotton hoodies (310 gsm)
Adhesive Tape | Recovers fibres from garment surfaces for analysis | 48 mm width (e.g., Scotch Tough Grip Moving Tape)
Acetate Sheets | Provide clear backing for storing and examining tape lifts | Clear acetate sheets (e.g., EXP600 OHP)
Stereoscopic Macroscope | Initial examination and morphological characterization of recovered fibres | Leica EZ4D with integrated digital camera
Comparison Microscope | Detailed characterization at higher magnification | Leica DM4M-FSCB (50–1000x magnification)
Measurement Software | Quantifies continuous length of fibres | Leica Application Suite (LAS) with measurement module

The validation framework presented herein provides a structured pathway for integrating activity-level evaluations into operational forensic practice. By leveraging empirical TPPR data, following standardized experimental protocols, and adhering to a rigorous interpretive workflow, practitioners can substantially improve the credibility and utility of forensic evaluations. The proposed approach addresses a critical gap in forensic science, empowering experts to provide more meaningful assistance to investigators, attorneys, and courts. This framework represents a significant step toward global acceptance of robust, factual, and helpful activity-level reporting that ultimately strengthens the criminal justice system.

The Role of Fit-for-Purpose Principles in Model Evaluation and Regulatory Review

The Fit-for-Purpose (FFP) principle has emerged as a cornerstone framework for ensuring that models and analytical methods in scientific research and development are appropriately aligned with their intended applications. Within drug development, FFP provides a strategic approach for selecting and validating Model-Informed Drug Development (MIDD) tools that are closely matched to specific Questions of Interest (QOI) and Contexts of Use (COU) across all stages from discovery to post-market surveillance [24]. Simultaneously, in forensic science, FFP principles guide the development of models and analytical approaches for activity level research, particularly in understanding DNA transfer and persistence, to evaluate evidence under competing activity propositions [42] [51] [41]. This application note explores the application of FFP principles across these domains, providing structured protocols, quantitative data summaries, and visualization tools to enhance methodological rigor and regulatory acceptance.

The Fit-for-Purpose concept represents a pragmatic, risk-based approach to method validation and model development that prioritizes fitness for intended use over one-size-fits-all validation standards. In regulatory science, FFP acknowledges that the level of validation and documentation should be commensurate with the decision consequence and potential patient risk [61]. For activity level research in forensic contexts, FFP principles guide the development of experimental designs and probabilistic models that appropriately address questions about how biological material was transferred and persisted under specific case circumstances [42] [41].

The fundamental components of an FFP approach include:

  • Clear definition of the Context of Use (COU): The specific purpose and application of the model or method must be explicitly stated [24] [62].
  • Alignment with Questions of Interest (QOI): The methodology must directly address the specific scientific or regulatory questions being posed [24].
  • Risk-based validation strategy: The extent of validation reflects the model's influence on decisions and potential consequences of incorrect decisions [61].
  • Iterative development and refinement: Models and methods may evolve through successive iterations as new data and knowledge emerge [63] [61].

Fit-for-Purpose Framework in Regulatory Science

Regulatory Adoption of FFP Principles

Regulatory agencies have formally embraced FFP approaches through various initiatives. The U.S. Food and Drug Administration's Fit-for-Purpose Initiative provides a pathway for regulatory acceptance of dynamic tools for use in drug development programs [25]. This initiative recognizes that due to the evolving nature of certain drug development tools (DDTs) and the inability to provide formal qualification, a designation of 'fit-for-purpose' can be established based on thorough evaluation of submitted information [25].

The FDA's FFP determination is publicly available to facilitate greater utilization of these tools in drug development programs. To date, the FDA has granted FFP designation to several applications, including:

  • Alzheimer's disease model for clinical trial design (CAMD)
  • MCP-Mod statistical method for dose-finding (Janssen and Novartis)
  • Bayesian Optimal Interval (BOIN) design for dose selection (MD Anderson Cancer Center)
  • Empirically Based Bayesian Emax Models for dose selection (Pfizer) [25] [61]

FFP in Model-Informed Drug Development (MIDD)

In MIDD, the FFP principle guides the selection and application of quantitative tools throughout the five stages of drug development: discovery, preclinical research, clinical research, regulatory review, and post-market monitoring [24]. The appropriate modeling methodology depends on the specific stage and questions being addressed, as illustrated in Table 1.

Table 1: Fit-for-Purpose Modeling Tools in Drug Development

Development Stage | Key Questions of Interest | Fit-for-Purpose Modeling Approaches
Discovery | Target identification, compound screening | QSAR, AI/ML for compound optimization [24]
Preclinical Research | Lead optimization, FIH dose prediction | PBPK, QSP, semi-mechanistic PK/PD [24]
Clinical Research | Dose optimization, trial design | PopPK, ER, clinical trial simulation [24]
Regulatory Review | Benefit-risk assessment, labeling | Model-integrated evidence, PBPK [24] [61]
Post-Market Monitoring | Safety monitoring, label updates | Model-based meta-analysis, virtual population simulation [24]

The Model Master File (MMF) framework has recently emerged as a platform to support model reusability and sharing in regulatory settings, further enhancing the efficiency of FFP application in drug development [61].

FFP in Biomarker Method Validation

In biomarker development, FFP validation has been widely adopted to ensure that assays produce reliable and interpretable results appropriate for their intended use [62] [63]. The level of validation varies based on the assay classification and its application in the drug development continuum, from exploratory research to critical decision-making [63].

Table 2: Fit-for-Purpose Biomarker Assay Validation Parameters

Performance Characteristic | Definitive Quantitative | Relative Quantitative | Quasi-Quantitative | Qualitative
Accuracy/Trueness | Required | Required | Not required | Not required
Precision | Required | Required | Required | Not required
Sensitivity | LLOQ | LLOQ | Characteristic response | Detection limit
Specificity | Required | Required | Required | Required
Dilution Linearity | Required | Required | Not applicable | Not applicable
Assay Range | LLOQ–ULOQ | LLOQ–ULOQ | Characteristic range | Not applicable

The FFP approach to biomarker validation progresses through discrete stages: (1) definition of purpose and candidate assay selection; (2) method validation planning; (3) performance verification; (4) in-study validation; and (5) routine use with quality control monitoring [63]. This iterative process allows for continuous improvement and refinement based on accumulating experience and data.
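For planning purposes, the tiered requirements of Table 2 can be encoded as a simple lookup; the category keys and parameter labels below are our shorthand restatement of the table, not an established nomenclature:

```python
# Validation parameters required per assay category (restating Table 2)
REQUIREMENTS = {
    "definitive_quantitative": {"accuracy", "precision", "sensitivity(LLOQ)",
                                "specificity", "dilution_linearity",
                                "range(LLOQ-ULOQ)"},
    "relative_quantitative":   {"accuracy", "precision", "sensitivity(LLOQ)",
                                "specificity", "dilution_linearity",
                                "range(LLOQ-ULOQ)"},
    "quasi_quantitative":      {"precision",
                                "sensitivity(characteristic response)",
                                "specificity", "range(characteristic)"},
    "qualitative":             {"sensitivity(detection limit)", "specificity"},
}

def missing_parameters(category, validated):
    """Parameters still outstanding for a planned fit-for-purpose validation."""
    return sorted(REQUIREMENTS[category] - set(validated))
```

For example, a qualitative assay that has demonstrated specificity still needs its detection limit characterized before in-study use.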

Fit-for-Purpose Principles in Activity Level Research

Activity Level Propositions in Forensic Science

In forensic science, particularly in DNA transfer and persistence studies, FFP principles guide research designed to address activity level propositions—questions about how DNA was transferred and persisted under specific case circumstances [42] [41]. Such research aims to provide empirical data and probabilistic models to evaluate evidence given competing scenarios about activities that may have led to DNA deposition [41].

Key questions in activity level research include:

  • How much DNA is transferred through specific activities?
  • How persistent is transferred DNA over time and under various conditions?
  • Can primary transfer be distinguished from secondary transfer?
  • Can habitual use be differentiated from single instances of contact? [41]

Experimental Data on DNA Transfer and Persistence

Recent research has generated quantitative data on DNA transfer and persistence relevant to activity level assessments. The following tables summarize key findings from empirical studies.

Table 3: DNA Transfer and Persistence Following Hand Contact

Contact Scenario | Detection Rate | Key Influencing Factors | Study
Immediate post-handshake | 94% (LR ≥ 10,000) | Shedder status, hand dominance | [51]
15 minutes post-handshake | 50% (LR ≥ 10,000) | Activities performed in interim | [51]
Intimate contact | Vaginal mucosa detected up to 36 h | Type of contact, time elapsed | [42]
Non-intimate social contact | Vaginal mucosa in 1 sample | Associated DNA quantity | [42]

Table 4: DNA Recovery from Habitual vs One-Time Use Items

Item Type | Habitual User DNA Mean | One-Time User DNA Mean | Differential Recovery
Mobile Phone | High (≥80% contribution) | Low | Most pronounced difference
Car Key | High (≥80% contribution) | Low | Pronounced difference
Office Mouse | High (≥80% contribution) | Low | Pronounced difference
Drinking Glass | Variable | Variable | Less pronounced difference
Knife Handle | Variable | Variable | Less pronounced difference

The data indicate that item porosity, surface texture, and nature of contact significantly influence DNA transfer and persistence [41]. Non-porous items with textured surfaces (e.g., mobile phones, car keys) show more predictable DNA deposition patterns compared to smooth or porous items [41].

Experimental Protocols for Activity Level Research

Protocol for Handshake Transfer and Persistence Study

Objective: To quantify the transfer and persistence of non-self DNA acquired through handshakes and determine how activities performed after transfer affect DNA detection.

Materials:

  • DNA-free glass plates (140 × 220 mm)
  • Sterile swabs and collection kits
  • Quantitative PCR system
  • STR amplification kits
  • Genetic analyzers
  • Probabilistic genotyping software

Procedure:

  • Participant Selection: Recruit volunteer pairs who do not share living or working spaces to minimize background DNA transfer.
  • Baseline Sampling: Collect buccal reference samples from all participants.
  • Handshake Transfer: Participants perform firm handshakes for 10 seconds using right hands.
  • Experimental Groups:
    • Group A: Participants immediately place right hand deposits on glass plates (P1) with 10 seconds of pressure.
    • Group B: Participants return to daily activities for 15 minutes before placing hand deposits on glass plates (P1).
  • Series Sampling: All participants place four additional sequential handprints on separate glass plates (P2-P5) at 5-minute intervals.
  • Sample Processing:
    • Swab entire hand contact area on each plate.
    • Extract DNA using standardized forensic protocols.
    • Quantify DNA yield using qPCR.
    • Perform STR amplification and capillary electrophoresis.
    • Analyze profiles using probabilistic genotyping software.
  • Data Analysis:
    • Calculate Likelihood Ratios (LR) for known contributors.
    • Determine proportion of mixture contributions for each donor.
    • Assess impact of activities on DNA transfer and persistence [51].
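The final analysis step might be implemented as in the sketch below. The per-sample LR values are hypothetical, chosen only to loosely mirror the reported detection rates (94% immediately after the handshake versus 50% after 15 minutes of interim activity) [51]:

```python
THRESHOLD = 1e4  # LR cut-off used for "detected" in the source study

# Hypothetical per-sample LRs for the known handshake partner's DNA
group_a = [3e7, 5e6, 2e5, 1.2e4, 8e4, 4e6, 9e5, 2e4, 7e3, 1e6]   # immediate
group_b = [2e5, 6e3, 1.1e4, 3e2, 9e4, 50, 4e3, 2e6, 1e3, 8e2]    # after 15 min

def detection_rate(lrs, threshold=THRESHOLD):
    """Fraction of samples whose LR for the known partner meets the cut-off."""
    return sum(lr >= threshold for lr in lrs) / len(lrs)
```

Comparing detection_rate(group_a) against detection_rate(group_b) quantifies how interim activities erode the transferred DNA signal.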

Protocol for Habitual vs One-Time Use Study

Objective: To differentiate DNA contributions from habitual and one-time users of commonly encountered items and assess the potential for secondary transfer.

Materials:

  • Test items: mobile phones, car keys, office mice, drinking glasses, knife handles
  • Sterile swabs and collection kits
  • Quantitative PCR system
  • STR amplification kits
  • Genetic analyzers

Procedure:

  • Participant Selection: Recruit participants familiar with each other but not co-habiting.
  • Habitual Use Phase: Designated habitual users employ test items for 2 hours during normal work activities.
  • Initial Sampling: Swab items after habitual use period according to standard forensic protocols.
  • One-Time Use Phase: Second participants handle each item for 5 seconds in a manner consistent with normal use.
  • Post-Use Sampling: Reswab items after one-time use.
  • Cleaning Protocol: Clean non-porous items with 70% ethanol and reswab to assess DNA removal.
  • Sample Processing:
    • Extract DNA using standardized forensic protocols.
    • Quantify DNA yield.
    • Perform STR amplification and analysis.
    • Determine DNA contributor proportions using probabilistic genotyping.
  • Data Analysis:
    • Compare DNA quantities and contributor proportions before and after one-time use.
    • Assess effectiveness of cleaning protocols.
    • Evaluate potential for secondary transfer [41].
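A minimal sketch of the contributor-proportion comparison follows; the proportions are hypothetical probabilistic-genotyping outputs, and the 0.80 cut-off echoes the ≥80% habitual-user contribution reported for personal items [41]:

```python
# Hypothetical habitual-user mixture shares before/after one-time handling
proportions = {
    "mobile_phone":   {"before_one_time_use": 0.92, "after_one_time_use": 0.86},
    "drinking_glass": {"before_one_time_use": 0.55, "after_one_time_use": 0.41},
}

def attribution(share, cutoff=0.80):
    """Crude classification of the habitual user's share of the mixture."""
    return "habitual-user dominated" if share >= cutoff else "ambiguous"

for item, obs in proportions.items():
    print(item, attribution(obs["after_one_time_use"]))
```

The non-porous, textured phone stays habitual-user dominated even after a second handler, while the glass drops into the ambiguous band, matching the pattern in Table 4.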

Visualization of Fit-for-Purpose Frameworks

FFP Model Development and Evaluation Workflow

Workflow: Define Context of Use (COU) → Identify Questions of Interest (QOI) → Assess Model Risk & Decision Consequence → Select Appropriate Modeling Approach → Develop & Validate Model → Evaluate Fitness for Purpose → Regulatory Submission/Implementation

Figure 1: Fit-for-Purpose Model Development and Evaluation Workflow. This diagram illustrates the iterative process for developing and evaluating models under FFP principles, from initial definition of Context of Use through implementation.

DNA Transfer Pathways and Influencing Factors

Main pathway: DNA Source → Transfer Mechanism → Surface/Item Contact → DNA Detection & Analysis → Activity Level Interpretation. Influencing factors: shedder status acts on the transfer mechanism; interim activities act on surface/item contact; surface properties and time elapsed act on detection.

Figure 2: DNA Transfer Pathways and Influencing Factors. This diagram illustrates the pathway of DNA transfer from source to interpretation, highlighting key factors that influence transfer and persistence outcomes.

The Scientist's Toolkit: Essential Research Reagents and Materials

Table 5: Essential Research Materials for FFP Studies

Material/Reagent | Specification | Function/Application | Example Use
DNA-free surfaces | Glass plates, plastic items | Standardized substrates for transfer studies | Handprint deposition studies [51]
Sterile swabs | Forensic-grade collection devices | Sample collection from surfaces | Recovery of DNA from test items [41]
Quantitative PCR kits | Human-specific DNA targets | DNA quantification and quality assessment | Determining total DNA yield [42] [41]
STR amplification kits | Commercial multiplex systems | DNA profiling and contributor analysis | Generating DNA profiles for comparison [42] [51]
Probabilistic genotyping software | Continuous interpretation systems | Deconvolution of mixed DNA profiles | Determining likelihood ratios for contributors [42] [41]
Reference DNA samples | Buccal cells or blood samples | Known profiles for comparison | Baseline references for participant identification [51] [41]
Statistical analysis software | R, Python, or specialized packages | Data analysis and visualization | Bayesian network analysis for activity level assessment [42]

The Fit-for-Purpose principle provides a robust framework for developing and evaluating models and methods across diverse scientific domains, from drug development to forensic activity level research. By emphasizing alignment between methodology and intended application, FFP approaches promote efficient resource allocation while maintaining scientific rigor appropriate to the decision context. The protocols, data summaries, and visualizations provided in this application note offer practical guidance for implementing FFP principles in research settings, facilitating the generation of reliable, interpretable, and regulatory-acceptable evidence for both developmental and forensic applications. As these fields continue to evolve, the FFP paradigm will remain essential for ensuring that scientific approaches remain appropriately matched to their intended contexts of use.

The forensic science paradigm is progressively shifting from merely identifying the source of biological material to answering activity-level propositions, which concern how and when forensic evidence was deposited [4]. This is particularly crucial when evaluating direct transfer scenarios, such as skin-to-skin contact between individuals in cases of assault. The interpretation of such trace evidence, including DNA, requires a robust understanding of its transfer, persistence, prevalence, and recovery (DNA-TPPR) to objectively weigh competing scenarios about the activities that led to its deposition [11] [64]. This framework allows scientists to move beyond the question of "whose DNA is this?" to the more forensically relevant question of "how did this DNA get here?" [4]. This application note provides a detailed protocol for designing experiments and analyzing data to compare activity-level propositions, with a specific focus on trace evidence transfer and persistence.

Quantitative Data Synthesis

Foundational research into DNA-TPPR provides essential quantitative data that informs the assessment of competing activity-level propositions. The tables below synthesize key empirical findings from experimental studies.

Table 1: Factors Influencing DNA Transfer and Persistence on Skin

Factor | Impact on DNA Evidence | Key Experimental Findings
Shedder Status | Influences the amount of DNA an individual deposits [11]. | Individuals are classified as "good" or "poor" shedders, affecting the probability of detecting a foreign profile after contact [11].
Background DNA | Non-self DNA present on a skin surface prior to the activity of interest [11]. | The prevalence and quantity of background DNA can mask DNA transferred during an alleged activity, complicating profile interpretation [11].
Contact Dynamics | Duration and pressure of skin-to-skin contact [65]. | Increased pressure and contact time generally lead to higher transfer ratios; however, the relationship is complex and non-linear [65].
Persistence Time | The duration for which deposited DNA remains detectable [11]. | DNA on skin degrades and is lost over time due to environmental exposure, sweat, and skin shedding; most foreign DNA is undetectable within a few hours [11].
Body Area Sampled | The specific location on the body from which a sample is collected [11]. | Areas with higher friction (e.g., arms) may yield more DNA than others; internal body cavities can retain DNA longer than external skin [11].

Table 2: DNA Recovery Efficiency from Skin Using Different Sampling Techniques

Sampling Technique | Description | Reported Efficiency & Key Findings
Double-Swabbing | Application of a wet swab (e.g., moistened with distilled water) followed by a dry swab to the same area [11]. | Recovered approximately 13.7% more offender DNA than other methods in a mock assault scenario; considered the most effective for touch DNA [11].
Single Swabbing (Wet) | Use of a single pre-moistened swab with a rolling or rubbing motion. | Yields lower DNA recovery compared to double-swabbing; efficiency varies with the specific movement (rubbing vs. rolling) [11].
Single Swabbing (Dry) | Use of a single dry swab with a rolling or rubbing motion. | Generally less effective than wet swabbing for recovering cellular material from skin [11].
Tape Lifting | Use of adhesive tape to remove material from the skin surface [11]. | Can be effective for certain surfaces but was less efficient than double-swabbing for skin-to-skin contact in one study [11].

Experimental Protocols

Universal Protocol for Transfer and Persistence Studies

This protocol, adapted from foundational trace evidence research, provides a standardised method for generating quantitative data on evidence transfer [65]. Using a UV-powder proxy material allows for highly replicated and quantifiable experiments.

1. Objective: To quantify the transfer and persistence of a proxy particulate material (simulating trace DNA or other evidence) under controlled conditions of pressure and contact time.

2. Materials:

  • Donor and Receiver Materials: 5 cm x 5 cm swatches of relevant fabrics (e.g., 100% cotton, wool, nylon) [65].
  • Proxy Material: UV powder mixed with flour in a 1:3 ratio by weight [65].
  • Application Surface: A flat, stable surface.
  • Weights: Calibrated masses (e.g., 200 g, 500 g, 1000 g).
  • UV Light Source: For illuminating the proxy material.
  • Imaging System: Camera with fixed settings and mounting equipment.
  • Image Analysis Software: e.g., ImageJ (Fiji) with a standardised macro for particle counting [65].

3. Methodology:

  • Step 1: Background Imaging. Place the donor and receiver swatches separately under UV light and capture background images (P1 and P2).
  • Step 2: Donor Preparation. Sprinkle a known quantity of the UV powder/flour mixture onto the central 3 cm x 3 cm area of the donor material. Capture an image (P3).
  • Step 3: Transfer Event. Carefully place the receiver material on top of the donor. Place the chosen weight on top for the designated contact time (e.g., 30, 60, 120, 240 seconds). Remove the weight and separate the swatches [65].
  • Step 4: Post-Transfer Imaging. Capture images of the donor material (P4) and the receiver material (P5) under UV light.
  • Step 5: Persistence Experiment. Attach the receiver material from Step 4 to an item of clothing. The clothing is worn for a defined period (e.g., up to 7 days) during normal activities. Image the receiver material at regular intervals to track particle loss over time [65].
  • Step 6: Image Analysis. Use the ImageJ macro to automatically count particles in all images. Calculate the Transfer Ratio and Transfer Efficiency using the formulas below [65].

4. Data Analysis:

  • Transfer Ratio = (Actual Receiver) / (Actual Donor) [65]
  • Transfer Efficiency = (Actual Receiver) / (Donor post-deposition - Donor post-transfer) [65]
  • Where:
    • Actual Receiver = P5 - P2
    • Actual Donor = P3 - P1

Statistical analysis (e.g., Mann-Whitney tests) should be applied to compare different experimental conditions (e.g., mass, time, material type) [65].
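The two formulas can be wrapped in a small helper for batch processing of image counts; reading 'Donor post-deposition' as P3 and 'Donor post-transfer' as P4 is our interpretation of the protocol's image labels:

```python
def transfer_metrics(p1, p2, p3, p4, p5):
    """Compute transfer metrics from the five particle counts defined in
    the protocol: P1/P2 background images, P3 donor post-deposition,
    P4 donor post-transfer, P5 receiver post-transfer."""
    actual_receiver = p5 - p2            # particles gained by the receiver
    actual_donor = p3 - p1               # particles deposited on the donor
    transfer_ratio = actual_receiver / actual_donor
    # Efficiency: received particles relative to those lost by the donor
    transfer_efficiency = actual_receiver / (p3 - p4)
    return transfer_ratio, transfer_efficiency

# Example with hypothetical particle counts:
ratio, efficiency = transfer_metrics(p1=5, p2=3, p3=500, p4=300, p5=150)
```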

Protocol for DNA Recovery from Skin in Mock Assault Scenarios

This protocol details the optimal method for collecting foreign DNA following skin-to-skin contact.

1. Objective: To maximize the recovery of non-self DNA transferred to a recipient's skin during a simulated physical contact.

2. Materials:

  • Swabs: Sterile cotton-tipped swabs (e.g., Puritan Cap-Shure) or nylon FLOQ swabs.
  • Deionized Water: For moistening swabs.
  • Labeled Swab Containers: For evidence storage.

3. Methodology:

  • Step 1: Moistening the Swab. Moisten the first swab with deionized water. Excess water should be gently shaken off.
  • Step 2: First Swab Pass. Roll the moistened swab thoroughly over the entire target skin area using a continuous, firm rolling motion. Avoid rubbing, which may lyse cells.
  • Step 3: Second Swab Pass. Immediately after the first pass, use a dry swab and roll it over the same skin area. This collects residual moisture and any cells dislodged but not picked up by the wet swab.
  • Step 4: Drying and Storage. Allow the swabs to air-dry at room temperature before placing them in secure, labeled containers to prevent degradation of DNA [11].

Visualizing Workflows and Relationships

The following diagrams illustrate the core logical and experimental frameworks.

Activity Level Evaluation Logic

DNA profile detected → formulate competing propositions: Proposition 1 (prosecution): direct transfer via the alleged contact; Proposition 2 (defense): innocent transfer or background. Evaluate the probability of finding the DNA if each proposition is true, then calculate the likelihood ratio: LR = P(DNA | Prop1) / P(DNA | Prop2).
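A minimal numerical illustration of the likelihood-ratio step; the probabilities are invented for demonstration, whereas in casework they would come from TPPR data and expert judgment.

```python
def likelihood_ratio(p_dna_given_prop1, p_dna_given_prop2):
    """LR = P(finding the DNA | Prop1) / P(finding the DNA | Prop2)."""
    return p_dna_given_prop1 / p_dna_given_prop2

# Hypothetical: the DNA finding is judged ten times more probable under
# direct transfer (Prop1) than under innocent transfer/background (Prop2)
lr = likelihood_ratio(0.8, 0.08)
print(round(lr, 1))  # → 10.0
```

An LR above 1 supports the prosecution proposition; an LR of 1 is neutral; below 1 supports the defense proposition.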

Trace Evidence Transfer Experiment

Prepare donor material (apply UV powder proxy) → background imaging (P1, P2) → apply receiver material and weight (vary mass and contact time) → post-transfer imaging (P4, P5) → persistence phase (wear receiver material over time) → image analysis and particle counting → calculate transfer ratio and persistence decay.

The Scientist's Toolkit

This table details essential reagents and materials for conducting transfer, persistence, and recovery experiments.

Table 3: Key Research Reagent Solutions and Materials

Item Function/Application Specific Examples & Notes
UV Powder & Flour Proxy A safe, quantifiable simulant for trace particulates like skin cells or GSR. Allows for highly replicated, low-cost experiments and visualization under UV light [65]. 1:3 (w/w) mixture of UV powder and flour. The ratio ensures particles are visible and transfer realistically [65].
Swabbing Systems The primary tool for recovering cellular material from skin and other surfaces. Cotton Swabs (Puritan): Traditional, effective for double-swabbing. Nylon FLOQ Swabs (Copan): Claim enhanced DNA release during extraction [11].
Image Analysis Software To objectively and efficiently count thousands of transferred proxy particles. ImageJ / Fiji: Open-source software. A standardised macro automates counting, ensuring consistency and reproducibility across studies [65].
Standardised Swatches Provide a consistent and controllable surface for transfer experiments. 5 cm x 5 cm swatches of defined materials (e.g., 100% cotton, wool, nylon). Standardization allows for inter-laboratory comparisons [65].

The increasing complexity of drug development has necessitated the adoption of advanced computational modeling and simulation tools. This article provides a detailed benchmarking analysis of four key approaches: Physiologically-Based Pharmacokinetic (PBPK), Quantitative Systems Pharmacology (QSP), Population PK (PopPK), and Artificial Intelligence/Machine Learning (AI/ML) modeling. Within the context of transfer and persistence considerations in activity-level research, these tools offer distinct capabilities for understanding drug behavior, predicting patient responses, and optimizing therapeutic interventions. The following sections present comparative frameworks, experimental protocols, and visualization tools to guide researchers in selecting and implementing appropriate methodologies throughout the drug development pipeline.

Comparative Analysis of Modeling Approaches

Table 1: Key Characteristics of Pharmacokinetic and Pharmacodynamic Modeling Approaches

Characteristic PBPK QSP Population PK AI/ML
Primary Approach Bottom-up, mechanistic [66] [67] Bottom-up, systems-level [68] Top-down, empirical [67] Data-driven, statistical [69]
Core Focus Predicting PK using physiology and drug properties [66] Modeling drug effects on biological systems [68] Characterizing variability in drug exposure [70] Identifying patterns from historical data [69]
Typical Application DDI, organ impairment, pediatrics [71] Target validation, mechanism of action, efficacy prediction [72] Covariate analysis, dosing optimization [70] Early DDI prediction, parameter forecasting [69] [73]
Data Requirements In vitro ADME data, physiological parameters [66] Biological pathway data, drug-target interactions [72] Sparse or rich clinical PK data [70] Large training datasets (structural, ADME) [69] [73]
Regulatory Acceptance Established for DDI (74.2% of PBPK submissions) [71] Growing (60 FDA submissions in 2020) [74] Well-established for dosing recommendations [67] Emerging for parameter prediction [73]
Key Software Tools Simcyp, GastroPlus, PK-Sim [66] [71] Various proprietary & open-source platforms [72] NONMEM, Monolix [70] [75] pyDarwin, TensorFlow, custom QSAR [73] [75]

Table 2: Data Requirements and Parameter Sources Across Modeling Paradigms

Parameter Type PBPK QSP Population PK AI/ML
Physicochemical Properties Molecular weight, logP, pKa [66] Variable based on biological system Limited incorporation Molecular descriptors, fingerprints [69]
In Vitro ADME CLint, permeability, plasma protein binding [66] Potentially incorporated in system context Rarely used directly Training features for parameter prediction [73]
Physiological/System Tissue volumes, blood flows [66] Biological pathways, network interactions [72] Not typically incorporated Can incorporate system parameters as features [73]
Clinical PK For model verification [66] For model qualification [74] Primary data source [70] Training labels and feature derivation [69]
Covariates Fixed physiological variability [66] Biological variability in system components Estimated relationships with PK parameters [70] Can be included as predictive features [75]

Preclinical stage: PBPK (in vitro to in vivo extrapolation); QSP (target identification and validation); AI/ML (early DDI prediction from chemical structure). Early clinical development: PBPK (first-in-human PK prediction); QSP (mechanism-of-action quantification); AI/ML (parameter prediction for PBPK/PopPK); PopPK (initial structural model development). Late clinical development and regulatory: PBPK (DDI and special-population dosing); PopPK (covariate analysis and dosing optimization); QSP (clinical trial design and efficacy prediction).

Diagram 1: Application Timeline of Modeling Approaches in Drug Development. This workflow illustrates the strategic deployment of different modeling methodologies across the drug development continuum, highlighting their complementary nature.

Experimental Protocols and Methodologies

PBPK Model Development Protocol

Objective: To construct and qualify a PBPK model for predicting human pharmacokinetics and assessing drug-drug interactions.

Required Input Parameters:

  • Physicochemical properties: Molecular weight, logP, pKa, solubility profile [66]
  • Plasma protein binding: Fraction unbound in plasma (fub) [66]
  • Blood-to-plasma ratio: B:P partitioning [66]
  • In vitro metabolism data: Intrinsic clearance (CLint) from hepatocytes or microsomes [66]
  • Enzyme kinetics: Vmax, Km for relevant metabolic pathways [66]
  • Inhibition/induction parameters: IC50, KI, kinact for DDI assessment [66]
  • Physiological parameters: Tissue volumes, blood flows (system-dependent) [66]

Protocol Steps:

  • Parameter Acquisition: Compile all drug-dependent parameters from in vitro assays (Table 1) [66].
  • Model Construction: Implement parameters into PBPK software (e.g., Simcyp, GastroPlus) using appropriate organ compartment structure [66].
  • Preclinical Verification: Verify model performance against in vivo preclinical PK data in relevant species [66].
  • Human PK Prediction: Simulate human PK profiles using verified model parameters [66].
  • Model Refinement: Apply middle-out approach using early clinical data to refine parameters if needed [66].
  • Application: Utilize qualified model for DDI, organ impairment, or special population simulations [71].

Qualification Criteria: Successful prediction of clinical PK parameters (AUC, Cmax) within 2-fold of observed values [66].
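The 2-fold qualification criterion is easy to encode. The AAFE (absolute average fold error) summary below is a common companion metric in PBPK verification, added here as an illustrative assumption rather than part of the cited protocol.

```python
import math

def within_fold(observed, predicted, fold=2.0):
    """PBPK acceptance check: predicted AUC/Cmax within `fold` of observed."""
    ratio = predicted / observed
    return 1.0 / fold <= ratio <= fold

def aafe(observed, predicted):
    """Absolute average fold error across a set of parameters/compounds."""
    logs = [abs(math.log10(p / o)) for o, p in zip(observed, predicted)]
    return 10 ** (sum(logs) / len(logs))

print(within_fold(observed=100.0, predicted=180.0))  # → True  (1.8-fold over)
print(within_fold(observed=100.0, predicted=45.0))   # → False (>2-fold under)
print(round(aafe([100.0, 50.0], [180.0, 45.0]), 2))  # → 1.41
```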

Population PK Model Development Protocol

Objective: To develop a population PK model characterizing sources of variability in drug exposure and identifying significant covariates.

Data Requirements:

  • PK samples: Sparse or rich concentration-time data from clinical studies [70]
  • Dosing records: Accurate documentation of dose amounts, times, and routes [70]
  • Patient covariates: Demographic, laboratory, genetic, and concomitant medication data [70]
  • Sampling matrix: Documentation of plasma, blood, or other matrix with appropriate handling [70]

Protocol Steps:

  • Data Preparation: Compile dataset with proper handling of BLQ data and validation of key variables [70].
  • Base Model Development: Identify optimal structural model (1, 2, or 3 compartments) using objective function value comparison [70].
  • Statistical Model: Estimate interindividual variability and residual error models [70].
  • Covariate Testing: Evaluate potential covariate relationships using stepwise forward addition/backward elimination [70].
  • Model Qualification: Assess using diagnostic plots, visual predictive checks, and bootstrap analysis [70].
  • Model Application: Simulate alternative dosing regimens for specific subpopulations [67].

Software Implementation:

  • Primary software: NONMEM, Monolix [70] [75]
  • Algorithm: FOCE, SAEM, or Bayesian estimation methods [70]
  • Model selection: AIC, BIC, or likelihood ratio test for nested models [70]
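For nested models compared by objective function value (OFV, i.e., -2 log-likelihood in the NONMEM convention), the likelihood ratio test reduces to a chi-squared lookup. A small sketch with invented OFVs:

```python
from scipy.stats import chi2

def lrt_p_value(ofv_reduced, ofv_full, df):
    """Likelihood ratio test for nested models: the drop in OFV between the
    reduced and full model is asymptotically chi-squared with df degrees of
    freedom (df = number of added parameters)."""
    delta = ofv_reduced - ofv_full
    return chi2.sf(delta, df)

# Hypothetical: adding one covariate drops the OFV by 7.2 points (df = 1)
p = lrt_p_value(ofv_reduced=2450.0, ofv_full=2442.8, df=1)
print(p < 0.01)  # → True (a drop above 6.63 is significant at alpha = 0.01, df = 1)
```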

AI/ML-Enhanced PK Modeling Protocol

Objective: To implement machine learning approaches for predicting PK parameters and automating model development.

Data Sourcing and Preparation:

  • Database Curation: Compile training data from public databases (e.g., PK-DB, DrugBank) or internal repositories [73].
  • Feature Engineering: Generate molecular descriptors (e.g., structural fingerprints, physicochemical properties) [69] [73].
  • Data Partitioning: Split data into training, validation, and test sets (typical ratio: 70/15/15).
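The 70/15/15 partition can be done with a seeded shuffle so splits are reproducible; the compound names here are placeholders.

```python
import random

def partition(items, ratios=(0.70, 0.15, 0.15), seed=42):
    """Shuffle a compound list and split it into train/validation/test sets."""
    rng = random.Random(seed)          # seeded for reproducible splits
    shuffled = items[:]
    rng.shuffle(shuffled)
    n = len(shuffled)
    n_train = int(n * ratios[0])
    n_val = int(n * ratios[1])
    return (shuffled[:n_train],
            shuffled[n_train:n_train + n_val],
            shuffled[n_train + n_val:])

train, val, test = partition([f"compound_{i}" for i in range(100)])
print(len(train), len(val), len(test))  # → 70 15 15
```

For ADME datasets, a scaffold-based split (grouping structurally similar compounds into the same partition) is often preferred over a random split to avoid overoptimistic validation results.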

Model Training Approaches:

  • QSAR Modeling: Implement random forest, support vector machines, or neural networks for ADME parameter prediction [73].
  • Automated PopPK: Utilize frameworks like pyDarwin with Bayesian optimization for structural model selection [75].
  • Neural-ODE: Apply neural ordinary differential equations for direct PK profile prediction [73].

Validation Framework:

  • Internal Validation: Cross-validation performance metrics (RMSE, MAE, R²) [75].
  • External Validation: Prediction accuracy on holdout test compounds [73].
  • Domain of Applicability: Assessment of model applicability to new chemical space [73].
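The internal-validation metrics named above (RMSE, MAE, R²) computed from scratch on hypothetical observed-vs-predicted values:

```python
import math

def regression_metrics(y_true, y_pred):
    """Internal-validation metrics for ML parameter prediction: RMSE, MAE, R^2."""
    n = len(y_true)
    resid = [t - p for t, p in zip(y_true, y_pred)]
    rmse = math.sqrt(sum(r * r for r in resid) / n)
    mae = sum(abs(r) for r in resid) / n
    mean_t = sum(y_true) / n
    ss_res = sum(r * r for r in resid)
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    r2 = 1.0 - ss_res / ss_tot
    return rmse, mae, r2

# Hypothetical observed vs. predicted clearance values
rmse, mae, r2 = regression_metrics([1.0, 2.0, 3.0, 4.0], [1.1, 1.9, 3.2, 3.8])
print(round(rmse, 3), round(mae, 2), round(r2, 2))  # → 0.158 0.15 0.98
```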

Research Reagent Solutions

Table 3: Essential Software and Database Resources for Pharmacokinetic Modeling

Resource Category Specific Tools Primary Application Key Features
PBPK Platforms Simcyp Simulator [71] PBPK modeling & simulation Population variability, DDI prediction
GastroPlus [66] PBPK & absorption modeling ACAT model, biowaiver considerations
PK-Sim [66] Whole-body PBPK Open systems biology framework
PopPK Software NONMEM [75] Population PK/PD analysis Industry standard, maximum likelihood estimation
Monolix Population PK/PD analysis SAEM algorithm, user-friendly interface
AI/ML Frameworks pyDarwin [75] Automated popPK development Bayesian optimization, model space search
TensorFlow/PyTorch Neural network development Flexible deep learning architectures
Database Resources PK-DB [73] Pharmacokinetic data Curated PK parameters across compounds
DrugBank Drug & target information Comprehensive drug property data

Integrated Workflow for Transfer Persistence Research

Chemical structure and properties feed molecular descriptors into AI/ML prediction models; in vitro ADME data (CLint, fu, Ka) feed the PBPK model. AI/ML supplies predicted parameters to PBPK and feature selection to the population PK model; PBPK provides informed prior estimates to PopPK; PopPK passes exposure predictions to the QSP model. Development decisions draw on PBPK (DDI and special populations), PopPK (dosing optimization), and QSP (efficacy and safety simulations).

Diagram 2: Integrated Modeling Workflow for Drug Development Decisions. This diagram illustrates the synergistic relationships between different modeling approaches, highlighting how outputs from earlier models inform subsequent analyses in a comprehensive drug development strategy.

The integration of these modeling approaches creates a powerful framework for addressing transfer and persistence considerations in pharmacological research. The combined workflow leverages the strengths of each methodology:

  • AI/ML provides rapid parameter estimates and model structure recommendations early in development when data are limited [69] [75].
  • PBPK incorporates mechanistic understanding and enables extrapolation to special populations [66] [71].
  • Population PK quantifies and explains variability in target populations using clinical data [70] [67].
  • QSP bridges PK and PD to predict clinical efficacy and safety outcomes [72] [68].

This integrated approach facilitates more informed decision-making throughout the drug development process, from candidate selection to late-stage clinical trials and regulatory submission.

Assessing Model Influence and Risk within the Totality of Evidence

Within forensic science, the evaluation of forensic findings given activity-level propositions (ALR) addresses questions of how or when DNA evidence was deposited [4]. This represents a shift from traditional source-level questions ("Whose DNA is this?") to activity-level questions ("How did this DNA get here?") [76]. Answering these questions requires a framework to assess the model influence and model risk of the methods used to interpret DNA transfer, persistence, prevalence, and recovery (DNA-TPPR) within the totality of evidence [77].

This protocol outlines a standardized approach for forensic researchers and scientists to evaluate the credibility and influence of computational models and methodologies used for activity-level assessments. The framework adapts risk-informed credibility assessment principles from other scientific disciplines to the specific challenges of forensic DNA-TPPR research [77].

Conceptual Framework and Key Definitions

Core Concepts

The credibility of a model is defined as the trust in its predictive capability for a specific context of use, established through the collection of evidence [77]. Assessing this credibility within the totality of evidence requires understanding several key concepts:

  • Model Influence: The contribution of the computational model relative to other evidence in making a decision. A model with high influence on a final decision requires more rigorous credibility assessment [77].
  • Model Risk: The possibility that the model and its results may lead to an incorrect decision and adverse outcome. Risk is determined by both model influence and decision consequence [77].
  • Totality of Evidence: The integrated consideration of all available lines of evidence, where the model is one component alongside other experimental, observational, and case-specific data.

The Risk-Informed Credibility Assessment Framework

The credibility assessment process involves multiple interconnected steps, as shown in the workflow below:

State question of interest → define context of use (COU) → assess model risk → establish model credibility → assess model credibility. If credibility is sufficient, the model is accepted for use; if insufficient, revise the COU or gather more data and repeat the assessment.

Diagram 1: Risk-informed credibility assessment workflow.

Experimental Protocols for Generating TPPR Data

To inform activity-level models, robust experimental data on DNA Transfer, Prevalence, Persistence, and Recovery (TPPR) is essential. The following protocols describe standardized methodologies for generating these foundational data.

Protocol 1: DNA Recovery from Skin Surfaces Using Double-Swabbing

Purpose: To optimally recover trace DNA deposits from human skin surfaces following physical contact, simulating assault scenarios [11].

Background: The double-swabbing technique is considered one of the most effective methods for recovering touch DNA from skin surfaces. Its efficacy has been demonstrated in mock assault scenarios where an "offender" grips the forearms of a "victim" [11].

  • Key Materials:

    • Cap-Shure sterile cotton swabs (Puritan) or nylon FLOQ swabs (Copan)
    • Sterile, DNA-free water
    • Microcentrifuge tubes
    • Swab storage boxes or paper envelopes
    • Personal protective equipment (gloves, mask)
  • Procedure:

    • Moisten the first swab: Lightly moisten a cotton or flocked swab with sterile, DNA-free water.
    • First swab application: Apply the moistened swab to the target skin area using a combination of rolling and rubbing motions, covering the entire surface to be sampled. Apply moderate pressure.
    • Air drying: Allow the first swab to air dry completely at room temperature for approximately 30-60 minutes.
    • Second swab application: Using a dry swab, repeat the rolling and rubbing motions over the same skin area.
    • Packaging: Place both swabs from the same sample into a single swab storage box or paper envelope. Seal and label clearly.
    • Storage: Store samples at room temperature until DNA extraction.

Note: Studies have shown the double-swabbing method can recover approximately 13.7% more offender DNA than single-swab methods [11].

Protocol 2: Assessing DNA Transfer in Social Settings

Purpose: To evaluate the frequency and extent of indirect DNA transfer in controlled, social environments, providing data for activity-level assessments [78].

Background: This uncontrolled, activity-level evaluation helps understand how DNA is transferred between individuals and onto objects through casual contact, which is crucial for interpreting DNA findings in forensic casework [78].

  • Key Materials:

    • Pre-cleaned residential environment
    • New, pre-sampled underwear for participants
    • Sterile swabs for intimate sampling (e.g., penile, pubic hair combing)
    • Common household items (e.g., glasses, cutlery, door handles, toilet handle)
    • Video recording equipment for contact monitoring
    • DNA quantification kits (e.g., Quantifiler Trio)
  • Procedure:

    • Preparation: Clean the experimental environment (e.g., a home) two weeks prior to the study. Conduct baseline DNA sampling of all target items.
    • Participant Preparation: On the day of the experiment, provide participants with new, researcher-supplied underwear. Collect reference DNA samples from all participants and their cohabitants.
    • Shedder Status Determination: Have each participant place their hand on a glass slide on three separate days. Quantify the deposited DNA to classify them as low, intermediate, or heavy shedders [78].
    • Social Interaction: Host a controlled social event (e.g., one hour) where participants engage in activities like playing board games, eating, drinking, and simulated bathroom use. Record all interactions from multiple angles.
    • Post-Interaction Sampling:
      • Sample all items contacted during the event.
      • Collect intimate samples (e.g., underwear, penile swabs) five hours post-event.
      • Participants self-comb pubic hair and perform penile swabbing.
    • Analysis: Quantify and profile all DNA samples. Correlate DNA findings with observed contacts, participant shedder status, and personal habits.

Protocol 3: Direct and Indirect DNA Transfer Simulation (ReAct Project)

Purpose: To simulate forensic scenarios involving direct and indirect DNA transfer for evaluating the value of DNA results given activity-level propositions [10].

Background: The ReAct project, supported by the European Network of Forensic Science Institutes (ENFSI), designed experiments to simulate typical casework circumstances like robberies where a tool is used to force entry [10].

  • Key Materials:

    • Screwdriver (target object)
    • Door or window frame (forced entry point)
    • Sterile swabs for DNA collection
    • DNA extraction and quantification kits
    • Probabilistic genotyping software
  • Procedure:

    • Direct Transfer Experiment:
      • A known "defendant" owns and uses the screwdriver.
      • An unknown person uses the defendant's stolen screwdriver to force a door/window.
      • Sample the screwdriver handle and the forced entry point.
    • Indirect Transfer Experiment:
      • The defendant neither owns nor uses the screwdriver.
      • An unknown offender touches an object previously handled by the defendant, then uses the screwdriver to force entry.
      • Sample the screwdriver handle and the object touched by the defendant.
    • Analysis:
      • Extract and quantify DNA from all samples.
      • Generate DNA profiles using standard methods and probabilistic genotyping.
      • Analyze results using Bayesian Networks (e.g., Shiny_React() application) to calculate likelihood ratios for activity-level propositions [10].
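Full evaluations use Bayesian networks such as the Shiny_React() application. As a deliberately simplified illustration of the underlying arithmetic only, the defense proposition can pool indirect-transfer and background probabilities; all rates below are invented, not drawn from the ReAct data.

```python
def p_find_given_hd(p_indirect, p_background):
    """P(POI's DNA on the tool | Hd): indirect transfer or background
    prevalence, treated here as independent routes."""
    return 1 - (1 - p_indirect) * (1 - p_background)

def activity_lr(p_direct, p_indirect, p_background):
    """Toy activity-level LR for 'POI used the tool' vs. 'POI never touched it'."""
    return p_direct / p_find_given_hd(p_indirect, p_background)

# Hypothetical TPPR-derived rates: direct handling leaves a recoverable
# profile in 60% of trials; two-step transfer in 5%; background in 2%
lr = activity_lr(p_direct=0.60, p_indirect=0.05, p_background=0.02)
print(round(lr, 1))  # → 8.7
```

A real Bayesian network would additionally condition on DNA quantity, profile type (single vs. mixture), persistence time, and laboratory-specific recovery efficiency, which is exactly why the inter-laboratory variation reported below matters.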

Quantitative Data and Analysis

Inter-Laboratory DNA Recovery Variation

Data from the ReAct project, involving 23 laboratories, revealed significant variation in DNA recovery rates, which directly impacts activity-level assessments [10].

Table 1: Inter-Laboratory DNA Recovery Variation in ReAct Project

Experiment Type Number of Labs Median DNA Recovery Range Impact on Activity-Level Assessment
Direct & Indirect Transfer 23 200 pg – 5 ng Considerable differences in Likelihood Ratios (LRs) due to varying recovery efficiencies between laboratories.
Direct Transfer 23 Varies by lab Single contributor profiles matching the person of interest (POI) effectively discriminated between propositions.
Indirect Transfer 23 Varies by lab Both single and major contributor profiles matching the POI supported the proposition of tool use.

DNA Quantification in Social Setting Studies

Studies of DNA transfer in social settings provide quantitative data on expected DNA amounts from various interactions.

Table 2: DNA Recovery from Social Setting Experiments

Sample Type Number of Samples DNA Quantity Range Average DNA Quantity Average Number of Contributors
Various Items (Vase, Utensils) 127 0.06 ng – 46 ng 3.5 ng 2
Female Underwear (Gusset) 1 sample reported 45 ng 45 ng Not specified
Female Underwear (Non-gusset) Multiple samples Not specified 1.2 ng (Visitor); 0.25 ng (Host) Not specified
Male Underwear Multiple samples Not specified 2.8 ng (Host); 0.09 ng (Visitor) Not specified
Toilet Handle / Doorknob 2 samples reported Not specified Not specified 4

The Scientist's Toolkit: Essential Research Reagents and Materials

Table 3: Key Reagents and Materials for DNA TPPR Research

Item Function/Brief Explanation Example Brands/Types
Cotton Swabs Standard collection tool for DNA from surfaces. Viscose or cotton. Puritan Cap-Shure [11]
FLOQ Swabs Flocked swabs with perpendicular fibers for improved sample release and recovery. Copan Nylon FLOQ Swabs [11]
Tape Lifts Alternative collection method for dry surfaces on external skin. SceneSafe FAST minitape [11]
Quantifiler Trio Kit Quantitative real-time PCR kit for human DNA quantification and assessment of degradation/inhibition. Thermo Fisher Scientific [78]
Yfiler Plus Kit Amplification kit for Y-STR analysis, useful for detecting male DNA in mixed samples, especially in intimate swabs. Thermo Fisher Scientific [78]
Direct PCR Kits Amplification without prior DNA extraction, maximizing sensitivity for trace DNA samples. Various [11]
Bayesian Network Software Computational tool for calculating likelihood ratios given activity-level propositions based on TPPR data. Shiny_React() [10]
Probabilistic Genotyping Software Software for interpreting complex DNA mixtures using statistical models. STRmix, EuroForMix

Application and Decision Framework

The final assessment of model credibility determines its suitability for use in addressing the original question of interest. This decision process, informed by the totality of evidence, is summarized below:

Totality of evidence (including model output) → credibility assessment outcome. If the model passes the acceptance bar (high credibility), its output is used for the decision; if it fails (low or moderate credibility), the model's influence is downgraded or more supporting data are gathered.

Diagram 2: Decision framework based on credibility assessment.

Conclusion

The integration of robust transfer, persistence, and recovery principles into activity-level evaluation represents a paradigm shift for enhancing predictive accuracy in both forensic science and drug development. By adopting a structured, fit-for-purpose methodology—from foundational understanding and standardized protocols to rigorous validation—researchers can overcome existing adoption barriers and provide more credible, court-ready or clinic-ready evidence. Future progress hinges on global collaboration to build comprehensive, open-access data repositories, refine model-informed frameworks like MIDD, and embrace emerging AI technologies. This will ultimately bridge the gap between trace evidence and conclusive activity-level interpretation, strengthening decision-making in legal and therapeutic contexts alike.

References