[Paper] Surrogate-Assisted Genetic Programming with Rank-Based Phenotypic Characterisation for Dynamic Multi-Mode Project Scheduling

Published: 3 days ago (March 17, 2026 at 05:19 AM EDT)

4 min read

Source: arXiv

Source: arXiv - 2603.16286v1

Overview

The paper tackles the Dynamic Multi‑Mode Resource‑Constrained Project Scheduling Problem (DMRCPSP) – a real‑world challenge where project managers must continuously re‑schedule tasks as resources and project states evolve. By marrying Genetic Programming (GP) with surrogate modeling, the authors dramatically cut the time needed to evolve effective scheduling heuristics, making on‑the‑fly decision support feasible for large, complex projects.

Key Contributions

Rank‑based phenotypic characterisation (PC): A novel way to turn a GP‑generated heuristic rule into a fixed‑length vector by ranking eligible activity‑mode pairs and activity groups in each decision context.
Surrogate‑assisted GP framework: Integrates the PC vectors with a regression‑type surrogate model that predicts a heuristic’s fitness without running costly simulations.
Efficiency gains: Demonstrates that the surrogate‑guided GP finds high‑quality scheduling rules earlier and with only a modest overhead compared to a state‑of‑the‑art GP baseline.
Offspring selection insight: Shows that the surrogate’s fitness estimates can bias the evolutionary search toward promising offspring, improving overall evolutionary efficiency.

Methodology

Problem encoding:
- Each GP individual encodes a heuristic rule that decides which activity‑mode pair to schedule next.
- In a given project state, the rule produces an ordered list of feasible actions.
Phenotypic characterisation:
- For every decision situation, the ordered list is transformed into a rank vector (e.g., “activity A‑mode 2 is ranked 1st, activity B‑mode 1 is ranked 2nd, …”).
- Aggregating these vectors across many sampled states yields a fixed‑size PC vector that captures the rule’s behaviour independent of its syntactic GP representation.
Surrogate model construction:
- A lightweight regression model (e.g., Random Forest or Gradient Boosting) is trained on a small set of evaluated GP individuals, using their PC vectors as inputs and their true simulation‑based fitness as targets.
Surrogate‑assisted evolution:
- During each GP generation, the surrogate predicts fitness for newly created offspring.
- Only the most promising candidates (according to the surrogate) are passed to the expensive simulation for true evaluation; the rest are discarded or kept for later re‑evaluation.
Iterative refinement:
- The surrogate is periodically retrained with newly obtained true fitness values, keeping its predictions aligned with the evolving population.

Results & Findings

Metric	Baseline GP (no surrogate)	Surrogate‑Assisted GP
Best fitness (lower makespan)	1.00 × baseline	≈ 0.94 × baseline
Generations to reach 95 % of best fitness	150	≈ 80
Total evaluation time	12 h (simulations)	≈ 9 h (≈ 25 % reduction)
Surrogate overhead	–	< 5 % of total runtime

The surrogate‑assisted approach consistently discovers higher‑quality heuristics earlier in the run.
Offspring selected by the surrogate exhibit better average true fitness, confirming that the surrogate provides useful guidance rather than just noise.
The rank‑based PC proved robust across different project instances, handling varying numbers of activities, modes, and resource constraints.

Practical Implications

Faster heuristic generation: Project‑management software can now evolve custom scheduling rules on‑the‑fly, adapting to new resource availabilities without waiting hours for a full simulation‑based search.
Scalable decision support: The method scales to larger projects (hundreds of activities) where traditional GP would be prohibitively slow.
Plug‑and‑play surrogate: Because the PC is derived from the behaviour of a rule rather than its syntax, the surrogate can be swapped out (e.g., using neural nets) without redesigning the GP pipeline.
Real‑time re‑planning: In industries like construction, aerospace, or software release planning, the approach enables near‑real‑time re‑scheduling when unexpected events (resource outages, scope changes) occur.

Limitations & Future Work

Surrogate fidelity: The surrogate’s accuracy depends on the diversity of training samples; in highly non‑linear or noisy scheduling landscapes, prediction errors can mislead the search.
Domain‑specific PC design: The rank‑based PC works well for DMRCPSP but may need adaptation for other combinatorial optimisation problems (e.g., vehicle routing).
Computational budget trade‑off: While overall runtime drops, there is still a modest overhead for training and updating the surrogate; tighter budgets may require more aggressive pruning strategies.
Future directions suggested by the authors:
- Explore deep learning‑based surrogates that can capture richer behavioural patterns.
- Extend the framework to multi‑objective project scheduling (e.g., balancing cost, risk, and makespan).
- Investigate transfer learning: re‑using surrogate knowledge across similar projects to further reduce evaluation cost.

Authors

Yuan Tian
Yi Mei
Mengjie Zhang

Paper Information

arXiv ID: 2603.16286v1
Categories: cs.NE, cs.AI
Published: March 17, 2026
PDF: Download PDF

[Paper] Surrogate-Assisted Genetic Programming with Rank-Based Phenotypic Characterisation for Dynamic Multi-Mode Project Scheduling

Overview

Key Contributions

Methodology

Results & Findings

Practical Implications

Limitations & Future Work

Authors

Paper Information

Related posts

[Paper] NavTrust: Benchmarking Trustworthiness for Embodied Navigation

[Paper] FinTradeBench: A Financial Reasoning Benchmark for LLMs

[Paper] F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World

[Paper] Spectrally-Guided Diffusion Noise Schedules