DBIx::Class Fix Plan

April 30, 2026 · View on GitHub

Module: DBIx::Class 0.082844 (installed via jcpan) Branch: feature/dbix-class-destroy-weaken | PR: https://github.com/fglock/PerlOnJava/pull/485

Documentation Policy

Every non-trivial code change MUST document: what it solves, why this approach, what would break if removed.

Installation & Paths

Path	Contents
`~/.perlonjava/lib/`	Installed modules (`@INC` entry)
`~/.perlonjava/cpan/build/DBIx-Class-0.082844-NN/`	Build dir with tests

DBIC_BUILD=$(ls -d ~/.perlonjava/cpan/build/DBIx-Class-0.082844-* 2>/dev/null | grep -v yml | sort -t- -k5 -n | tail -1)

How to Run the Suite

cd /Users/fglock/projects/PerlOnJava3 && make
cd "$DBIC_BUILD"
JPERL=/Users/fglock/projects/PerlOnJava3/jperl
mkdir -p /tmp/dbic_suite
for t in t/*.t t/storage/*.t t/inflate/*.t t/multi_create/*.t t/prefetch/*.t \
         t/relationship/*.t t/resultset/*.t t/row/*.t t/search/*.t \
         t/sqlmaker/*.t t/sqlmaker/limit_dialects/*.t t/delete/*.t t/cdbi/*.t; do
    [ -f "$t" ] || continue
    timeout 60 "$JPERL" -Iblib/lib -Iblib/arch "$t" > /tmp/dbic_suite/$(echo "$t" | tr '/' '_' | sed 's/\.t$//').txt 2>&1
done
# Summary excluding TODO failures
for f in /tmp/dbic_suite/*.txt; do
    real=$(grep "^not ok " "$f" 2>/dev/null | grep -v "# TODO" | wc -l | tr -d ' ')
    [ "$real" -gt 0 ] && echo "FAIL($real): $(basename $f .txt)"
done | sort

Remaining Failures

File	Count	Status
`t/52leaks.t`	7 (tests 12-18)	Deep — refCount inflation in DBIC LeakTracer's `visit_refs` + ResultSource back-ref chain. Needs refCount-inflation audit; hasn't reproduced in simpler tests
`t/52leaks.t` (line 430, harness-only)	NEW — INVESTIGATE NEXT	"Unable to perform storage-dependent operations with a detached result source (source 'Artist' is not associated with a schema)". Test passes standalone (11/11 in 46s) but raises this exception when run as part of `./jcpan -t DBIx::Class` after ~20 prior tests. Schema's weak ref to ResultSource (or RS's strong ref to schema?) is being cleared prematurely — DESTROY firing on the schema while a child resultset still expects it. Different failure mode from tests 12-18 (which is a leak, this is over-eager DESTROY). See "Investigation Plan: Schema-Detached Bug" below.
`t/storage/txn_scope_guard.t`	1 (test 18)	Needs DESTROY resurrection semantics (strong ref via @DB::args after MIN_VALUE). Tried refCount-reset approach — caused infinite DESTROY loops when WARN handler re-triggers captures. Needs architectural redesign (separate "destroying" state from MIN_VALUE sentinel)

t/storage/txn.t — FIXED (90/90 pass) via Fix 10m (eq/ne fallback semantics).

Investigation Plan: Schema-Detached Bug in t/52leaks.t (line 430) — IN PROGRESS

Symptom

Under ./jcpan -t DBIx::Class (full 314-test suite), t/52leaks.t fails with:

DBIx::Class::ResultSource::schema(): Unable to perform storage-dependent operations
with a detached result source (source 'Artist' is not associated with a schema).
at t/52leaks.t line 430

Tests were run but no plan was declared and done_testing() was not seen — i.e. the test died mid-execution, not at the leak-detection assertion.

Standalone ../../jperl -Ilib -It/lib t/52leaks.t passes 11/11 in ~46 s. Failure only manifests when ~20+ prior DBIC tests have run through the same harness JVM.

Code path that triggers it

t/52leaks.t lines 414–438 iterate a chain of accessor closures over $phantom:

for my $accessor (
    sub { shift->clone },
    sub { shift->resultset('CD') },
    sub { shift->next },
    sub { shift->artist },
    sub { shift->search_related('cds') },
    sub { shift->next },
    sub { shift->search_related('artist') },
    sub { shift->result_source },
    sub { shift->resultset },              # <── line 430 fails here
    sub { shift->create({ name => 'detached' }) },
    ...
) {
    $phantom = populate_weakregistry( $weak_registry, scalar $_->($phantom) );
}

Each step replaces $phantom. The step-before-failure produces a ResultSource (via ->result_source); the failing step calls ->resultset on it, which does $self->schema or die 'detached…'. So the ResultSource's schema attribute (an inflated weak ref) is empty by the time we read it.

Hypothesis

A previous test in the harness has populated the global walker / weak-ref state in a way that makes the schema's weak ref to itself get auto-cleared during a mid-statement walker pass. When the row's result_source is later asked for its schema, the weak ref reads as undef.

The walker-gate property fix (PR #618 / commit ce8186e89) widened the gate to fire on every blessed object whose storedInPackageGlobal flag is set. The perf-cache fix in 691f95386 keeps the BFS bounded but doesn't change which objects get gated. If a Schema/ResultSource pair becomes gate-eligible mid- test under cumulative state pressure, weak-ref clearing is over-applied.

Diagnostic plan

Pinpoint which earlier test contaminates the JVM. Run the same DBIC prefix one test at a time and bisect: with [00..101], [00..52], [00..40], etc., find the smallest prefix whose final state makes a freestanding populate_weakregistry( $weak_registry, $phantom->result_source->resultset ) throw.
Capture the exact moment. With the bisected prefix, instrument DBIx::Class::ResultSource::schema() (or the Java side at WeakRefRegistry.clearWeakRefsTo) to log:
- which Schema instance is having its weak refs cleared,
- what triggered the clear (DESTROY? walker pass? scope exit?),
- the call stack in Perl + Java at the moment of the clear.
Compare reachability: at the moment of the clear, is the Schema actually unreachable (in which case the clear is correct and DBIC has a genuine ref-tracking gap), or is it reachable but the walker missed it? If walker missed it, that's a PerlOnJava bug in ReachabilityWalker.
Verify with c4db69e8d baseline. That commit's documented run is ./jcpan --jobs 1 -t DBIx::Class → 0/13858 fails. If we can apply just the relevant commits (PR #618 walker-gate property change + 691f95386 perf cache) on top of c4db69e8d's parent and reproduce the failure, the property-based gate is the regression source.

What we already know (from today's instrumentation)

The harness parent JVM is not the bottleneck. 10 jstack samples over 32 min show the parent in IOOperator.selectWithNIO Thread.sleep(10) polling 99.7 % of the time (6 s CPU in 32 min wall). It's just waiting.
The harness uses IPC::Open3 → ProcessInputHandle, which does correctly return false from isReadReady() when the child is silent — that's the intended behaviour, not the bug.
The orphan-watchdog landed in PR #635 prevents leftover JVMs from contaminating subsequent runs (no more 100% CPU starvation), but does NOT fix the schema-detached exception itself.

Confirmed root cause — `ScalarRefRegistry` race against forced GC

The Schema's weak ref is being cleared by MortalList.maybeAutoSweep() (a 5-second-throttled ReachabilityWalker.sweepWeakRefs(true) invocation that fires at every statement boundary once WeakRefRegistry.weakRefsExist flips). This is timing-dependent: which test trips the bug depends on when the 5-second timer happens to expire relative to the test's accessor chain.

Confirmed via env var:

Mode	t/52leaks.t result
default (auto-GC every 5 s)	crashes mid-test: `detached result source` at line 430
`JPERL_NO_AUTO_GC=1`	runs to completion, 14/23 subtests fail (the existing tests 12-18 leak detection)

The walker DOES seed lexicals (ReachabilityWalker.walk() lines 111–153, two loops):

// (a) ScalarRefRegistry — every RuntimeScalar that holds a ref
for (RuntimeScalar sc : ScalarRefRegistry.snapshot()) {
    if (sc.captureCount > 0) continue;
    if (WeakRefRegistry.isweak(sc)) continue;
    if (MortalList.isDeferredCapture(sc)) continue;
    if (!MyVarCleanupStack.isLive(sc)) {
        if (sc.scopeExited) continue;
        if (!sc.refCountOwned) continue;
    }
    visitScalar(sc, todo);
}
// (b) MyVarCleanupStack — explicit live my-var registration
for (Object liveVar : MyVarCleanupStack.snapshotLiveVars()) {
    if (liveVar instanceof RuntimeScalar sc) { ... }
    else if (liveVar instanceof RuntimeBase rb) { ... }
}

But there's a race. sweepWeakRefs calls ScalarRefRegistry.forceGcAndSnapshot() BEFORE iterating the snapshot. ScalarRefRegistry is a WeakHashMap. Any scalar whose only live JVM-side reference is a stack-frame local can get GC'd from the registry between the force-GC and the snapshot — even though the Perl-level lexical is still on the stack and reachable.

When that happens to the test's my $schema:

Path (a) misses it (gone from ScalarRefRegistry.snapshot() after the forced GC).
Path (b) misses it too — MyVarCleanupStack (Phase D-W1) was added specifically for my @arr / my %hash (RuntimeArray / RuntimeHash). A my $scalar = $ref does not register there because scalars aren't tracked by that mechanism.

Result: the schema is seedable only through the WeakHashMap path — which is exactly the one path that races against the forced GC inside sweepWeakRefs itself.

This explains:

Why simple reproducers pass — short scopes, low GC pressure, the WeakHashMap entry survives long enough to be snapshotted.
Why DBIC fails after ~20 prior tests — cumulative GC pressure raises the probability that my $schema is the unlucky entry GC'd between forceGcAndSnapshot and the snapshot read.

Fix path (narrow, TDD ordering)

Make MyVarCleanupStack track my $scalar = $ref strongly the same way it tracks my @arr / my %hash. Then walker path (b) finds the schema's lexical even when path (a)'s WeakHashMap entry has been GC'd.

Order matters — test before fix:

Add JPERL_FORCE_SWEEP_EVERY_FLUSH=1 debug knob ✅ DONE (this PR). In MortalList.maybeAutoSweep() — bypasses the 5-s throttle and the weakRefsExist gate when the env var is set. Required for (2) to be deterministic; same code is also useful for any future walker investigation.
Add a FAILING unit test under src/test/resources/unit/refcount/walker_lexical_scalar_root.t that uses JPERL_FORCE_SWEEP_EVERY_FLUSH=1 to deterministically reproduce the race.

⚠️ STATUS — hypothesis disconfirmed. The simple-lexical reproducer PASSES under JPERL_FORCE_SWEEP_EVERY_FLUSH=1$ + 20 \times $Internals::jperl_gc(). So does a DBIC-shape reproducer with global-registered schemas. The walker correctly seeds both paths today.

The actual DBIC bug is somewhere else — likely tied to:
- Moo / Class::C3::XS / MRO interaction with refCount tracking
- DBIC's per-row _result_source cached weak ref via accessor magic
- Or Storable's seen-table inflating refcounts during dclone
- Or some other DBIC-specific structural cycle
The fix path below ("Implement the fix in EmitVariable.java") is speculative — it might help, might not. Don't implement it without first capturing a real DBIC failure with the diagnostic knob enabled and inspecting which seeding gate dropped the schema.
Capture a real DBIC failure with the diagnostic knob. Next investigation step: add JPERL_WALKER_TRACE=1 instrumentation to ReachabilityWalker.sweepWeakRefs() so every cleared weak ref is logged with its target identity + findPathTo() output. Then run:
```
JPERL_FORCE_SWEEP_EVERY_FLUSH=1 JPERL_WALKER_TRACE=1 \
    ./jcpan -t DBIx::Class > /tmp/full.log 2>&1
```
The first line in the trace whose target is a Schema/ResultSource that DBIC subsequently complains about is the actual blind spot. Then we know which seeding gate to fix — without speculating.
~~Implement the fix in EmitVariable.java~~ — DEFERRED until (3) gives us evidence about which gate is actually missing.
~~Run the unit test~~, ~~Run the full DBIC suite~~ — same.

Next steps (concrete, in order)

This PR (#635) ships the diagnostic infrastructure (JPERL_FORCE_SWEEP_EVERY_FLUSH) and the corrected investigation plan. Whoever picks up the actual fix should:

Step A — Add `JPERL_WALKER_TRACE=1` instrumentation

In src/main/java/org/perlonjava/runtime/runtimetypes/ReachabilityWalker.java, inside sweepWeakRefs(boolean quiet) (around the loop that clears weak refs to unreachable objects), add an env-gated System.err.println for each clear that records:

WALKER_CLEAR target=<className>@<identityHashCode>
             refCount=<base.refCount>
             refCountOwned=<sc.refCountOwned>
             scopeExited=<sc.scopeExited>
             captureCount=<sc.captureCount>
             storedInPackageGlobal=<base.storedInPackageGlobal>
             path=<findPathTo(target) or "<unreachable>">
             seedStats=globals:<n> myVars:<n> scalarRefReg:<n>

The seed-stats snapshot is the critical piece — it tells us whether the schema's lexical was eligible to be seeded, even if it ended up filtered out.

Step B — Capture a real DBIC failure

JPERL_FORCE_SWEEP_EVERY_FLUSH=1 JPERL_WALKER_TRACE=1 \
    timeout 3000 ./jcpan -t DBIx::Class > /tmp/dbic.log 2>/tmp/dbic.trace

In the resulting dbic.trace file, find the first WALKER_CLEAR line whose target class is DBIx::Class::Schema, DBIx::Class::ResultSource::Table, or DBIx::Class::Storage::DBI (the classes whose weak-ref clearing produces the detached result source exception). That single trace line identifies the seeding gate that incorrectly excluded a still-live object.

Step C — Fix the specific gate

Once we know which property/state caused the false-negative seeding, the fix is in ReachabilityWalker.walk() Phase 2 (lines 111–153). Most likely candidates:

The sc.captureCount > 0 skip (path (a)) is over-eager — closures that capture but are themselves only weakly held in globalCodeRefs won't be walked, leaving the captured scalar un-seeded. Fix: remove the captureCount > 0 skip when the scalar is refCountOwned (the closure capture is real ownership).
MyVarCleanupStack.isLive(sc) returns false for scalars allocated by JVM-stack temporaries during method dispatch (e.g. @_ storage). Fix: always seed refCountOwned=true scalars regardless of isLive.
A specific DBIC pattern (e.g. Class::C3::XS next-method dispatch, Moo accessor magic, Sub::Quote eval-string captures) creates a reachability path through closures that the walker doesn't know how to follow. Fix: walk additional capture sources.

The trace tells us which.

Step D — Promote the fix's regression test

Once Step B identifies the actual pattern, the smallest version of it becomes the regression test under src/test/resources/unit/refcount/walker_<specific_gate>.t. The dev/sandbox/walker_blind_spot/*_PASSES.t files in this PR are starting points for that — they set up the JPERL_FORCE_SWEEP_EVERY_FLUSH harness correctly, just don't exercise the actual blind spot yet.

Step E — Verify on full DBIC suite

timeout 3600 ./jcpan -t DBIx::Class > /tmp/final.log 2>&1
grep "Files=" /tmp/final.log    # must show 0/N failures

Run on a quiet box (no concurrent jcpan/jprove from other worktrees) so the test is meaningful.

Architectural note (don't repeat past mistakes)

/dev/modules/dbix_class.md "What Didn't Work" warns:

Cascading cleanup after rescue → destroys Schema internals
WEAKLY_TRACKED for birth-tracked objects → refcounts inaccurate
DESTROY resurrection via refCount=0 reset → infinite loops

The fix here is narrower: ensure the walker's reachability set is complete (so the sweep doesn't clear refs to live objects). It is NOT to disable the sweep entirely — that breaks the leak-detection tests (observed: 14/23 fails with JPERL_NO_AUTO_GC=1).

Status

Reproduced under ./jcpan -t DBIx::Class (occasionally; today on test t/52leaks.t ~test #21 of the suite).
Confirmed root cause: maybeAutoSweep() 5-s timer + ReachabilityWalker missing live lexicals as roots. Disabling auto-GC removes the crash; keeping it on with broken reachability clears live weak refs.
Build deterministic repro infrastructure (next step) — see "How to make this reliably reproducible" below. Without that, every repro attempt is timing-dependent and we waste cycles on flaky runs.
Pinpoint which seeding/walking phase in ReachabilityWalker misses the test-scope my $schema lexical.
Add lexical seeding and re-run; expect t/52leaks.t to pass under auto-GC at ALL test positions in the harness.
Verify under full DBIC suite (must hit 0/314 fails).

How to make this reliably reproducible

Today's testing is flaky because the bug only fires when the auto-sweep 5-s timer expires at a precise moment relative to Perl's statement boundaries. Naive reproducers either complete too fast (no sweep fires) or have lexical roots so simple the walker can't miss them. We need infrastructure that forces both knobs:

1. Force sweep timing — `JPERL_FORCE_SWEEP_EVERY_FLUSH=1`

Add a debug-only env var that makes MortalList.maybeAutoSweep() fire on every MortalList.flush() call (i.e. at every Perl statement boundary), bypassing the 5-s throttle and the WeakRefRegistry.weakRefsExist gate. With that, ANY reproducer pattern that would hit the walker's blind spot fails on the first statement.

This converts a stochastic 1-in-314 timing race into a deterministic "sweep happens here" → "schema cleared here" → "next access dies" sequence we can debug step-by-step.

Implementation: gate the existing throttle/flag check in MortalList.maybeAutoSweep() (lines 643-651) on !System.getenv("JPERL_FORCE_SWEEP_EVERY_FLUSH"). ~3 lines.

2. Walker diagnostic transcript — `JPERL_WALKER_TRACE=1`

When set, ReachabilityWalker.sweepWeakRefs() writes a structured log line for EVERY weak-ref it clears, including:

target classname + System.identityHashCode
findPathTo(target) output (which is "" for the cases we care about)
a one-line snapshot of which seeding sources fired this walk (globalCodeRefs.size, globalHashes.size, ScalarRefRegistry.snapshot.size, MyVarCleanupStack.snapshotLiveVars.size)
caller stack (1-2 frames of native Java; useful for distinguishing manual Internals::jperl_gc() vs MortalList.flush()-triggered)

Together with (1), running:

JPERL_FORCE_SWEEP_EVERY_FLUSH=1 JPERL_WALKER_TRACE=1 \
    ./jperl small_repro.t 2> /tmp/sweep_trace.log

…produces an exact ordered transcript of every clear in the small reproducer. The first line whose target is a Schema (or any blessed object the test still wants alive) is the bug.

3. Tiered reproducers (graduate from simple → DBIC-like)

Today's dev/sandbox/walker_blind_spot/simple_lexical_repro.t doesn't fail — too simple. We need a tier of progressively-richer reproducers to find the smallest one that fails under (1):

T1 (simplest): 1 schema, 1 result-source, 1 weakened back-ref. Already exists; passes.
T2: T1 + holding the schema indirectly through a closure-captured $self chain (mirroring DBIC's accessor closures).
T3: T2 + passing the schema through @_ arg-pass via a method call that uses shift to consume it.
T4: T3 + using overloaded operators (DBIC ResultSource has "" overload via Carp::Clan; many JVM temporaries from stringify).
T5: T4 + populating WeakRefRegistry with thousands of unrelated weakened scalars, like populate_weakregistry() does.
T6: T5 + interleaving dclone on a separate complex structure to inflate Storable's internal seen-table, mirroring t/52leaks.t's $fire_resultsets->().

The smallest tier that fails under JPERL_FORCE_SWEEP_EVERY_FLUSH=1 is the bug-trigger pattern. We add it as a unit test under src/test/resources/unit/refcount/walker_blind_spot.t so any future fix has an automated guard.

4. Prefix bisection on the full DBIC suite

Independent of the small-repro work, narrow the harness reproduction. The current full run is ~40 min and hits ~1-2 failures stochastically. Build a prefix bisection harness:

cd cpan_build_dir/DBIx-Class-0.082844
JPERL_FORCE_SWEEP_EVERY_FLUSH=1 JPERL_WALKER_TRACE=1 \
    timeout 600 ../../jperl -MTest::Harness \
        -e 'test_harness(0, "blib/lib", "blib/arch")' \
        t/52leaks.t

If the SCALAR-prefix (just t/52leaks.t alone) fails under (1), we have a 1-test deterministic harness reproducer in <30s. If it doesn't fail, add prior tests one at a time (binary search on the suite list) until it does. The smallest failing prefix is reliable repro.

Plan ordering (to minimize wasted effort)

Implement (1) and (2) in PerlOnJava — both are small, debug-only, gated on env vars. Cost: ~30 min of dev work.
Run (4) prefix bisection with (1) + (2) enabled — gives deterministic harness repro within ~1 hour.
Inspect the walker transcript at the moment of premature clear. That tells us exactly which seeding gate dropped the schema.
Fix the seeding gate in ReachabilityWalker.walk().
Run the full suite (no debug envs) to verify the fix.
Promote the smallest reproducer from (3) into src/test/resources/unit/refcount/walker_blind_spot.t so the fix stays fixed.

Why we can't ship without this fix

A user running jcpan DBIx::Class will see a clean install when run alone (passes standalone) but a failed install under the published smoke-test infrastructure. That's a worse user experience than the current pre-PR-#635 state (where the storable bugs blocked things up front). Per @dev/cpan-reports/cpan-compatibility.md we publish "DBIx::Class PASS" — we can't ship a regression behind that flag.

Completed Fixes

Fix	What	Key Insight
1	LIFO scope exit + rescue detection	`LinkedHashMap` for declaration order; detect `$self` rescue in DESTROY
2	Deferred weak-ref clearing for rescued objects	Sibling ResultSources still need weak back-refs
3	DBI `RootClass` attribute for CDBI compat	Re-bless handles into `${RootClass}::db/st`
4	`clearAllBlessedWeakRefs` + exit path	END-time sweep for all blessed objects; also run on `exit()`
5	Auto-finish cached statements	`prepare_cached` should `finish()` Active reused sth
6	`next::method` always uses C3	Perl 5 always uses C3 regardless of class MRO setting
7	Stash delete weak-ref clearing + B::REFCNT fix	`deleteGlob()` triggers clearWeakRefs
8	DBI BYTE_STRING + utf8::decode conditional	Match DBD::SQLite byte-string semantics
9	DBI UTF-8 round-trip + ClosedIOHandle	Proper UTF-8 encode/decode for JDBC
10a	Clear weak refs when `localBindingExists` blocks callDestroy	In `flush()` at refCount 0
10d	`clearAllBlessedWeakRefs` clears ALL objects	END-time safety net no longer blessed-only
10e	`createAnonymousReference()` for Storable/deserializers	Anon hashes from dclone no longer look like named `\%h`
10f	Cascade scope-exit cleanup when weak refs exist	`WeakRefRegistry.weakRefsExist` fast-path flag
10g	`base.pm`: treat `@ISA` / `$VERSION` as "already loaded"	Fixes `use base 'Pkg'` on eval-created packages. DBIC t/inflate/hri.t now 193/193
10h	`flock()` allows multiple shared locks from same JVM	Per-JVM shared-lock registry keyed by canonical path. Fixes `t/cdbi/columns_as_hashes.t` hang
10i	`fork()` doesn't emit `1..0 # SKIP` after tests have run	Only emits when `Test::Builder->current_test == 0`. Sets $! to numeric EAGAIN + auto-loads Errno. Fixes DBIC txn.t "Bad plan"
10j	DBI stores mutable scalars for user-writable attrs	`new RuntimeScalar(bool)` instead of `scalarTrue` so `$dbh->{AutoCommit} = 0` works
10k	Overload `""` self-reference falls back to default ref form	Identity check in `toStringLarge` + ThreadLocal depth guard in `Overload.stringify`
10l	`@DB::args` preserves invocation args after `shift(@_)`	New `originalArgsStack` (snapshot) in RuntimeCode parallel to live `argsStack`
10m	`eq`/`ne` throw "no method found" when overload fallback not permitted	Match Perl 5: blessed class with `""` overload but no `(eq`/`(ne`/`(cmp` and no `fallback=>1` → throw. Fixes DBIC t/storage/txn.t test 90

What Didn't Work (don't re-try)

Approach	Why it failed
`System.gc()` before END assertions	Advisory; no guarantee
`releaseCaptures()` on ALL unblessed containers	Falsely reaches 0 via stash refs; Moo infinite recursion
Decrement refCount for captured blessed refs at inner scope exit	Breaks `destroy_collections.t` test 20 — outer closures legitimately keep objects alive
`git stash` for testing alternatives	Lost work — never use
Rescued object `refCount = 1` instead of `-1`	Infinite DESTROY loops (inflated refcounts always trigger rescue)
Cascading cleanup after rescue	Destroys Schema internals (Storage, DBI::db) the rescued Schema needs
Call `clearAllBlessedWeakRefs` earlier	Can't pick "significant" scope exits during test execution
`WEAKLY_TRACKED` for birth-tracked objects	Birth-tracked (refCount≥0) don't enter WEAKLY_TRACKED path in `weaken()`
Decrement refCount for WEAKLY_TRACKED in `setLargeRefCounted`	WEAKLY_TRACKED refcounts inaccurate; false-zero triggers
Hook into `assert_empty_weakregistry` via Perl code	Can't modify CPAN test code per project rules
`deepClearAllWeakRefs` in unblessed callDestroy	Too aggressive — clears refs for objects still alive elsewhere. Failed `destroy_anon_containers.t` test 15
DESTROY resurrection via refCount=0 reset + incrementRefCountForContainerStore resurrection branch	Worked for simple cases but caused infinite DESTROY loops for the `warn` inside DESTROY pattern: each DESTROY call triggers the WARN handler which pushes to @DB::args → apparent resurrection → refCount > 0 → eventual decrement → DESTROY fires again → loop. The mechanism needs a separate "being destroyed" state distinct from MIN_VALUE to avoid re-entry

Non-Bug Warnings (informational)

Mismatch of versions '1.1' and '1.45' in t/00describe_environment.t for Params::ValidationCompiler::Exception::Named::Required: Not a PerlOnJava bug. Exception::Class deliberately sets $INC{$subclass.pm} = __FILE__ on every generated subclass.
Subroutine is_bool redefined at Cpanel::JSON::XS line 2429: Triggered when Cpanel::JSON::XS loads through @ISA fallback. Cosmetic only.

Fix 10: t/52leaks.t tests 12-18 — IN PROGRESS

Failure Inventory

Test	Object	B::REFCNT	Category
12	`ARRAY \| basic random_results`	1	Unblessed, birth-tracked
13-15	`DBICTest::Artist` / `DBICTest::CD`	2	Blessed row objects
16	`ResultSource::Table` (artist)	2	Blessed ResultSource
17	`ResultSource::Table` (artist)	5	Blessed ResultSource
18	`HASH \| basic rerefrozen`	0	Unblessed, dclone output

All 7 fail at line 526 assert_empty_weakregistry — weak refs still defined.

Key Timing Constraint

Assertion runs during test execution (line 526), not in an END block. clearAllBlessedWeakRefs() (END-time sweep) is too late.

Root Cause: Parent Container Inflation

$base_collection (parent anonymous hash) has refCount inflated by JVM temporaries from:

visit_refs() deep walk (passes hashref as function arg)
populate_weakregistry() + hash access temporaries
Storable::dclone internals
$fire_resultsets->() closures

When scope exits, scalar releases 1 reference but hash stays at refCount > 0. callDestroy never fires → scopeExitCleanupHash never walks elements → weak refs persist.

Implication: Fixes that hook into callDestroy/scopeExit for the parent hash are blocked because it never dies. Our minimal reproducers (/tmp/dbic_like.pl, /tmp/blessed_leak.pl, /tmp/circular_leak.pl) no longer leak, but the real DBIC pattern still does.

Diagnostic Facts

B::REFCNT inflates by +1 vs actual: B::svref_2object($x)->REFCNT calls Internals::SvREFCNT($self->{ref}) which bumps via B::SV's blessed hash slot. Failure inventory values are actual refCount + 1 (or 0 when refCount = MIN_VALUE).
Unicode confirmed irrelevant: t/52leaks.t uses only ASCII data.

Next Steps

Both remaining failures (t/52leaks.t tests 12-18 and t/storage/txn_scope_guard.t test 18) hit fundamental limitations of PerlOnJava's selective refCounting that can't be solved without a major architectural change:

Why t/52leaks.t tests 12-18 Are Blocked

$base_collection (parent anonymous hash) has refCount inflated by JVM temporaries created during visit_refs, populate_weakregistry, Storable::dclone, $fire_resultsets->(). When its scope exits, the scalar releases 1 reference but the hash stays at refCount > 0 → callDestroy never fires → scopeExitCleanupHash never cascades into children → weak refs persist.

Attempted fixes:

Orphan sweep for refCount==0 objects (Fix 10n attempt #1): No effect because leaked objects have refCount 1-5, not 0.
Deep cascade from parent at scope exit: Parent itself never triggers scope exit because its refCount > 0.
Reachability-based weak-ref clearing: Would require true mark-and-sweep from symbol-table roots — a major architectural addition.

The simple reproducers (/tmp/dbic_like.pl, /tmp/blessed_leak.pl, /tmp/anon_refcount{2,3,4}.pl, /tmp/dbic_like2.pl) all pass. Only the full DBIC pattern leaks, because real DBIC code paths create JVM temporaries via overloaded comparisons, accessor chains, method resolution, etc.

Why t/storage/txn_scope_guard.t test 18 Is Blocked

Test requires DESTROY resurrection semantics: a strong ref to the object escapes DESTROY via @DB::args capture in a $SIG{__WARN__} handler. When that ref is later released, Perl calls DESTROY a second time; DBIC's detected_reinvoked_destructor emits Preventing *MULTIPLE* DESTROY() warning.

Attempted fix (Fix 10n attempt #2): Set refCount = 0 during DESTROY body (not MIN_VALUE), track currentlyDestroying flag to guard re-entry, detect resurrection by checking refCount > 0 post-DESTROY.

Failure mode: my $self = shift inside DESTROY body increments refCount to 1 via setLargeRefCounted when $self is assigned. When DESTROY returns, $self is a Java local that goes out of scope without triggering a corresponding decrement (PerlOnJava lexicals don't hook scope-exit decrements for scalar copies). Post-DESTROY refCount=1 → false resurrection detection → loops indefinitely on File::Temp DESTROY during DBIC test loading.

Root cause: PerlOnJava's selective refCount scheme can't accurately track the net delta from a DESTROY body, because lexical assignments increment but lexical destruction doesn't always decrement.

What Would Fix Both

Either:

True reachability-based GC — mark from symbol-table roots on demand, clear weak refs for unreachable objects. Expensive but matches Perl's model exactly.
Accurate lexical decrement at scope exit — audit every my $x = <ref> path to ensure scope exit fires a matching decrement. Large, risky refactor.

See dev/design/refcount_alignment_plan.md for a phased plan that implements both.

Deferred until such architectural work becomes practical.

Historical notes (previously attempted)

visit_refs / LeakTracer instrumentation — ran diagnostics, identified parent hash refCount inflation as the blocker.
createReference() audit — Fixed: Storable, DBI. Other deserializers (JSON, XML::Parser) don't appear in the DBIC leak pattern.
Targeted refcount inflation sources — function-arg copies tracked via originalArgsStack (Fix 10l), @DB::args preservation works; but inflation in map/grep/keys temporaries remains.

Selective Refcounting Internals (reference)

States: -1=untracked; 0=tracked, 0 counted refs; >0=N counted refs; -2=WEAKLY_TRACKED; MIN_VALUE=DESTROY called.

Tracking activation: [...]/{...} → refCount=0; \@arr/\%hash → refCount=0 + localBindingExists=true; bless → refCount=0; weaken() on untracked non-CODE → WEAKLY_TRACKED.

Increment/decrement: setLargeRefCounted() on ref assignment when refCount≥0; marks scalar refCountOwned=true. Decrement at overwrite or scopeExitCleanup → deferDecrementIfTracked → flush().

END-time order: main returns → flushDeferredCaptures → flush() → clearRescuedWeakRefs → clearAllBlessedWeakRefs → END blocks.

Internals::SvREFCNT: refCount>=0 → actual; <0 → 1; MIN_VALUE → 0.

Key Code Locations

File	Method	Relevance
`RuntimeScalar.java`	`setLargeRefCounted()`	Increment/decrement
`RuntimeScalar.java`	`scopeExitCleanup()`	Lexical cleanup at scope exit
`RuntimeScalar.java`	`toStringLarge()`	Overload `""` self-recursion guard
`MortalList.java`	`deferDecrementIfTracked()`	Defers decrement to flush
`MortalList.java`	`scopeExitCleanupHash()`	Hash value cascade
`MortalList.java`	`flush()`	Processes pending decrements
`DestroyDispatch.java`	`callDestroy()`	Fires DESTROY / clears weak refs
`WeakRefRegistry.java`	`weaken()`	WEAKLY_TRACKED transition
`WeakRefRegistry.java`	`clearAllBlessedWeakRefs()`	END-time sweep (all objects)
`RuntimeHash.java`	`createReference()` / `createAnonymousReference()`	Named vs anonymous hash ref creation
`RuntimeArray.java`	`createReference()` / `createAnonymousReference()`	Named vs anonymous array ref creation
`RuntimeCode.java`	`pushArgs` + `originalArgsStack`	@DB::args snapshot preservation
`Overload.java`	`stringify()`	Overload `""` recursion depth guard
`CustomFileChannel.java`	`flock()` + `sharedLockRegistry`	POSIX-compatible multi-shared-lock
`SystemOperator.java`	`fork()`	Test-safe skip + EAGAIN errno
`Base.java`	`importBase()`	`@ISA` / `$VERSION` loaded-check
`Internals.java`	`svRefcount()`	Internals::SvREFCNT impl

Architecture Reference

dev/architecture/weaken-destroy.md — refCount state machine, MortalList, WeakRefRegistry
dev/design/destroy_weaken_plan.md — DESTROY/weaken implementation plan (PR #464)
dev/sandbox/destroy_weaken/ — DESTROY/weaken test sandbox
dev/patches/cpan/DBIx-Class-0.082844/ — applied patches for txn_scope_guard