On Thu, May 22, 2025 at 12:59:24PM -0400, Jeff King wrote: > > By the way, I suspect I am not reading the code correctly, but I am > > not sure what fill_midx_entry() does with a failed case. > > > > midx_for_object(&m, pos); > > pack_int_id = nth_midxed_pack_int_id(m, pos); > > > > if (prepare_midx_pack(r, m, pack_int_id)) > > return 0; > > > > With or without cached failure, this should return 0 when and only > > when m->packs[pack_int_id] is a usable pack. But what about the > > access on the next line? > > I think there's a subtlety here with incremental midx's, in that a pack > id can be "global" within the whole midx chain, or a local index into a > specific chain element's list of packs. If you'll indulge me in a bit of pedantry for a moment, I would add that the pack_int_id is *always* global within the whole MIDX chain. It only happens to additionally be the correct local index when m->num_packs_in_base is zero. The thing here: m->packs[pack_int_id - m->num_packs_in_base]; ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ is indexing into the current layer's pack array, which only contains the packs at that layer, so the index there is no longer really a pack_int_id at all. But... > In fill_midx_entry(), I think we get such a global id back from > nth_midxed_pack_int_id(). And then when we hand that to > prepare_midx_pack(), it is converted into a local midx/id pair with: > > pack_int_id = midx_for_pack(&m, pack_int_id); ...this confusion is really my fault, since the thing midx_for_pack() is returning isn't really a pack_int_id in the classic sense at all. I think I chose this name to avoid making the patch noisy, and to avoid the possibility of accidentally using the global identifier when you meant to use the local index. (As an aside, I think I recall you suggesting a while ago that it might be interesting to define "global" things with a different type than "local" ones to prevent this sort of confusion. That would allow us to keep both "pack_int_id" and the return value of "midx_for_pack()" in scope at the same time, without the possibility of using one when you meant to use the other.) > where the end of that function is the same: > > return pack_int_id - m->num_packs_in_base; > > you saw elsewhere. So at that point we have a local midx and index, > which is what prepare_midx_pack() fills via the m->packs[pack_int_id] > field. > > So when you say "only when m->packs[pack_int_id] is a usable pack", you > are talking about the local m/pack_int_id within that function. > > But back in the caller... > > > p = m->packs[pack_int_id - m->num_packs_in_base]; > > > > Do we have any guarantee that we called prepare_midx_pack() for > > the pack at (pack_int_id - m->num_packs_in_base)th slot? Can p > > be NULL here? And with the magic "we know this fails" value, can p > > be that magic value? > > Our pack_int_id is the global one, so it needs to be adjusted. But this > pack pointer we access is the same one that was filled (or not) by > prepare_midx_pack(). So it cannot be NULL or the magic "fails" value, > because prepare_midx_pack() returned 0. > > So I think this code is fine. Thanks for the analysis. > One thing that did puzzle me: in prepare_midx_pack() we not only adjust > the pack_int_id, but we may walk back through the midx chain to find the > correct multi_pack_index struct. Wouldn't the caller need to do the > same? > > The answer is that it does. The midx_for_object() call in > fill_midx_entry() does that same walk, storing the result in its local > "m" variable. So the walk backwards in prepare_midx_pack() is > superfluous for this particular caller, who we know is already handing > us the desired multi_pack_index struct, and it could just do: > > pack_int_id -= m->num_packs_in_base; Right. You could imagine that after getting the pack_int_id back from nth_midxed_pack_int_id(), we could do: midx_for_pack(&m, pack_int_id); , which would be a noop because of the chain invariant that we never duplicate objects or packs anywhere in the chain. (IOW, if you know object X is in pack Y, and you move your MIDX pointer to the layer containing X, you are guaranteed to be able to find Y in that layer's array of packs.) > rather than calling midx_for_pack(). But the same is not necessarily > true for other callers, so we should continue calling that function. ...exactly ;-). > I suspect this would all be a bit more obvious if prepare_midx_pack() > simply returned the pack pointer, avoiding the need for callers to look > at m->packs at all (and making it a true cache, internally only to > prepare_midx_pack()). > > Looking at other callers of prepare_midx_pack(): > > - in fill_packs_from_midx(), we do not adjust our "m" to match the > index. But that is OK, because we adjust our local index (which we > get by iterating from 0 to m->num_packs) to a global index when > calling the function: > > if (prepare_midx_pack(ctx->repo, m, m->num_packs_in_base + i)) > ... > open_pack_index(m->packs[i]); > > which is fine. > > - I'm less sure of the call in expire_midx_packs(). It iterates over > num_packs in the same way, but does: > > if (prepare_midx_pack(r, m, i)) > continue; > > and then looks at m->packs[i]. That would be wrong if "m" is not the > first item in the chain. Ah, I see. Earlier we do: > > if (m->base_midx) > die(_("cannot expire packs from an incremental multi-pack-index")); > > so we know that the global and local ids are equivalent in this > instance (since the "base" midx . Still seems a bit fragile. Yeah, I think you can only meaningfully expire packs from the most recent layer, which is why I added that guard. I agree it is still fragile, though. > - There's a similar case in midx-write.c:want_included_pack(). That > one seems to have the same local/global confusion, but I do not > obviously see anything preventing it from being fed a non-base midx. > So it might possibly be buggy? Yeah, this spot is definitely broken. At minimum it would need something like: --- 8< --- diff --git a/midx-write.c b/midx-write.c index 0897cbd829..54a04f7b75 100644 --- a/midx-write.c +++ b/midx-write.c @@ -1634,9 +1634,10 @@ static int want_included_pack(struct repository *r, uint32_t pack_int_id) { struct packed_git *p; + midx_for_pack(&m, pack_int_id); if (prepare_midx_pack(r, m, pack_int_id)) return 0; - p = m->packs[pack_int_id]; + p = m->packs[pack_int_id - m->num_packs_in_base]; if (!pack_kept_objects && p->pack_keep) return 0; if (p->is_cruft) --- >8 --- > Likewise fill_included_packs_batch() in the same file. I think this one is actually OK for the same reason as the expire_midx_packs() case. Its sole caller in midx_repack() has: if (m->base_midx) die(_("cannot repack an incremental multi-pack-index")); , so we are OK there. We might want to add an ASSERT() in fill_included_packs_batch() to make it clearer, though. > In both cases I think if prepare_midx_pack() returned a pointer, we > could just use it directly. Agreed. Thanks, Taylor