S-buckets sizes difference

dariolina · May 10, 2023, 7:48pm

I have plotted the following distribution of current s-bucket sizes for a 1300-piece sector:

The median is around 750 chunks per bucket. I attribute the drop after s-bucket index ~50000 to the fact that by that point most of the records have most of their 2^{15} chunks encoded. In fact, 2^{15}/(1 - 1/e) \approx 51800 is the expected number of challenge needed to find 2^{15} proofs in a Chia PoS table.
For 1000 records in a sector the picture is quite expectedly similar with a median of around 580:

In this case each bucket (before the drop) is ~19 KiB in size.
Do we want to make buckets more uniform?
I do not think the “unlucky” tail buckets with significantly fewer tickets create an incentive to drop them, since the prover still needs all chunks to create KZG witness and the time to encode them is negligible. However, intuitively, it feels like having bucket sizes more uniform is better.

Chen_Feng_2023 · May 11, 2023, 9:01am

I agree with you that having a more uniform distribution is better. Plus, you already proposed a smart idea to achieve this.

nazar-pc · May 11, 2023, 10:08am

We can do something as simple as taking first encoded chunks for even piece offsets and last for odd piece offsets. Not sure how tricky or performance it would be to implement and diagram will certainly get very messy, but it is a relatively minor change to the protocol.

dariolina · May 15, 2023, 8:42am

Nazar’s solution is much simpler and should achieve eliminating the tail and leveling the buckets. I have opened an improvement issue on GitHub.

Topic		Replies	Views
Redefining s-buckets Research	6	227	April 25, 2023
Slight change to sector contents Research research , consensus , sector , plot	17	264	April 16, 2023
Proof of space performance improvements by sharing tables between pieces Research consensus , proof-of-space , performance , chia	29	559	August 10, 2023
Extreme audit algorithm inefficiency (CPU-bound) Research consensus , performance	11	346	November 6, 2023
Remove erasure coding from v2.3 plotting Research research , consensus , sector , plot	8	242	April 25, 2023

S-buckets sizes difference

Related Topics