Unable to Parse Large NBT Arrays #85

bdkopen · 2023-12-31T20:17:09Z

I am attempting to parse NBT files that contain very large arrays. Whenever I do this, an error is thrown because the underlying node-protodef library caps arrays at a max size of 0xFFFFFF (16,777,215). According to the commit history, this was originally added to prevent OOM crashes.

Based on this NBT format specification, a byte array should be able to have a size of 2^31.

Support should be added to allow larger NBT array sizes to be parsed.

Example Code

const arrBuffer = await file.arrayBuffer();
const buffer = new Buffer(arrBuffer);

// Error happens here
const parsed = (await parse(buffer)).parsed;

Error

 ⨯ node_modules/protodef/src/compiler.js (70:11) @ readFn
 ⨯ Error: Read error for undefined : array size is abnormally large, not reading: 18571392
    at async uploadFile (./src/app/load/_actions/uploadFile.ts:22:21)

Example NBT File

This is an example NBT file that includes a very large byte array (size of ~18 million). Parsing will throw an error if executed on this file. No error is thrown when a NBT with smaller array sizes is parsed.
LargeNBTFile.schem.gz

The text was updated successfully, but these errors were encountered:

rom1504 · 2023-12-31T22:19:18Z

We could add an option to that node protodef type to allow large arrays. Keep it off by default And then propagate that option to pnbt How come your file has such big arrays ?

…

On Sun, Dec 31, 2023, 9:17 PM Braden Kopenkoskey ***@***.***> wrote: I am attempting to parse NBT files that contain very large arrays. Whenever I do this, an error is thrown because the underlying node-protodef <https://github.com/ProtoDef-io/node-protodef/blob/912683467d4eec4df852a4ae66030338416b48b1/src/datatypes/compiler-structures.js#L13> library caps arrays at a max size of 0xFFFFFF (16,777,215). According to the commit history <ProtoDef-io/node-protodef@6a625d6>, this was originally added to prevent OOM crashes. Based on this NBT format specification <https://minecraft.fandom.com/wiki/NBT_format#Binary_format>, a byte array should be able to have a size of 2^31. Support should be added to allow larger NBT array sizes to be parsed. Example Code const arrBuffer = await file.arrayBuffer();const buffer = new Buffer(arrBuffer); // Error happens hereconst parsed = (await parse(buffer)).parsed; Error ⨯ node_modules/protodef/src/compiler.js (70:11) @ readFn ⨯ Error: Read error for undefined : array size is abnormally large, not reading: 18571392 at async uploadFile (./src/app/load/_actions/uploadFile.ts:22:21) Example NBT File This is an example NBT file that includes a very large byte array (size of ~18 million). Parsing will throw an error if executed on this file. No error is thrown when a NBT with smaller array sizes is parsed. LargeNBTFile.schem.gz <https://github.com/PrismarineJS/prismarine-nbt/files/13802101/LargeNBTFile.schem.gz> — Reply to this email directly, view it on GitHub <#85>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAR437U4RMOORNBXOYGZRZ3YMHB5DAVCNFSM6AAAAABBIO4DW2VHI2DSMVQWIX3LMV43ASLTON2WKOZSGA3DCMJWGAZTCNI> . You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>

bdkopen · 2023-12-31T22:36:17Z

We could add an option to that node protodef type to allow large arrays. Keep it off by default And then propagate that option to pnbt

This seems like a reasonable solution to me.

How come your file has such big arrays ?

I am working with Minecraft schematic files. Every single block has it's own index in the byte array, so a schematic with dimensions of 500 * 500 * 75 gets to a size of about 18 million, surpassing the limit. It is probably not an optimal format, but I am working with the standard that is already set.

rom1504 · 2023-12-31T22:37:41Z

Ok makes sense, you may be interested by https://github.com/PrismarineJS/prismarine-schematic

extremeheat · 2024-01-01T19:27:03Z

Some time ago (2020) there was problems with prismarine-nbt crashing V8 with OOM, so that check was meant to safely fail. I was thinking that check could be replaced with a simple if (arrayLength > (offset + buffer.length)) throw() check, but actually we can't do that since we don't know the size of the array items (which may be variable) before iterating. Since compiler is available to the type gen code it should not be hard to delete that line based on condition like if (compiler.skipChecks == true) skip.

rom1504 · 2024-01-01T23:34:01Z

Why not add a param for the array type ? That seems cleaner than a global param

…

On Mon, Jan 1, 2024, 8:27 PM extremeheat ***@***.***> wrote: Some time ago (2020) there was problems with prismarine-nbt crashing V8 with OOM, so that check was meant to safely fail. I was thinking that check could be replaced with a simple if (arrayLength > (offset + buffer.length)) throw() check, but actually we can't do that since we don't know the size of the array items (which may be variable) before iterating. Since compiler is available to the type gen code it should not be hard to delete that line based on condition like if (compiler.skipChecks == true) skip. — Reply to this email directly, view it on GitHub <#85 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAR437U7UA4RIDU4SGXSAHDYMMEZFAVCNFSM6AAAAABBIO4DW2VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQNZTGQ2DQOJVGA> . You are receiving this because you commented.Message ID: ***@***.***>

extremeheat · 2024-01-02T00:03:55Z

Because then every instance in the protocol schema, nbt.jsons and all the protocol.json that uses an array type would need to be updated. Otherwise the check would not be consistent

rom1504 · 2024-01-02T01:22:37Z

It can be made to default to false so we don't need to update usages

…

On Tue, Jan 2, 2024, 1:04 AM extremeheat ***@***.***> wrote: Because then every instance in the protocol schema, nbt.jsons and all the protocol.json that uses an array type would need to be updated — Reply to this email directly, view it on GitHub <#85 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAR437QFYM2UUYDX62VOTFLYMNFHLAVCNFSM6AAAAABBIO4DW2VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQNZTGUZTIMZZG4> . You are receiving this because you commented.Message ID: ***@***.***>

extremeheat · 2024-01-02T01:34:57Z

Right, but then users would need to have a way to update the type in the relevant part of the schema to toggle the behavior

bdkopen · 2024-04-20T18:48:12Z

@extremeheat and @rom1504, I have an open PR in node-protodef to address the proposed skipChecks option. Once that PR is reviewed and published, I have a PR ready for this repository to use the new compiler option.

rom1504 mentioned this issue Jan 12, 2024

My Schematic is so large, Protodef can't handle it ProtoDef-io/node-protodef#150

Open

bdkopen mentioned this issue Apr 20, 2024

Add Optional skipChecks to Disable Array Size Check ProtoDef-io/node-protodef#154

Merged

bdkopen mentioned this issue Jun 11, 2024

Use ProtoDef skipChecks flag to allow full NBT array size support #91

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unable to Parse Large NBT Arrays #85

Unable to Parse Large NBT Arrays #85

bdkopen commented Dec 31, 2023 •

edited

Loading

rom1504 commented Dec 31, 2023 via email

bdkopen commented Dec 31, 2023 •

edited

Loading

rom1504 commented Dec 31, 2023

extremeheat commented Jan 1, 2024

rom1504 commented Jan 1, 2024 via email

extremeheat commented Jan 2, 2024 •

edited

Loading

rom1504 commented Jan 2, 2024 via email

extremeheat commented Jan 2, 2024 •

edited

Loading

bdkopen commented Apr 20, 2024

Unable to Parse Large NBT Arrays #85

Unable to Parse Large NBT Arrays #85

Comments

bdkopen commented Dec 31, 2023 • edited Loading

Example Code

Error

Example NBT File

rom1504 commented Dec 31, 2023 via email

bdkopen commented Dec 31, 2023 • edited Loading

rom1504 commented Dec 31, 2023

extremeheat commented Jan 1, 2024

rom1504 commented Jan 1, 2024 via email

extremeheat commented Jan 2, 2024 • edited Loading

rom1504 commented Jan 2, 2024 via email

extremeheat commented Jan 2, 2024 • edited Loading

bdkopen commented Apr 20, 2024

bdkopen commented Dec 31, 2023 •

edited

Loading

bdkopen commented Dec 31, 2023 •

edited

Loading

extremeheat commented Jan 2, 2024 •

edited

Loading

extremeheat commented Jan 2, 2024 •

edited

Loading