
Bitcode rewrite #19

Merged: caibear merged 45 commits into main from bitcode_rewrite on Mar 16, 2024

Conversation

caibear (Member) commented Feb 22, 2024

I rewrote the entire library (docs). It's been tested and fuzzed, but still needs some work before release.

# If you want to try it early
bitcode = "=0.6.0-beta.1"

New features:

  • much faster
  • very compressible
  • doesn't require any hints (determines them at runtime)
  • deserialize &str (zero-copy; see the sketch below)
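
To make that last item concrete, here is a minimal sketch of borrowed decoding with the 0.6 derive API; the Message type is illustrative, not something from this PR:

```rust
use bitcode::{Decode, Encode};

// Illustrative type, not from the PR: one borrowed field, one owned.
#[derive(Encode, Decode, PartialEq, Debug)]
struct Message<'a> {
    id: u32,
    text: &'a str, // zero-copy: borrows directly from the encoded bytes
}

fn main() {
    let bytes = bitcode::encode(&Message { id: 7, text: "hello" });
    // The decoded Message borrows `text` from `bytes` instead of allocating.
    let decoded: Message = bitcode::decode(&bytes).unwrap();
    assert_eq!(decoded, Message { id: 7, text: "hello" });
}
```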

Alpha release:

  • big endian platforms (compile error right now)
  • signed integer size reduction (currently treated as unsigned)
  • usize
  • Result

Beta release:

  • handle derive macro errors instead of panicking
  • remove lifetime bound on DecodeBuffer
  • fix documented unsound code in serde impl

Full release:

  • make bitcode::Buffer Send + Sync
  • recursive types
  • #[bitcode(with_serde)] (can only do bitcode::serialize right now)
  • #![forbid(unsafe_code)] feature flag (serde only and slightly slower)
  • CString
  • IpAddr (see #30: Add support for std::net::{*Addr*})
  • deserialize &[u8]

caibear marked this pull request as draft on February 22, 2024
caibear (Member, Author) commented Feb 22, 2024

Benchmarks for those who are interested. The previous version of bitcode isn't shown here, but its speed is similar to bincode, its size is similar to the new bitcode, and its compressed size is 20% worse than bincode's.

| Format | Compression | Size (bytes) | Serialize (ns) | Deserialize (ns) |
|---|---|---|---|---|
| bincode | none | 49.1 | 35 | 115 |
| bincode | lz4 | 16.1 | 86 | 115 |
| bincode | deflate-fast | 13.1 | 166 | 176 |
| bincode | deflate-best | 8.9 | 3708 | 141 |
| bincode | zstd-0 | 12.4 | 172 | 146 |
| bincode | zstd-22 | 8.5 | 32312 | 133 |
| bincode-varint | none | 22.3 | 36 | 116 |
| bincode-varint | lz4 | 10.8 | 71 | 119 |
| bincode-varint | deflate-fast | 10.1 | 146 | 165 |
| bincode-varint | deflate-best | 8.0 | 2664 | 153 |
| bincode-varint | zstd-0 | 8.2 | 123 | 131 |
| bincode-varint | zstd-22 | 7.8 | 23939 | 136 |
| bitcode | none | 16.9 | 29 | 104 |
| bitcode | lz4 | 9.8 | 42 | 108 |
| bitcode | deflate-fast | 8.3 | 86 | 137 |
| bitcode | deflate-best | 6.8 | 1661 | 125 |
| bitcode | zstd-0 | 7.1 | 58 | 115 |
| bitcode | zstd-22 | 6.2 | 24713 | 115 |
| bitcode-derive | none | 16.9 | 10 | 12 |
| bitcode-derive | lz4 | 9.7 | 25 | 17 |
| bitcode-derive | deflate-fast | 8.3 | 68 | 45 |
| bitcode-derive | deflate-best | 6.8 | 1597 | 33 |
| bitcode-derive | zstd-0 | 7.1 | 40 | 24 |
| bitcode-derive | zstd-22 | 6.2 | 24905 | 25 |

caibear changed the title from "Draft: Bitcode rewrite" to "Bitcode rewrite" on Feb 23, 2024
tbillington commented

> doesn't require any hints (determines them at runtime)

This is remarkable. Is there any particular part of the branch you could point me to so I can see how it's done?

Side question: do you see any benefits to hinting on top of runtime determination?

Hope it's okay to ask these questions in your PR :) Thanks for making this library!

caibear (Member, Author) commented Feb 26, 2024

> > doesn't require any hints (determines them at runtime)

> This is remarkable. Is there any particular part of the branch you could point me to so I can see how it's done?

https://github.com/SoftbearStudios/bitcode/blob/5bdc22ba943d0ba8de092a763327b8167656611f/src/pack.rs
https://github.com/SoftbearStudios/bitcode/blob/5bdc22ba943d0ba8de092a763327b8167656611f/src/pack_ints.rs
https://github.com/SoftbearStudios/bitcode/blob/5bdc22ba943d0ba8de092a763327b8167656611f/src/f32.rs (albeit this one requires the aid of a compression algorithm like deflate/lz4/zstd to do anything)
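
To give a flavor of what pack_ints.rs does, here is a simplified sketch (illustrative only, not the actual implementation): scan the values once, choose the smallest byte width that fits all of them, and record that width in a one-byte header so the decoder can undo the packing.

```rust
// Illustrative sketch of runtime-determined integer packing,
// not bitcode's actual code.
fn pack_u32s(values: &[u32], out: &mut Vec<u8>) {
    // One pass to find the widest value.
    let max = values.iter().copied().max().unwrap_or(0);
    // Smallest byte width that fits every value in the slice.
    let width: usize = match max {
        0..=0xFF => 1,
        0x100..=0xFFFF => 2,
        _ => 4,
    };
    // Header byte tells the decoder which width was chosen.
    out.push(width as u8);
    for &v in values {
        out.extend_from_slice(&v.to_le_bytes()[..width]);
    }
}
```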

> Side question: do you see any benefits to hinting on top of runtime determination?

After adding hints to bitcode, I didn't use them as much as I thought I would because doing so was tedious. The types of "packing" I'm using in this new version are designed to quickly determine whether they're applicable and then pack the data. I'm probably not going to add manual hints back because most people (me included) don't benefit from them. Also, this new version prioritizes working with general-purpose compression, which some of the old hints got in the way of (e.g. #[bitcode_hint(ascii)] made characters 7 bits wide, which confused byte-wise compression algorithms).

> Hope it's okay to ask these questions in your PR :) Thanks for making this library!

Yeah! I made this PR public before it was finished to see what people think of it.

LevitatingBusinessMan commented

Very impressive!!!

caibear (Member, Author) commented Feb 28, 2024

I just released bitcode = "=0.6.0-alpha.1", which has the core features complete.

caibear (Member, Author) commented Mar 12, 2024

I'm interested in hearing opinions about making bitcode::encode/bitcode::decode use a thread-local bitcode::Buffer.
This is how we use bitcode internally, so I'm considering upstreaming it.

thread_local! {
    // One reusable Buffer per thread, so repeated calls amortize allocations.
    static BUFFER: std::cell::RefCell<Buffer> = Default::default();
}
pub fn encode<T: Encode + ?Sized>(t: &T) -> Vec<u8> {
    BUFFER.with(|b| b.borrow_mut().encode(t).to_vec())
}
// T must be Sized here because decode returns it by value.
pub fn decode<'a, T: Decode<'a>>(bytes: &'a [u8]) -> Result<T, Error> {
    BUFFER.with(|b| b.borrow_mut().decode(bytes))
}

Pros:

  • Small messages encode/decode 2x faster
  • Large messages encode 20% faster and decode 10% faster

Cons:

  • Retains a large allocation after encoding/decoding a single large message
  • Hidden per-thread state, which can be surprising

You could always opt out by creating a new buffer for each encode/decode call.
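
For example, a minimal sketch of that opt-out, reusing the Buffer and Encode items from the snippet above:

```rust
use bitcode::{Buffer, Encode};

// A fresh Buffer per call: nothing is retained once it's dropped.
fn encode_fresh<T: Encode + ?Sized>(t: &T) -> Vec<u8> {
    let mut buffer = Buffer::default();
    buffer.encode(t).to_vec()
}
```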

caibear (Member, Author) commented Mar 14, 2024

I just released bitcode = "=0.6.0-beta.1".

  • reimplement bitcode::Buffer
  • fix unsound code in serde impl
  • optimize bitcode::serialize and bitcode::deserialize by 30-40%

caibear marked this pull request as ready for review on March 16, 2024
caibear merged commit 431b88f into main on Mar 16, 2024 (1 check passed)
caibear deleted the bitcode_rewrite branch on March 16, 2024
jestarray commented

@caibear https://docs.rs/bitcode/0.5.0/bitcode/struct.Buffer.html#method.deserialize
So buffer.deserialize() is now removed? I'm guessing that if I want to deserialize with a reusable buffer, I have to use encode/decode?

caibear (Member, Author) commented Apr 26, 2024

> @caibear https://docs.rs/bitcode/0.5.0/bitcode/struct.Buffer.html#method.deserialize So buffer.deserialize() is now removed? I'm guessing that if I want to deserialize with a reusable buffer, I have to use encode/decode?

Yes, this was removed. Currently you have to use Encode/Decode if you want to reuse allocations.
Buffer::{serialize, deserialize} might be reimplemented in a future version, but doing so is non-trivial.

Note: saving allocations is an optimization that's usually 10% faster on large messages and 50% faster on small messages.
If you care about speed, you probably want to use Encode/Decode anyway because they're usually 2-5x faster.
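
For reference, allocation reuse on the Encode/Decode path looks roughly like the sketch below; the Update type and handle_packets function are hypothetical:

```rust
use bitcode::{Buffer, Decode, Encode};

// Hypothetical message type for illustration.
#[derive(Encode, Decode)]
struct Update {
    entity: u32,
    x: f32,
    y: f32,
}

// One long-lived Buffer; each decode reuses its internal allocations.
fn handle_packets(buffer: &mut Buffer, packets: &[Vec<u8>]) -> Result<(), bitcode::Error> {
    for bytes in packets {
        let update: Update = buffer.decode(bytes)?;
        // ... apply the update to game state ...
        let _ = (update.entity, update.x, update.y);
    }
    Ok(())
}
```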

jestarray commented

@caibear Ahh, I see. I'm using hecs (an ECS) and other libs and would need to derive Encode/Decode on them as well, so I'll stick with 0.5.0 in the meantime.

caibear (Member, Author) commented Apr 26, 2024

> @caibear Ahh, I see. I'm using hecs (an ECS) and other libs and would need to derive Encode/Decode on them as well, so I'll stick with 0.5.0 in the meantime.

Have you benchmarked 0.6 with allocations against 0.5 without allocations for your use case? 0.6 with allocations might be faster if your messages are large enough.

jestarray commented Apr 26, 2024

@caibear Wow, is 0.6 really that much better? I haven't benchmarked yet. What do you mean by "large enough" in terms of size? My game sends position updates 30 times a second, and the average size is ~3-5 kilobytes.

caibear (Member, Author) commented Apr 27, 2024

> @caibear Wow, is 0.6 really that much better? I haven't benchmarked yet. What do you mean by "large enough" in terms of size? My game sends position updates 30 times a second, and the average size is ~3-5 kilobytes.

0.6 is generally faster and smaller than 0.5 across all benchmarks. The question here is whether the gain in speed outweighs the additional allocations. I just benchmarked deserializing 5kb of messages, and 0.6 is 30% faster. I don't know your exact structs, but this should be a good baseline.

On a side note: I also benchmarked 0.6 derive and it's 7x faster than 0.6 serde.
