My take is that arenas are very useful, but not to use as a stack so much as to ...

couchand · on Oct 31, 2022

> How would one handle this in Rust, C++, or Java?

In Rust you'd prefer to write the parser to slice the original memory and not copy out until you're done parsing. You can see this in, for instance, the signature of methods in the httparse crate:

    pub fn parse_headers<'b: 'h, 'h>(
        src: &'b [u8],
        dst: &'h mut [Header<'b>]
    ) -> Result<(usize, &'h [Header<'b>])>

To translate: you must provide a byte slice that lives at least as long as 'b, and a mutable slice of Headers that lives at least as long as 'h, where the bound 'b: 'h says the bytes must outlive the header slice. Those headers may then reference data that lives as long as 'b (that is, the original bytes).

This way we avoid creating the garbage in the first place by demanding that the original allocation live long enough.
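To make the borrowing concrete, here is a minimal sketch of a parser with the same lifetime shape. It is not httparse's implementation: `Header`, `EMPTY`, and `parse_headers_sketch` are hypothetical names made up for illustration, the return type is simplified to an `Option`, and the parsing rules are much looser than real HTTP.

```rust
/// Hypothetical stand-in for a header type: it owns nothing and only
/// borrows from the input buffer for the lifetime 'b.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Header<'b> {
    pub name: &'b str,
    pub value: &'b [u8],
}

/// Hypothetical placeholder used to pre-fill the caller's header slots.
pub const EMPTY: Header<'static> = Header { name: "", value: b"" };

/// Parses `Name: value\r\n` lines up to and including the blank line.
/// Returns the number of bytes consumed and the filled prefix of `dst`,
/// or `None` if the input is incomplete, malformed, or has too many headers.
pub fn parse_headers_sketch<'b: 'h, 'h>(
    src: &'b [u8],
    dst: &'h mut [Header<'b>],
) -> Option<(usize, &'h [Header<'b>])> {
    let mut pos = 0;
    let mut count = 0;
    loop {
        let rest = &src[pos..];
        // Find the end of the current line; bail out if there is none.
        let line_len = rest.windows(2).position(|w| w == b"\r\n")?;
        let line = &rest[..line_len];
        pos += line_len + 2;
        if line.is_empty() {
            break; // blank line terminates the header block
        }
        let colon = line.iter().position(|&b| b == b':')?;
        let name = std::str::from_utf8(&line[..colon]).ok()?;
        // Skip spaces after the colon without copying anything.
        let mut value = &line[colon + 1..];
        while let [b' ', tail @ ..] = value {
            value = tail;
        }
        // Both `name` and `value` are sub-slices of `src`, not copies.
        *dst.get_mut(count)? = Header { name, value };
        count += 1;
    }
    Some((pos, &dst[..count]))
}
```

The caller supplies both the input bytes and the header slots (for example `let mut slots = [EMPTY; 16];`), so the parser itself allocates nothing. The borrow checker then refuses to let `src` be freed or mutated while any returned `Header` is still in use.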

pencilguin · on Oct 31, 2022

It is odd to present arena allocation as a technique for C when it is most conveniently used in C++. C++'s standard library has numerous accommodations for this method, and the core language definition acknowledges as legitimate the construction of new objects on top of undestructed old objects. It has been used for as long as C++ has existed. Code using it is clean and maintainable.

I gather Rust is beginning to accumulate similar accommodations. It is already explicitly "safe" to seem to leak memory.
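For reference, a small sketch of what "safe" leaking looks like in Rust: `Box::leak` is a safe standard-library function that gives up ownership of a heap allocation and hands back a reference that never expires. The helper name `leak_str` is hypothetical.

```rust
/// Hypothetical helper: copies `s` to the heap and deliberately never frees it,
/// which is what makes the returned reference valid for 'static.
fn leak_str(s: &str) -> &'static str {
    // No `unsafe` is needed: leaking is considered memory-safe in Rust.
    Box::leak(s.to_owned().into_boxed_str())
}
```

Nothing ever runs the destructor for the leaked allocation, which is roughly the same trade an arena makes when it is dropped wholesale without destroying its contents one by one.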

jeltz · on Nov 1, 2022

While it might be more conveniently used in C++, the use of arena allocators in C is ancient, and it can be pretty convenient even in C. The PostgreSQL code base, for example, makes heavy use of arena allocators.

cma · on Oct 31, 2022

For C++ you can require that the destructor be deleted with

    ~Foo() = delete;

or that it be trivial (https://en.cppreference.com/w/cpp/language/destructor#Trivia...), then use a compile-time check on

    std::is_trivially_destructible_v<Foo> || !std::is_destructible_v<Foo>

to be allowed into the arena.