#235 [Coding Guideline]: Do not create values from uninitialized memory #240

manhatsu · 2025-12-04T06:30:55Z

Closes #235.

netlify · 2025-12-04T06:31:02Z

✅ Deploy Preview for scrc-coding-guidelines ready!

Name	Link
🔨 Latest commit	`50101d2`
🔍 Latest deploy log	https://app.netlify.com/projects/scrc-coding-guidelines/deploys/69444e5b5b545f0008c732f7
😎 Deploy Preview	https://deploy-preview-240--scrc-coding-guidelines.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

rcseacord · 2025-12-04T15:12:45Z

I've made some improvement suggestions in a PR here: manhatsu#1

manhatsu · 2025-12-04T23:05:30Z

I've made some improvement suggestions in a PR here: manhatsu#1

Thank you very much. Merged to this branch

PLeVasseur · 2025-12-04T23:17:34Z

Hey @manhatsu 👋 it looks like from the CI that a new tag needs to be added.

Could you follow what @rcseacord did in this PR to add the unsafe tag with an appropriate description? Ideally you would do that as a separate PR, as that's easy to review and merge.

PLeVasseur

Hi @manhatsu -- thank you for contributing. Please see the comment I left on how to generate a template.

src/coding-guidelines/values.rst

workingjubilee

What @inkreasing says is correct. This description is insufficient to reflect the restrictions imposed by MIRI here:

https://play.rust-lang.org/?version=stable&mode=debug&edition=2024&gist=abb9da1c391902b21c03ed1d21767b58

Note this is not UB before line 15.

Bytes remain uninit until written. You may not read uninitialized bytes as any initialized type, period, not even if "all" bitpatterns are considered valid, because uninit is the 257th bitpattern for a byte, effectively: 0xUU. By contrast, u8 is 0x00 through 0xFF, inclusive. We use MaybeUninit<u8> to indicate the final state is possible, and it is valid to read that value (well, from any allocation that has a byte in it, at least).

workingjubilee · 2025-12-05T20:40:27Z

src/coding-guidelines/values.rst

+   A program shall not create a value of any type from uninitialized memory,
+   except when accessing a field of a union type,
+   where such reads are explicitly defined to be permitted even if the bytes of that field are uninitialized.
+   It is prohibited to interpret uninitialized memory as a value of any Rust type such as a
+   primitive, aggregate, reference, pointer, struct, enum, array, or tuple.


This definition does not consider composition of types:

use std::mem::MaybeUninit; union Uninit32 { u: u32, i: i32, f: f32, void: (), } struct Newtype32(Uninit32); fn main() { let x: Newtype32 = unsafe { MaybeUninit::uninit().assume_init() }; }

https://play.rust-lang.org/?version=stable&mode=debug&edition=2024&gist=feae8c987fa0b2703533ce0ebf8b23ba

This passes miri because all the bytes in Newtype32 are defined by Uninit32, which is allowed to be uninitialized.

When bytes are read as a type (e.g. by using ptr.read(), *ptr, or mem::transmute), a "typed read" occurs. This asserts the bytes are valid as that type. Uninit32 and MaybeUninit are the same thing here: unions with () as a possibility, which means they must be valid to read from a blob of uninitialized bytes within a valid allocation¹. Because Newtype32 is entirely defined by Uninit32, it is also valid to read from uninitialized bytes: the struct wrapper does not impose a novel validity requirement. If this is a mandatory guideline, it should be more exacting about why.

Footnotes

Note that this is likely a stronger requirement than the actual rules will be regarding union validity once final details of those are hashed out. I'm just giving an example that is very "in the clear". ↩

src/coding-guidelines/values.rst

workingjubilee · 2025-12-05T21:06:21Z

src/coding-guidelines/values/gui_uyp3mCj77FS8.rst.inc

+   .. non_compliant_example::
+      :id: non_compl_ex_Qb5GqYTP6db1
+      :status: draft
+
+      This noncompliant example creates a value of type ``u32`` from uninitialized memory via 
+      `assume_init <https://doc.rust-lang.org/stable/std/mem/union.MaybeUninit.html#method.assume_init>`_:
+
+      .. code-block:: rust
+
+         use std::mem::MaybeUninit;
+
+         let x: u32 = unsafe { MaybeUninit::uninit().assume_init() }; // UB


You got this right, but the lesson needs to be extended elsewhere: assume_init and the read of a union field at its type are not really different operations in the semantics, so why would u32 be valid to read from a union field?

src/coding-guidelines/values.rst

felix91gr

I hope this helps

src/coding-guidelines/values/gui_uyp3mCj77FS8.rst.inc

felix91gr · 2025-12-08T23:29:58Z

src/coding-guidelines/values/gui_uyp3mCj77FS8.rst.inc

+      - may violate niche or discriminant validity,
+      - may create invalid pointer values, or


I believe that these two would benefit of being further developed with their own examples. Going in-depth in the Rationale is fine: your goal is to make it obvious why this guideline is required. Explain as much as you find is needed.

I simplified the text here. The examples expand on pointer and reference validity.

felix91gr · 2025-12-08T23:43:41Z

src/coding-guidelines/values/gui_uyp3mCj77FS8.rst.inc

+      - creates undefined behavior for most types,
+      - may violate niche or discriminant validity,
+      - may create invalid pointer values, or
+      - may produce values that violate type invariants.


Like the first bullet point, I believe the last one needs to point to an official reference or documentation of some kind that explains in full why this is the case.

removed these bullets.

felix91gr · 2025-12-08T23:58:41Z

src/coding-guidelines/values.rst

+   :scope: system
+   :tags: undefined-behavior, unsafe
+
+   Do not create a typed value from uninitialized memory.


I would perhaps divide this guideline into two.

There is a very good guideline in this PR that deals with the creation of typed values from uninitialized memory by using functions like assume_init on memory that has not been fully initialized on the relevant bytes. That transition, from MaybeUninit<T> to T, &mut T and others, is Undefined Behavior.

But there is also another guideline in this PR that deals with how and when it's valid to access fields of unions. I think it's best for the two to be separate, since one of them deals with the creation of typed values and the other deals with the reading of typed values from unions.

Sidenote:

Keep in mind that MaybeUninit<T> is itself a union, so however the guideline that deals with access to fields of unions ends up being written, you will want to make sure you consider it among the exceptions ;)

You might also want to take a look at these pages:

The Rustonomicon on working with uninitialized memory from unsafe https://doc.rust-lang.org/nomicon/unchecked-uninit.html

The explanation of Initialization Invariant, examples and such, from the docs of MaybeUninit.

The Language Reference on unions: https://doc.rust-lang.org/reference/items/unions.html

The first two contain most of what I think you'll need to complete this / these Guideline(s). I hope they help.

I'll have to think on this a bit, as splitting guidelines is a lot of work.

So I think it would be hard to separate these because a union is an exception to this rule so it would have to be mentioned. Once it is, you have introduced all the tricky union behavior as to what is and is not a violation.

I have changed the title to include reading because my understanding is that reading and calling assume_init are basically the same here.

I have changed the title to include reading because my understanding is that reading and calling assume_init are basically the same here.

That's the thing: they are not. But I don't think that matters much anyways.

Why they are not the same

MaybeUninit<T> is very special for the compiler. assume_init is, among other things, a statement of "from here on, assume this memory is initialized". The compiler uses this fact to deduce things about the program, hence why it's UB to violate those assumptions. That memory never needs to be read for this to happen: it happens because a contract with the compiler has been broken.

Oh the other hand, reading from a union field is either well-defined or UB, depending on if the field is properly initialized or not.

They are not the same, except for the fact that both can cause UB if used poorly (which is a property of everything that inhabits in unsafe).

Why I don't think it matters much anyway

MaybeUninit<T> is the main (almost only, in fact) avenue through which a programmer can create initialized, valid values of a type T, from uninitialized memory. It's a carefully defined API, and it has a tight integration with the compiler.

On the other hand, unions are a low-level primitive that can be used, among other things, to emulate MaybeUninit<T>. That's, in principle, how MaybeUninit<T> was constructed - you can see it's itself a union.

But unions are rather obscure in Unsafe Rust, and they have their own niche use cases. The contexts in which they are used are so different from MaybeUninit<T> that bundling them together seems like a mistake to me.

The process for safely working with them is more crude than how it is for MaybeUninit<T>, which I would also consider part of their mismatch if they were to be ruled about in the same guideline.

Remember: we could have a guideline called "Do not invoke UB", which would be more or less well-defined (we could say "MIRI under Tree Borrows", and that would be well-defined enough for our purposes), but it would be a terrible guideline because so many different things can do UB.

What I'd do

Yeah, I'd separate it into two.

One for MaybeUninit<T>

To make it more actionable, you could go straight ahead and make a guideline about how to use this type in a safe manner.

Basically, ruling on how to call the assume_ APIs of MaybeUninit<T>:

assume_init

assume_init_drop

assume_init_mut

assume_init_read

assume_init_ref

As well as:

array_assume_init

Their documentation explains their safety invariants. The guideline would basically point out how and why to satisfy them.

One for unions

Again, unions have their own niches and contexts. They are also WAY cruder to use than MaybeUninit<T>. I think the examples will show just how different they are.

I think unions merit their own rule. They are, after all, one of the few low-level primitives the language has.

No, the code you replied you doesn't have UB.

The problem here is what does "fully initialized" mean? It doesn't include padding, so it doesn't mean all bytes. There are also zero-sized types that don't have to (can't?) be initialized. And some types are allowed to hold unitialized data (like MaybeUninit and unions with a () member). These also don't have to be initialized to be "valid". So if you have a type that only has members that can hold uninit bytes then you can do MaybeUninit::uninit().assume_init() without UB.

See also: #general > UB in Safe Rust @ 💬

This passes miri:

use std::mem::MaybeUninit; union Uninit32 { u: u32, i: i32, f: f32, void: (), } struct Newtype32(Uninit32); fn main() { let x: Newtype32 = unsafe { MaybeUninit::uninit().assume_init() }; let y: () = unsafe { MaybeUninit::uninit().assume_init() }; }

https://play.rust-lang.org/?version=stable&mode=debug&edition=2024&gist=dc68e3622c93a4a7bf4dcd2d0187cef3

Ah, you're right. Okay, I'm not up to properly reviewing this then. @inkreasing or @workingjubilee can I interest you guys in reviewing this guideline?

I can help with the more form-related bits, like where should explanations go and where should the actionable items go. But I'll need the help of one of you, who are more experienced with this, to give the semantics a thumbs up.

Lemme know and I'll assign you as reviewer :3

I have changed the title to include reading because my understanding is that reading and calling assume_init are basically the same here.

This understanding is functionally correct: the important mechanical element is sometimes referred to as "transmuting" or "read-at-type". It is the low-level statement (expression?) in the Rust abstract machine that moves some bytes and thinks of them as being bytes of a specific type as it does so. Bytes "at rest" have no such identity in the Rust AM, but they acquire a type while in motion because that often has an implication on the specific bytes that actually have to move. For instance, while it is definitely valid to copy all the bytes blindly, it may also be valid to e.g. move individual fields of a struct, instead of moving the entire struct-with-padding, which may be important when considering function calls in ABIs which do precisely that.

The missing detail in your reading, @felix91gr, is that the assume_init documentation references the initialization invariant documentation on the type. That documentation describes that individual types have their own initialization invariant. We usually say something like "every byte has to be in the set of valid bitpatterns for that type". This is easily logically handled by tracking certain bytes as having "uninit" as a unique state that doesn't correspond to 0x00 through 0xFF. So for a type that allows uninit for a given byte, the typed read permits a read of the uninit bytes, as "0xUU" is in the valid set of bitpatterns. This does get slightly confusing because we don't consider it a bitpattern but we kinda... do...?

Anyway, it is easy to read as being truly independent and context-free. We should probably change that.

OK, I've resolved all of this in the new version

src/coding-guidelines/values.rst

workingjubilee

Much better.

src/coding-guidelines/values.rst

felix91gr · 2025-12-11T03:04:49Z

Sidenote: this probably should be part of the Unsafety chapter. Anything dealing with upholding validity invariants in unsafe should probably go there, I believe.

rcseacord · 2025-12-11T10:21:29Z

Sidenote: this probably should be part of the Unsafety chapter. Anything dealing with upholding validity invariants in unsafe should probably go there, I believe.

I created an issue #241 that discusses this. See what you think. Anyway, we should probably have this discussion there.

PLeVasseur · 2025-12-15T18:35:01Z

Hi @manhatsu, @rcseacord -- please see this PR: #288

Please simply replace the current commits on your feature branch with that single commit on the above PR.

That way we can keep the review history on this PR.

rcseacord · 2025-12-15T23:05:18Z

@felix91gr I might be coming around to your view that union should be split out into a different rule.

deunionfied

had a slightly older version, so I replaced it with the latest.

inkreasing · 2025-12-17T20:59:55Z

src/coding-guidelines/values/gui_uyp3mCj77FS8.rst.inc

+   or related functions, is treated in the same manner as a typed read.
+   Calling these function when on memory that is not yet fully initialized causes immediate undefined behavior.
+   The memory must be properly initialized according to the requirements of the variable’s type.
+   For example, a variable of reference type must be aligned and non-null.


I don't know if you want to fully list all validity requirements here, but references also need to point to valid memory.

This code has UB:

fn main() { let mut uninit: MaybeUninit<&u8> = MaybeUninit::uninit(); unsafe { // write non-null and aligned address. (&raw mut uninit).cast::<*const u8>().write(ptr::dangling()); // UB here let init = uninit.assume_init(); } }

add this requirement and created a new noncompliant example

added another example and some clarification for references

rcseacord

lgtm

spelling

src/coding-guidelines/values/gui_uyp3mCj77FS8.rst.inc

Co-authored-by: increasing <dev@lucasbaumann.de>

manhatsu force-pushed the doc/no-uninit-value branch from d3b29b5 to 57f303a Compare December 4, 2025 23:42

manhatsu mentioned this pull request Dec 5, 2025

Fix/set valid tag manhatsu/safety-critical-rust-coding-guidelines#2

Closed

PLeVasseur force-pushed the doc/no-uninit-value branch from 57f303a to 4b8cbc7 Compare December 5, 2025 15:31

PLeVasseur reviewed Dec 5, 2025

View reviewed changes

src/coding-guidelines/values.rst Outdated Show resolved Hide resolved

PLeVasseur added chapter: values coding guideline An issue related to a suggestion for a coding guideline labels Dec 5, 2025

github-project-automation bot added this to [Safety Critical Rust Consortium Coding Guidelines] Work Items Dec 5, 2025

inkreasing reviewed Dec 5, 2025

View reviewed changes

src/coding-guidelines/values.rst Outdated Show resolved Hide resolved

workingjubilee suggested changes Dec 5, 2025

View reviewed changes

workingjubilee reviewed Dec 5, 2025

View reviewed changes

src/coding-guidelines/values.rst Outdated Show resolved Hide resolved

manhatsu force-pushed the doc/no-uninit-value branch from 9b81ff4 to 0e2776c Compare December 8, 2025 00:44

inkreasing reviewed Dec 8, 2025

View reviewed changes

src/coding-guidelines/values.rst Outdated Show resolved Hide resolved

inkreasing reviewed Dec 8, 2025

View reviewed changes

src/coding-guidelines/values.rst Outdated Show resolved Hide resolved

felix91gr requested changes Dec 9, 2025

View reviewed changes

workingjubilee reviewed Dec 9, 2025

View reviewed changes

src/coding-guidelines/values.rst Outdated Show resolved Hide resolved

PLeVasseur force-pushed the doc/no-uninit-value branch from dcb19bf to 8d4a655 Compare December 12, 2025 03:28

PLeVasseur mentioned this pull request Dec 15, 2025

#235 [Coding Guideline]: Do not create values from uninitialized memory - to get building again #288

Closed

PLeVasseur and others added 2 commits December 16, 2025 12:30

feat: split into own file

7713f0d

feat: remove description related to union

2ade1b3

deunionfied

manhatsu force-pushed the doc/no-uninit-value branch from 20d6c35 to 2ade1b3 Compare December 16, 2025 03:36

manhatsu requested a review from PLeVasseur December 16, 2025 03:39

manhatsu requested a review from rcseacord December 16, 2025 03:39

rcseacord added 3 commits December 17, 2025 12:14

Update gui_uyp3mCj77FS8.rst.inc

8e147c4

had a slightly older version, so I replaced it with the latest.

Update gui_uyp3mCj77FS8.rst.inc

7b27536

Update gui_uyp3mCj77FS8.rst.inc

807d57c

inkreasing reviewed Dec 17, 2025

View reviewed changes

Update gui_uyp3mCj77FS8.rst.inc

4e17069

added another example and some clarification for references

rcseacord requested review from felix91gr and workingjubilee December 18, 2025 01:10

Update gui_uyp3mCj77FS8.rst.inc

db75b00

rcseacord approved these changes Dec 18, 2025

View reviewed changes

Update gui_uyp3mCj77FS8.rst.inc

39ff5a0

spelling

inkreasing reviewed Dec 18, 2025

View reviewed changes

src/coding-guidelines/values/gui_uyp3mCj77FS8.rst.inc Outdated Show resolved Hide resolved

Update src/coding-guidelines/values/gui_uyp3mCj77FS8.rst.inc

50101d2

Co-authored-by: increasing <dev@lucasbaumann.de>

		- may violate niche or discriminant validity,
		- may create invalid pointer values, or

#235 [Coding Guideline]: Do not create values from uninitialized memory #240

Are you sure you want to change the base?

#235 [Coding Guideline]: Do not create values from uninitialized memory #240

Conversation

manhatsu commented Dec 4, 2025

Uh oh!

netlify bot commented Dec 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for scrc-coding-guidelines ready!

Uh oh!

rcseacord commented Dec 4, 2025

Uh oh!

manhatsu commented Dec 4, 2025

Uh oh!

PLeVasseur commented Dec 4, 2025

Uh oh!

PLeVasseur left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

workingjubilee left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

workingjubilee Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Footnotes

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

felix91gr left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

felix91gr Dec 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rcseacord Dec 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

felix91gr Dec 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

netlify bot commented Dec 4, 2025 •

edited

Loading

workingjubilee left a comment •

edited

Loading

workingjubilee Dec 5, 2025 •

edited

Loading

felix91gr Dec 8, 2025 •

edited

Loading

rcseacord Dec 17, 2025 •

edited

Loading

felix91gr Dec 11, 2025 •

edited

Loading

One for `MaybeUninit<T>`

workingjubilee Dec 14, 2025 •

edited

Loading

workingjubilee Dec 14, 2025 •

edited

Loading