benjaminFRY: Rust and the most elegant FSM

8.13.2015

Finite state machines should be a foundation upon which software engineers build complex systems. In this post I want to show how Rust's enum type supports building FSMs in a very simple and elegant manner.

FSM, what?

For anyone who has studied Computer Science, finite state machines are drilled into your head without mercy (or should have been; go ask for your $'s back if they were not). The first CS class I took in college was basically all FSMs, DFAs (deterministic finite automata) and Turing machines. I don't want to get into the differences between all of these, so I'm going to just refer to FSMs in general. (Strictly speaking, a DFA is one kind of FSM, while a Turing machine is a more powerful model: its set of control states is finite, but it also has an unbounded tape.) In this example I'm more specifically defining a DFA, as all of the states are known, and there is a begin and an end.

Ok, but why is this important? FSMs can simplify problems where you have a known set of inputs and known outputs. Specifically, what I want to use one for in this example is parsing, boring old parsing. It actually wasn't until I was in the industry that I truly began to appreciate how much simpler and more maintainable code becomes when you take the time to define a state machine that determines how to move from one state to another.

Background

Firstly, I'm still somewhat new to the Rust language, so when a "real" Rust person reads this, they'll probably complain that I've done xyz wrong, or should do it another way. Let me know! I'd love to understand what I could be doing better. I've been searching for a meaty project to work on to hone my skills in this language, and I finally found one: it seems like security advisories come out around DNS, and specifically Bind, all the time. Here is the list of all known Security Advisories in Bind9. Reading through some of the issues made me think, wtf, I'll just write a new DNS server in Rust, b/c you know, why not, and it will be safe implicitly, right? While doing that, I was writing the parsers for the binary record data from the DNS RFCs and fell on a really nice way of doing FSMs in Rust.

Now in Java I've used lots of different FSM generators, because I wanted strong language guarantees. For reference, my favorite right now is Tron, an annotation-based one written by a friend of mine. A generator is useful there because Java's enums and other basic constructs are not quite as expressive as what you can write in Rust.

On to the FSM!

The definition of what we need to parse from RFC1035:

3.1. Name space definitions
Domain names in messages are expressed in terms of a sequence of labels.
Each label is represented as a one octet length field followed by that
number of octets. Since every domain name ends with the null label of
the root, a domain name is terminated by a length byte of zero. The
high order two bits of every length octet must be zero, and the
remaining six bits of the length field limit the label to 63 octets or
less.

To simplify implementations, the total length of a domain name (i.e.,
label octets and label length octets) is restricted to 255 octets or
less.

Although labels can contain any 8 bit values in octets that make up a
label, it is strongly recommended that labels follow the preferred
syntax described elsewhere in this memo, which is compatible with
existing host naming conventions. Name servers and resolvers must
compare labels in a case-insensitive manner (i.e., A=a), assuming ASCII
with zero parity. Non-alphabetic codes must match exactly.
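To make the wire format above concrete, here is a minimal sketch of going the other way: turning a dotted name into length-prefixed labels. `encode_name` is a hypothetical helper for illustration only (it is not part of the DNS server code), and it assumes ASCII input and simply skips empty labels such as a trailing dot.

```rust
/// Encode a dotted name such as "www.example.com" into the RFC 1035
/// wire format: each label is prefixed by a one-octet length, and the
/// name ends with the zero-length root label.
/// NOTE: hypothetical helper, written only to illustrate the layout.
fn encode_name(name: &str) -> Option<Vec<u8>> {
    let mut out = Vec::new();
    for label in name.split('.').filter(|l| !l.is_empty()) {
        let bytes = label.as_bytes();
        // the top two bits of a length octet must be zero, so max 63
        if bytes.len() > 63 {
            return None;
        }
        out.push(bytes.len() as u8);
        out.extend_from_slice(bytes);
    }
    out.push(0); // the root label terminates the name
    // total length, including the length octets, is limited to 255
    if out.len() > 255 {
        return None;
    }
    Some(out)
}
```

So `encode_name("www.example.com")` gives the bytes 3, "www", 7, "example", 3, "com", 0, which is exactly the shape the parser has to walk back through.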

There are some more sections that provide details on things like pointers; you can read the spec if you want. Here are the states (I used http://madebyevan.com/fsm/ to build this, which doesn't have edge avoidance, so I added some extra states to keep the lines in order):

This state diagram basically represents the above, so I decided to translate this (minus the 'offset' and 'store' states) to Rust enums.

This ended up being pretty elegant, but first a side note on enums in Rust.

'enum' really?

So I've noticed this come up in discussions online a few times now, with questions like, "how do you enumerate a Rust enum?" The answer is really funny, IMO: you can't! Wait, what? You can't enumerate a Rust enum? Why is it called an enum? No, seriously, I don't have an answer to this: why was it called 'enum' when you can't enumerate its values? (Other than writing them out in order yourself, you can't treat them in code as an array or series, as you can in other languages, Java/C/C++ to name a few.)
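For what it's worth, "writing them in order" can be done once and reused. This is only a sketch of that workaround with a made-up `Color` enum; the `ALL` constant is something you maintain by hand, the compiler does not generate or check it.

```rust
#[derive(Debug, Clone, Copy, PartialEq)]
enum Color {
    Red,
    Green,
    Blue,
}

impl Color {
    /// Hand-maintained list of every variant. Nothing forces this to
    /// stay in sync, so it must be updated when a variant is added.
    const ALL: [Color; 3] = [Color::Red, Color::Green, Color::Blue];
}

/// Walk the hand-written list, as you would a Java `values()` array.
fn color_names() -> Vec<String> {
    Color::ALL.iter().map(|c| format!("{:?}", c)).collect()
}
```

This only works for variants without data; once a variant carries a payload, such as a `u8`, there is no finite list of values to write down, which is part of why the 'enum' name feels odd.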

I'm sure someone has an answer to that, but in the meantime I needed to clarify them in my mind. I think they should have been called 'union's, because that is what they are most closely related to. In C, a union is a data type which occupies only enough memory to hold its largest member, but what C doesn't give you is a way to know which thing in that union it really is! (read more here: C unions)

What's cool in Rust is that it does tell you what's stored in the, ahem, enum. So you can write matching (or destructuring) logic like this:

if let MyEnum::Type1 = my_var {
    do_something_really_awesome();
}

Which is very powerful and cool.
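The same matching also pulls out whatever data a variant carries. Here is a small sketch with a made-up `Shape` enum (the names are for illustration only), showing both `match` and `if let`:

```rust
/// A hypothetical enum; each variant can carry its own kind of data.
enum Shape {
    Empty,
    Circle(f64),             // radius
    Rect { w: f64, h: f64 }, // width and height
}

/// `match` must cover every variant, and binds the payload by name.
fn area(shape: &Shape) -> f64 {
    match shape {
        Shape::Empty => 0.0,
        Shape::Circle(r) => std::f64::consts::PI * r * r,
        Shape::Rect { w, h } => w * h,
    }
}

/// `if let` is handy when only one variant is interesting.
fn is_circle(shape: &Shape) -> bool {
    if let Shape::Circle(_) = shape {
        true
    } else {
        false
    }
}
```

Unlike the C union, there is no way to read the `Rect` fields out of a `Circle` here; the compiler only lets you at the data after the variant has been checked.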

FSM Now!

Ok so now for the super awesomeness of Rust. My enum is going to have four states: LabelLengthOrPointer (the start), Label, Pointer, and Root (the end).

/// This is the list of states for the label parsing state machine
enum LabelParseState {
    LabelLengthOrPointer, // basically the start of the FSM
    Label(u8),            // storing length of the label
    Pointer(u8),          // location of pointer in slice
    Root,                 // root is the end of the labels list, aka null
}

Ok, now for the code to run the state machine:

use std::string::FromUtf8Error;

/// parses the chain of labels
/// this has a max of 255 octets, with each label being at most 63.
/// all names will be stored lowercase internally.
/// This will consume the portions of the Vec which it is reading...
/// (note: pop() removes from the END of the Vec, so this only reads the
/// name front to back if the buffer was reversed beforehand)
pub fn parse(slice: &mut Vec<u8>) -> Result<Name, FromUtf8Error> {
    let mut state: LabelParseState = LabelParseState::LabelLengthOrPointer;
    let mut labels: Vec<String> = Vec::with_capacity(3); // www.example.com

    // assume all chars are utf-8. We're doing byte-by-byte operations,
    // no endianness issues...
    // the top two bits of the first byte select the case:
    // reserved: 10xx xxxx (0x80) and 01xx xxxx (0x40)
    // pointer:  11xx xxxx (0xC0), then byte & 0x3F = high bits of the offset
    // label:    00xx xxxx, then byte & 0x3F = length; next `length` bytes = label
    // root:     0000 0000
    loop {
        state = match state {
            LabelParseState::LabelLengthOrPointer => {
                // determine what the next label is
                match slice.pop() {
                    Some(0) | None => LabelParseState::Root,
                    Some(byte) if byte & 0xC0 == 0xC0 => LabelParseState::Pointer(byte & 0x3F),
                    Some(byte) if byte <= 0x3F => LabelParseState::Label(byte),
                    _ => unimplemented!(), // the reserved 01 and 10 prefixes
                }
            },
            LabelParseState::Label(count) => {
                // `?` replaces the older try!() macro
                labels.push(util::parse_label(slice, count)?);

                // reset to collect more data
                LabelParseState::LabelLengthOrPointer
            },
            LabelParseState::Pointer(offset) => {
                // lookup in the hashmap the label to use
                unimplemented!()
            },
            LabelParseState::Root => {
                // technically could return here...
                break;
            }
        }
    }

    Ok(Name { labels: labels })
}

code here.

So I'll just point out a couple of things that Rust does for us here: 1) it guarantees that all states are considered, because the match has to be exhaustive, and 2) it guarantees that each state produces a next state (or explicitly leaves the loop), because the result of the match is assigned to state. You'll notice that there are some unimplemented!() calls in there; that's because this is a work in progress, but I was so excited about how easy it was to write an FSM in Rust with just the standard language semantics that I just had to share.
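The function above leans on `Name` and `util::parse_label` from the rest of the project, so it won't compile on its own. Here is a self-contained sketch of the same loop that can be pasted into a file and played with. It is not the project's code: it assumes a plain `&[u8]` read front to back with an index, a made-up `ParseError` type, and it reports pointers and reserved prefixes as `Unsupported` rather than implementing them.

```rust
#[derive(Debug, PartialEq)]
enum ParseError {
    Truncated,       // ran out of bytes inside a label
    Unsupported(u8), // pointer or reserved length octet
    NotUtf8,
}

enum State {
    LabelLengthOrPointer, // start
    Label(u8),            // carries the label length
    Root,                 // end
}

fn parse_labels(bytes: &[u8]) -> Result<Vec<String>, ParseError> {
    let mut pos = 0;
    let mut labels = Vec::new();
    let mut state = State::LabelLengthOrPointer;
    loop {
        // every arm must produce the next state or leave the function
        state = match state {
            State::LabelLengthOrPointer => match bytes.get(pos).copied() {
                Some(0) | None => State::Root,
                Some(len) if len <= 0x3F => {
                    pos += 1;
                    State::Label(len)
                }
                Some(other) => return Err(ParseError::Unsupported(other)),
            },
            State::Label(len) => {
                let end = pos + len as usize;
                let raw = bytes.get(pos..end).ok_or(ParseError::Truncated)?;
                let label = String::from_utf8(raw.to_vec())
                    .map_err(|_| ParseError::NotUtf8)?;
                labels.push(label.to_lowercase());
                pos = end;
                State::LabelLengthOrPointer
            }
            State::Root => return Ok(labels),
        };
    }
}
```

Deleting any one arm of the outer `match` turns into a compile error, which is the "all states are considered" guarantee in action.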

Basically, the sweet sauce here is that the 'state' can carry context implicitly in the enum as part of its tuple definition. You can obviously make that more complex than what I did here, but this was such a simple and elegant solution to a common problem.
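As one example of "more complex" context, here is a sketch of classifying the first octet into variants with named fields, plus the pointer offset calculation from RFC 1035 section 4.1.4 (the low 6 bits of the first octet followed by the whole second octet, 14 bits in total). The names `LengthOctet`, `classify` and `pointer_offset` are made up for this illustration and are not from the DNS server code.

```rust
/// What the first octet of a label tells us; each variant carries
/// the context the next step needs. Hypothetical names.
#[derive(Debug, PartialEq)]
enum LengthOctet {
    Root,
    Label { len: u8 },
    Pointer { high_bits: u8 },
    Reserved(u8),
}

fn classify(byte: u8) -> LengthOctet {
    // the top two bits select the case
    match byte & 0xC0 {
        0xC0 => LengthOctet::Pointer { high_bits: byte & 0x3F },
        0x00 if byte == 0 => LengthOctet::Root,
        0x00 => LengthOctet::Label { len: byte },
        _ => LengthOctet::Reserved(byte), // 01 and 10 prefixes
    }
}

/// Combine the 6 low bits of the first octet with the second octet
/// into the 14-bit offset from the start of the message.
fn pointer_offset(high_bits: u8, low: u8) -> u16 {
    (u16::from(high_bits & 0x3F) << 8) | u16::from(low)
}
```

For instance, the common pointer bytes 0xC0 0x0C classify as `Pointer { high_bits: 0 }`, and `pointer_offset(0, 0x0C)` is 12, the first byte after the fixed 12-byte DNS header.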

Conclusion

Rust continues to impress me while learning it. There are definitely some oddities, like enums being called enums when they really are something else. Also, I continue to fight the compiler on mutability around ownership, etc. It's not that I don't get the ownership model, I do... it's that after working with a GC in Java for so long, I have to train myself to think about it each time I pass a reference to something, and as far as I can tell 90% of the time I'm getting it wrong the first time.

Anyway, getting back to hacking DNS now.

50 comments:

Levi said...: a new type with it, you enumerate all the values that inhabit it. This is opposed to struct/tuple type definitions where you are describing a type inhabited by the Cartesian product of its constituent types' inhabitants.

It's probably also meant to be reminiscent of C, Java, etc syntax, but I am not sure the familiarity is worth the possible confusion over the differences!; 1:13 PM
Levi said...: Sorry, I posted the previous comment from my phone and I deleted the first line without noticing.

It should have said something to the effect of:

I'm not sure exactly why they chose 'enum', but it could be because when you define a new type with it,; 1:58 PM
Dan C said...: Umm... a turing machine isn't a finite state machine. The state in a turing machine isn't limited to a finite set of states.; 6:57 PM
Unknown said...: Turing Machines have an infinite tape (given memory constraints), but the states are finite. Though, I won't argue the point, b/c this is based on my education from 15 years ago, and I don't care much about the theory, just the usefulness. Happy to change that line if people nitpick ;); 10:10 AM
Levi said...: It's been a long time since my automata theory course as well, so Dan's comment made me want to refresh my memory. This page of CS notes makes the fundamental difference between finite state automata and Turing machines pretty clear: http://www.cs.hmc.edu/~keller/cs60book/12%20Finite-State%20Machines.pdf

The difference is not so much in the way you define the transition function, but in how that function interacts with input and output. Of course those differences change the sort of transition functions you construct, but it's interesting to see where they're similar and where they're different.; 9:52 PM
