Introduction to DMA

October 30, 2024 by Elecia White

I’m not going to tell you how to get DMA working on your system; I don’t know your processor. Instead, my goal is to give you some insights into how DMA works and why we use it to make our systems faster. By the way, DMA stands for direct memory access. That part will make sense soon.

Let's say you have a system that needs to transfer data from place to place. In fact, let's say you have this monitoring system, though the details aren't important. The data from the ADC goes to SPI to an SD card and to the computer via USB. The UART is used for command handling.

Block diagram of data collection embedded system where the data flows from an ADC to a computer (and local SD card).

While there are lots of boxes, the processor (inside the box) here is doing very little. It gets busy when an ADC interrupt comes and data has to flow through the system.

The hardest thing the processor has to do is copy the data from place to place. But copying things from place to place is a silly thing for a processor to do. I mean, the processor copies data from one place to another. The CPU touches each and every byte. But, it doesn’t have to.
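To make "touches each and every byte" concrete, here is a minimal sketch of what a CPU-driven copy looks like. The function name `cpu_copy` is hypothetical and not from any vendor library; it only illustrates that every byte costs the core a load and a store.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: copy len bytes the way a core without DMA would.
 * Each loop iteration is one load and one store executed by the CPU.
 * Returns the number of bytes the CPU had to touch. */
size_t cpu_copy(volatile uint8_t *dst, const volatile uint8_t *src, size_t len)
{
    assert(dst != NULL && src != NULL);

    size_t touched = 0;
    for (size_t i = 0; i < len; i++) {
        dst[i] = src[i];   /* the core reads a byte, then writes it */
        touched++;
    }
    return touched;
}
```

While this loop runs, the core can do nothing else, and the cost grows with every byte that flows through the system.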

If we wanted to use the processor for something useful (such as analyzing this pile of data), we could offload copying into the magical technology of DMA.

I have two metaphors for how DMA works. The first one is terrible but fun. The second is also terrible but also fun. Hopefully between the two, the reasoning and the concept make sense.

I really like these Inkarnate maps so let's look at a different system but in the style of an adventurer map. Please take a look at a similar system to the above, though this one has a display and more of an algorithm.

Adventurer’s map of a processor with included ADC and display port.

This map is a representation of a system with some peripherals and some memory (RAM). The CPU is in the center, connected to Memory Lake with Backenforth Falls. Our circular buffer is there, spinning buffers around and around like a water wheel.

See how everything talks to the CPU?

(And by the CPU, I mean the core of your processor, for example the Cortex-M4F part of the processor. This isn’t like when I talk about microcontrollers or microprocessors; those are the whole thing, which includes a core CPU.)

Anyway, everything talks to the CPU. The CPU doles out memory as it is needed, sending it over these busses, err, bridges. Any memory transfer or copy or math operation, it all goes through the CPU.

Imagine if these peripherals didn’t need to talk to the processor to get to Memory Lake. What if they made a tunnel through the mountains? Sure, the initial setup would be painful. But once a tunnel was done, the peripheral could put things in the lake and take them out without waiting for CPU cycles.

In this metaphor, those tunnels are DMA, direct memory access.

It is a pain to set up but it frees your processor to do things other than copy in and out of memory. Even better, it means your peripherals don’t need to wait on the CPU to have time to receive or transmit their data.

But how?!? How do you build those tunnels? (Err, set up DMA?) Well, the documentation is needed because how you set up DMA is different for each processor (and sometimes for peripherals in a processor). However, there are some commonalities. But for that I need to switch metaphors.

Are you ready? It is a big switch. Maybe take a breath or two.

Ok, say you have dry cleaning and you pick it up and take it home, put it in the closet. Whether you do it weekly or daily, it is just a stop on the way home. Easy enough. But say you get busy with work and life and wish someone else could do it for you.

However, you know how sometimes it is easier to do something yourself than to explain to someone else how to do it? How delegating something means explaining all the details?

So let’s say your boss noticed you are busy and gives you a couple of assistants, each able to do one chore at a time. Each assistant is only skilled in a few types of specific chores.

You allocate one assistant to pick up your dry-cleaning. Of course, you have to instruct the assistant:

This is my dry cleaner address.
Here’s how many items to pick up and what kind.
Here is where they go in my closet.
Here’s what to do if my closet is full.
Here is what to do in case of error.

And that’s just the general outline. When the time comes, you have to tell the assistant:

Ok, here is my current dry cleaning ticket.
Go, go pick it up now.
Text me when you are done so that I know I have clothes available to wear.

Some of this you can tell the assistant ahead of time because it doesn’t change (like where the dry cleaners is, or what to do if the closet is full). And some things may change with each chore (like the number of items to pick up and that there is something available now).
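The split between "tell the assistant once" and "tell the assistant each time" can be sketched as two C structures. Every name here (`dma_channel_config_t`, `dma_transfer_t`, `dma_transfer_is_ready`) is hypothetical; real DMA controllers expose vendor-specific registers and driver calls, so treat this as a map of the ideas rather than an API.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical callback type: how the assistant "texts" you. */
typedef void (*dma_callback_t)(void *context);

/* Told to the assistant once, ahead of time. */
typedef struct {
    const volatile void *source;  /* "This is my dry cleaner address." */
    size_t item_size_bytes;       /* "...and what kind." (byte, half-word, word) */
    bool increment_source;        /* step through the source, or reread one register */
    bool increment_destination;   /* "Here is where they go in my closet." */
    bool circular;                /* "What to do if my closet is full": wrap or stop */
    dma_callback_t on_error;      /* "What to do in case of error." */
} dma_channel_config_t;

/* Told to the assistant for each chore. */
typedef struct {
    volatile void *destination;   /* where this batch should land */
    size_t item_count;            /* "Here is my current dry cleaning ticket." */
    dma_callback_t on_complete;   /* "Text me when you are done." */
    void *context;                /* passed back to the callbacks */
} dma_transfer_t;

/* Sanity check before "Go, go pick it up now."
 * Returns true if the instructions are complete enough to start. */
bool dma_transfer_is_ready(const dma_channel_config_t *cfg,
                           const dma_transfer_t *xfer)
{
    assert(cfg != NULL && xfer != NULL);
    return cfg->source != NULL
        && cfg->item_size_bytes > 0
        && xfer->destination != NULL
        && xfer->item_count > 0;
}
```

On real hardware, the first structure usually maps to channel configuration done at startup, and the second to the few registers you rewrite before each transfer.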

I don’t know how to build a tunnel but I do know how to delegate. Sometimes DMA is harder than doing it yourself. Other times, it is a luxury to have someone else doing your chores for you.

So that's all metaphor; let's go back to reality. Here is a sequence diagram of an ADC getting data normally.

Again, the processor core touches every byte of data coming in.

On the other hand, fortified with the idea of DMA as a dry cleaning micro assistant, here is some pseudocode for setting up the DMA SPI ADC along with its sequence diagram.

In this example, I’m not incrementing the send buffer from processor to ADC, so the processor will always send 0xFF to the ADC but receive good data into an incrementing buffer. Once the transfer is complete, I’ll immediately set up another DMA transfer so that, when the ADC has data, the interrupt GPIO can fire off the DMA process that gets it. Meanwhile, the processor can do whatever needs doing until a new batch of data is available.
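The transfer described above can be sketched as a host-side simulation in C. This is not driver code for any real processor: `spi_dma_transfer_t`, `sim_spi_dma_run`, and the fake ADC are all hypothetical names, and the loop stands in for work that the DMA hardware would do without the core.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for the SPI peripheral:
 * clock one byte out, get one byte back. */
typedef uint8_t (*spi_exchange_fn)(uint8_t out);

typedef struct {
    const uint8_t *tx;   /* what to send to the ADC */
    bool tx_increment;   /* false: resend tx[0] for every byte */
    uint8_t *rx;         /* where received samples land */
    bool rx_increment;   /* true: fill rx[0], rx[1], ... */
    size_t count;        /* number of bytes to move */
} spi_dma_transfer_t;

/* Simulates what the DMA controller does on its own once started.
 * On a real part this loop does not run on the CPU. */
size_t sim_spi_dma_run(const spi_dma_transfer_t *t, spi_exchange_fn exchange)
{
    assert(t != NULL && exchange != NULL);
    assert(t->tx != NULL && t->rx != NULL);

    for (size_t i = 0; i < t->count; i++) {
        uint8_t out = t->tx[t->tx_increment ? i : 0];
        t->rx[t->rx_increment ? i : 0] = exchange(out);
    }
    return t->count;
}

/* Fake ADC for the simulation: remembers the last byte it was sent
 * and returns a counting pattern as "samples". */
uint8_t fake_adc_sample = 0x10;
uint8_t fake_adc_last_command = 0x00;

uint8_t fake_adc_exchange(uint8_t out)
{
    fake_adc_last_command = out;
    return fake_adc_sample++;
}
```

With a one-byte send buffer holding 0xFF, `tx_increment` set to false, and `rx_increment` set to true, the fake ADC sees 0xFF on every exchange while the receive buffer fills in order. In a real system, the transfer-complete interrupt would hand the full buffer to the application and set up the next transfer.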

Adding DMA adds at least one layer of complexity for configuration. But it makes up for that by reducing the processor cycles spent moving memory around. You may not need DMA, or your processor may not support DMA to all peripherals. And sometimes it is harder to set up than it is worth, but when you do need DMA, it is a very nice luxury.

For more tactical advice, Andrei Chichak had a post about using DMA on an STM32 Cortex with a UART. His introduction to DMA was also excellent, though less fanciful.

Creating Chaos and Hard Faults

June 25, 2024 by Elecia White

Creating Chaos and Hard Faults Video Presentation

The best way to understand why the processor is sending you love letters (exceptions) is to see what they look like when you aren’t also frantically trying to fix your code. This talk goes over the code necessary to cause (and debug) divide by zero, bus errors, stack overflows, and buffer overflows.

For each one, Elecia looks at the information the Cortex-M processor provides and how to use that to determine the cause of the fault. She describes how to use the information in a hard fault handler to create small core dumps to be stored after a system reboot.

This presentation is based on Chapter 9: Getting into Trouble, one of the new chapters in the second edition of Making Embedded Systems.

The slides are available in the Making Embedded Systems github repository.

The resources mentioned:

Code used in the demo: https://github.com/eleciawhite/making-embedded-systems/blob/main/Ch09_Debugging/
Introduction to Hard Faults:
- Debugging Hard Faults on ARM Cortex-M | MCU on Eclipse
- STM32 Hard Fault debugging
First handler shown from FreeRTOS: Debugging and diagnosing hard faults on ARM Cortex-M CPUs
Second handler shown from Memfault’s Interrupt: How to debug a HardFault on an ARM Cortex-M MCU | Interrupt (this is the most in-depth resource)
Adding NULL identification: Setting up the Cortex-M3/4 (ARMv7-M) Memory Protection Unit (MPU) - Sticky Bits
Smashing the Stack for Fun and Profit describes how to manipulate the stack
Buried Treasure and Map Files (and Linker Files) talk and map: https://embedded.fm/blog/mapfiles

This talk was originally presented as a keynote at the Embedded Online Conference in 2024. All of the excellent talks are available there with paid registration.

Expert Webinar: Introduction to Embedded Systems

June 12, 2024 by Elecia White

O’Reilly recently posted Expert Webinar: Introduction to Embedded Systems, a talk given by Elecia White. The slides are in the Making Embedded Systems github, a repository for all of the bonus goodies and materials that didn’t fit in the new second edition of the book.

Embedded Skills Tree

May 19, 2023 by Elecia White

The Embedded.fm Patreon Slack group and I put together an embedded skills tree with the help of Steph Piper (aka MakerQueen). You can find a template in Steph’s github repository (where she has one for 3D Printing and Modelling too!).


Teaching an Online Class!

November 05, 2021 by Elecia White

I’m teaching a class!
Check it out: classpert.com/classpertx/cohorts/making-embedded-systems

"Buried Treasure and Map Files" video is now free!

June 24, 2021 by Elecia White

I was showing someone a map file; they were kind of amazed at all this information. I’m glad I got to be there for the realization. And to help them through the first wall of impenetrable hex.

When I was invited to do a talk at the Embedded Online Conference 2020, I wanted to talk about something deeply technical, something people could potentially use in their jobs today.

Putting those two events in the same week meant I knew what I was going to present. And once I got started making a D&D style map to represent memory map files, well, it was hard to stop the giggling.

I’m very pleased that the video presentation is available to the public: embedded.fm/blog/MapFiles

Snails, Paper, and Programming: A Computational Approach to Mollusc Morphology in Origami

June 11, 2021 by Elecia White

I gave a presentation about my love of curved crease origami, math, and snails to a Harvey Mudd College mini-conference. I only had ten minutes to speak so it was… a little fast. The video is below.

The slides (available on github) show I had a few extras that got cut. I made a version of the origami snail generating Python script that can run in a browser (Turbinator.py, also available on github).

Buried Treasure and Map Files

May 20, 2021 by Elecia White

Elecia gave a talk at the Embedded Online Conference 2021 about how to use memory map files. The talk is not yet available to the public but you can see the slides and files here.

Embedded.fm TShirts!

January 27, 2021 by Elecia White

For a limited time, we are offering tshirts! The design is wild and awesome, honoring some of our favorite show titles.

Short-sleeved are here. You can also get long-sleeved tshirts. This is the back.

Imagine if all the show titles got together to have a party to build a Rube Goldberg machine.

The front of the shirt is pretty cool too!

If the robot/radio head is a variable cap and the Embedded is a diode… It’s a radio circuit!

As for the titles on the back, we have a list of show titles, all up to date and everything.

Advice for Working From Home

April 15, 2020 by Chris Svec in Engineering

Advice for working from home: find a routine, make it a habit, be flexible when it doesn't work.

We Are Having a Party! (Sept. 7th, 2019, Aptos CA)

Aptos Village Park

September 02, 2019 by Elecia White

The Embedded podcast recently released our 300th episode. We are having a party to celebrate!

If you’d like to join us, the RSVP link is in the first minute of episode 300 or you can hit the contact link on Embedded.fm and I’ll share it with you.

Giving Feedback

June 07, 2019 by Elecia White in Engineering

I often think my way is the best because it is the most obvious (to me). When I review someone else’s code, I forget that the way it is implemented is the most obvious way to the developer. We can talk about why it is different from what is obvious to me, but I should expect to often be wrong because the programmer has put more thought and effort into solving the problems. My feedback should reflect that I may be wrong because my perspective is different. Put this way, it is much easier for me to be less blunt, less definitive in my correctness.

List of All Embedded Episodes

March 25, 2019 by Elecia White in News

Do you want a list of all the Embedded episodes? Maybe you are wondering if we had a particular person on or you’re starting to get into a topic and want to hear a show about it?

ESE101: C is for Cookie, and also C

December 11, 2018 by Chris Svec in Software, Engineering

C is for Cookie, and also C.

Bang Bang Con Talk Ideas

November 12, 2018 by Elecia White

We recently recorded an Embedded show with Lindsey Kuper about the !!Con West conference. The !!Con West call for speakers closes November 30, 2018. I have some ideas for talks.

Recruiter Followup

November 08, 2018 by Elecia White

We spoke about portfolios with Anita Pagin on a recent episode of the Embedded podcast. But what is a portfolio, exactly?

The Consumer Llama and the Internet of Things

November 01, 2018 by Elecia White

The goal for this comic is to provide generic slides anyone can use to explain the different IoT methodologies, comparing and contrasting the technologies and their impact on customers.

So Many Ways To Stall

August 20, 2018 by Elecia White in News

When a question is asked, you can take your time answering. You don’t have to say the first thing that leaps into your head. It is ok to take a few seconds to review it. Or if you draw a complete blank, there are ways to stall for time as you find the words. Let me share a few of those methods.

Seventeen Machine Learning Tools

August 13, 2018 by Elecia White in Engineering

When all you have is a hammer, everything looks like nails. It’s the law of the instrument.

Love Notes to Newton

August 01, 2018 by Andrei Chichak in News, Hardware, Software

Not that long ago, tech pundits would run articles like “The biggest technology flops in history” and “Apple’s Worst Products and Biggest Failures”. These lists would always contain Apple’s Newton handheld computer. Was it a failure? I don’t think so, but you can decide for yourself.