Instance

Array
(
    [title] => Recent Forum Threads
    [title_url] => 
    [ignore_sticky] => 0
    [exclude_current] => 0
    [limit] => 10
    [sluglist] => ["jobs-dashboard"]
    [rw_opt] => Array
        (
            [widget_select] => 1
            [pageid_281769] => 1
            [pageid_281772] => 1
        )

    [display_widget_mobile] => 
    [rw_opt_exclude] => Array
        (
            [pageid_274493] => 1
            [cpt_podcast] => 1
            [cpta_podcast] => 1
            [category_16613] => 1
            [category_16631] => 1
            [taxonomy_series] => 1
            [pageid_354254] => 1
        )

    [node_id] => Array
        (
            [0] => 2
        )

)

Threads

Recent Forum Threads

Radically Improved Battery, Uses Silicon instead of Graphite

latest reply by count on July 16, 2025

started by Arthur Hanson on April 7, 2021
Why Samsung is losing its top talent to SK hynix

latest reply by benb on July 16, 2025

started by Fred Chen on July 15, 2025
Intel Nova Lake-S uses TSMC N2 process Tape-Out

latest reply by Daniel Nenni on July 16, 2025

started by Daniel Nenni on July 13, 2025
The United States' NSTC EUV Accelerator is coming to Albany

latest reply by Fred Chen on July 15, 2025

started by Daniel Nenni on October 31, 2024
Semiconductor Fundamentals Are All Weak

latest reply by hist78 on July 15, 2025

started by Daniel Nenni on July 14, 2025
The $500 billion fab race: US ramps up, China closes in

latest reply by hist78 on July 15, 2025

started by Daniel Nenni on July 15, 2025
Nvidia CEO Jensen Huang says 'we don't have to worry' about the Chinese military using US chips to improve their capabilities because 'they simply can

latest reply by swka on July 15, 2025

started by Daniel Nenni on July 15, 2025
Intel’s Pivot: Why It’s Betting on UMC—Not TSMC—in the Legacy Node Wars

latest reply by Artificer60 on July 15, 2025

started by XYang2023 on July 14, 2025
Will Retail Automate?

latest reply by blueone on July 15, 2025

started by Arthur Hanson on July 15, 2025
Intel’s CEO: ‘We are not in the top 10’ of leading chip companies

latest reply by XYang2023 on July 15, 2025

started by osnium on July 10, 2025

Recent Article Comments

Moore’s Law Wiki
Yes, I am trying to teach AI how to do semiconductor wikis and put the Wiki back in SemiWiki. Should…

— Daniel Nenni on July 14, 2025
TSMC N3 Process Technology Wiki
I am trying to teach AI to speak semiconductor wikis. The problem is the date of the references. A 2023…

— Daniel Nenni on July 14, 2025
TSMC N3 Process Technology Wiki
Hmm - what's the source for 0.015-0.016? -- this thread shows 0.0199 (N3B) and 0.021 (N3E) https://semiwiki.com/forum/threads/tsmc-officially-halts-sram-scaling.17223/ Perhaps this source…

— Xebec on July 14, 2025
Moore’s Law Wiki
Are these AI Generated? :)

— Xebec on July 14, 2025
TSMC N3 Process Technology Wiki
It should be 25-30% smaller? Process Node Typical SRAM Cell Size Density Improvement TSMC N5 ~0.021 µm² — TSMC N3…

— Daniel Nenni on July 14, 2025
TSMC N3 Process Technology Wiki
~1.6x denser vs. N5 SRAM I thought the scaling was more like 1.05X? (Various threads here on 'SRAM scaling dead…

— Xebec on July 14, 2025
Facing the Quantum Nature of EUV Lithography
This presentation considers 5 nm Gaussian acid blur: https://www.youtube.com/watch?v=MYLdE69RDBg

— Fred Chen on July 7, 2025
Flynn Was Right: How a 2003 Warning Foretold Today’s Architectural Pivot
Appreciate your take, Rahul. You’re absolutely right that market scale drives architectural investment—scalar dominated when desktop and enterprise ruled, and…

— Jonah McLeod on June 29, 2025
Flynn Was Right: How a 2003 Warning Foretold Today’s Architectural Pivot
Well.. I found this to be a funny article. Flynn's critique is fine and good...but not really the driving factor…

— Rahul Razdan on June 29, 2025
Reachability in Analog and AMS. Innovation in Verification
Apologies for that slip-up on our part. Failing memories!

— Bernard Murphy on June 27, 2025

WP_Term Object
(
    [term_id] => 3611
    [name] => IoT
    [slug] => iot-internet-of-things
    [term_group] => 0
    [term_taxonomy_id] => 3611
    [taxonomy] => category
    [description] => Internet of Things
    [parent] => 0
    [count] => 551
    [filter] => raw
    [cat_ID] => 3611
    [category_count] => 551
    [category_description] => Internet of Things
    [cat_name] => IoT
    [category_nicename] => iot-internet-of-things
    [category_parent] => 0
)

July 27, 2015 by Majeed Ahmad

6 Memory Considerations for IoT Designs Built Around Cortex-M7 MCUs

6 Memory Considerations for IoT Designs Built Around Cortex-M7 MCUs
by Majeed Ahmad on 07-27-2015 at 12:00 pm
Categories: IoT

Tightly coupled memory (TCM) is a salient feature in the Cortex-M7 microcontrollers as it boosts the MCU performance by offering single cycle access for the CPU and by securing the high-priority latency-critical requests from the peripherals.

The early MCU implementations based on the ARM’s M7 embedded processor core—like Atmel’s SAM E70 and S70 chips—have arrived in the market. So it’d be worthwhile to have a closer look at the configurable memory aspects of M7 microcontrollers and see how the TCMs enable the execution of deterministic code and fast transfer of real-time data at the full processor speed.

Cortex-M7: ARM’s embedded processor that features TCM

Here are some of the key findings regarding the advanced memory architecture of Cortex-M7 microcontrollers.

1. TCM is Configurable
First and foremost, the size of TCM is configurable. TCM, which is part of the physical memory map of the MCU, supports up to 16 MB of tightly coupled memory. The configurability of the ARM Cortex-M7 core allows SoC architects to integrate a range of cache sizes. So that industrial and Internet of Things (IoT) product developers can determine the amount of critical code and real-time data in TCM to meet the needs of the target application.

The M7 architecture doesn’t specify what type of memory or how much memory should be provided; it leaves these decisions to designers implementing M7 in a microcontroller as a venue for differentiation. Consequently, a flexible memory system can be optimized for performance, determinism and low latency, and thus can be tuned to specific application requirements.

2. Instruction TCM
Instruction TCM or ITCM implements critical code with deterministic execution for real-time processing applications such as audio encoding/decoding, audio processing and motor control. The use of standard memory will lead to delays due to cache misses and interrupts, and thus will hamper the deterministic timing required for real-time response and seamless audio and video performance.

The deterministic critical software routines should be loaded in a 64-bit instruction memory port (ITCM) that supports dual-issue processor architecture and provide single-cycle access for the CPU to boost MCU performance. However, developers need to carefully calibrate the amount of code that need zero-wait execution performance to determine the amount of ITCM required in an MCU device.

The anatomy of TCM inside the M7 architecture

3. Data TCM

Data TCM or DTCM is used in fast data processing tasks like 2D bar decoding and fingerprint and voice recognition. There are two data ports (DTCMs) that provide simultaneous and parallel 32-bit data accesses to real-time data. Both instruction TCM and data TCM—used for efficient access to on-chip Flash and external resources—must have the same size.

4. System RAM and TCM
System RAM, also known as general RAM, is employed for communications stacks related to networking, field buss, high-bandwidth bridging, USB, etc. It implements peripheral data buffers generally through direct memory access (DMA) engines and can be accessed by masters without CPU intervention.

Here, product developers must remember the memory access conflicts that arise from the concurrent data transfer to both CPU and DMA. So developers must set clear priorities for latency-critical requests from the peripherals and carefully plan latency-critical data transfers like the transfer of a USB descriptor or a slow data rate peripheral with a small local buffer. Access from the DMA and the caches are generally burst to consecutive addresses to optimize system performance.

It’s worth noting that while system memory is logically separate from the TCM, microcontroller suppliers like Atmel are incorporating TCM and system RAM in a single SRAM block. That allows IoT developers to share general-purpose tasks while splitting TCM and system RAM functions for specific use cases.

A single SRAM block for TCM and system memory allows higher flexibility and utilization

5. TCM Loading
The Cortex-M7 uses a scattered RAM architecture to allow the MCU to maximize performance by having a dedicated RAM part for critical tasks and data transfer. The TCM might be loaded from a number of sources, and these sources aren’t specified in the M7 architecture. It’s left to the MCU designers whether there is a single DMA or several data loading points from various streams like USB and video.

So it’s imperative that, during the software build, IoT product developers identify which code segments and data blocks are allocated to the TCM. It’s done by embedding programs into the software and by applying linker settings so that software build appropriately places the code in memory allocation.

6. Why SRAM?
Flash memory can be attached to a TCM interface, but the Flash cannot run at the processor clock speed and will require caching. And that will cause delays when cache misses occur, threatening the deterministic value proposition of the TCM technology.

DRAM technology is a theoretical choice but it’s cost prohibitive. That leaves SRAM as a viable candidate for fast, direct and uncached TCM access. SRAM can be easily embedded on a chip and permits random accesses at the speed of the processor. However, cost-per-bit of SRAM is higher than Flash and DRAM, which means it’s critical to keep the size of the TCM limited.

Atmel’s M7 MCUs

Take the case of Atmel’s SMART SAM E70, S70, V70/71 microcontrollers that organize SRAM into four memory banks for TCM and System SRAM parts. Atmel has recently started shipping volume units of SAM E70and S70 microcontrollers for IoT and industrial markets and claims that these MCUs provide 50 percent better performance than the closest competitor.

Large configurable SRAM enables robust memory and connectivity features

Atmel’s M7-based microcontrollers offer up to 384 KB of embedded SRAM that is configurable as TCM or system memory for providing IoT designs with higher flexibility and utilization. For instance, E70 and S70 microcontrollers organize 384 KB of embedded SRAM into four ports to limit memory access conflicts.

Atmel’s M7 microcontrollers allocate 256 KB of SRAM for TCM functions—128 KB for ITCM and DTCM each—to deliver zero wait access at 300 MHz processor speed, while the remaining 128 KB of SRAM can be configured as system memory running at 150 MHz. However, the availability of an SRAM block organized in the form of a memory bank of 384 KB means that both System SRAM and TCM can be used at the same time.

The large on-chip SRAM of 384 KB is also critical for many IoT devices since it allows them to run multiple communication stacks and applications on the same MCU without adding external memory. That’s a significant value proposition in the IoT realm because avoiding external memories lowers the BOM cost, reduces the PCB footprint and eliminates the complexity in the high-speed PCB design.

Comments

There are no comments yet.

You must register or log in to view/post comments.

Moore’s Law Wiki
Yes, I am trying to teach AI how to do semiconductor wikis and put the Wiki back in SemiWiki. Should…

— Daniel Nenni on July 14, 2025
TSMC N3 Process Technology Wiki
I am trying to teach AI to speak semiconductor wikis. The problem is the date of the references. A 2023…

— Daniel Nenni on July 14, 2025
TSMC N3 Process Technology Wiki
Hmm - what's the source for 0.015-0.016? -- this thread shows 0.0199 (N3B) and 0.021 (N3E) https://semiwiki.com/forum/threads/tsmc-officially-halts-sram-scaling.17223/ Perhaps this source…

— Xebec on July 14, 2025
Moore’s Law Wiki
Are these AI Generated? :)

— Xebec on July 14, 2025
TSMC N3 Process Technology Wiki
It should be 25-30% smaller? Process Node Typical SRAM Cell Size Density Improvement TSMC N5 ~0.021 µm² — TSMC N3…

— Daniel Nenni on July 14, 2025
TSMC N3 Process Technology Wiki
~1.6x denser vs. N5 SRAM I thought the scaling was more like 1.05X? (Various threads here on 'SRAM scaling dead…

— Xebec on July 14, 2025
Facing the Quantum Nature of EUV Lithography
This presentation considers 5 nm Gaussian acid blur: https://www.youtube.com/watch?v=MYLdE69RDBg

— Fred Chen on July 7, 2025
Flynn Was Right: How a 2003 Warning Foretold Today’s Architectural Pivot
Appreciate your take, Rahul. You’re absolutely right that market scale drives architectural investment—scalar dominated when desktop and enterprise ruled, and…

— Jonah McLeod on June 29, 2025

Search Semiwiki

Recent Forum Threads

Recent Article Comments

Recent Podcast Episodes

Comments

Recent Forum Threads

Recent Article Comments