NVIDIA GeForce GTX 1080 Poised To Claim The Gaming And VR Performance Crown

NVIDIA GeForce GTX 1080 Poised To Claim The Gaming And VR Performance Crown
by Patrick Moorhead on 06-08-2016 at 12:00 pm

NVIDIA has been teasing the promise of Pascal for years, but this year Pascal’s performance became much more real. With the announcement of the Tesla P100 based on Pascal, NVIDIA was able to show the neural-network world what Pascal was really capable of. However, that chip is very different from the gaming-focused Pascal chip that NVIDIA launched on May 6th in Austin at an event VR analyst Anshel Sag and I both attended. NVIDIA introduced a plethora of new features and improvements that enable their new Pascal-based GeForce GTX cards to really shine against the competition. NVIDIA’s own Founder and CEO, Jen-Hsun Huang, deemed the GTX “1080 is the new King.”


NVIDIA CEO Jen-Hsun Huang at the GeForce GTX 1080 launch event held in Austin, TX (Photo Credit: Patrick Moorhead)

GTX 1080 “firsts”
The Pascal GPU architecture is NVIDIA’s most ambitious ever. The biggest version of the Pascal family is the GP100 inside of the Tesla P100 high performance computing (HPC) GPU. That version of Pascal features 15.3 billion transistors, the most of any chip, ever created by anyone, ever. The newly announced GeForce GTX 1080 features the GP104 graphics chip with fewer transistors paired with 8GB of Micron GDDR5X, a brand new type of graphics memory. The improvements in memory bandwidth and latency mirror NVIDIA’s improvements in the Tesla P100 with the use of HBM2. However, while AMD didintroduce HBM in their Fiji family of GPUs, it was limited to 4GB of VRAM capacity and this new generation that NVIDIA is using does not have such a limitation. Additionally, AMD is not using GDDR5X anywhere in any of their cards, so NVIDIA is the first to introduce this technology to the market in a product.

16nm FinFET helping
The biggest reason why NVIDIA is able to cram so many transistors inside of a relatively small chip is thanks to the new 16nm FinFET process from TSMC. This process allows for NVIDIA to not only build a small chip but to pack it with plenty of performance and to do it at a very low wattage. In fact, NVIDIA claims to be cramming approximately 29% more performance out of the GeForce GTX 1080 than they do from their most powerful graphics card right now, the TITAN X. Not only is NVIDIA claiming to be about 29% faster than the previous generation, they are able to accomplish this with only 180 watts of power. This represents a power reduction of 28% while at the same time delivering a 29% boost in performance. Traditionally, GPU manufacturers have had to give up some power savings for additional performance or vice versa, but with the new Pascal architecture and 16nm FinFET from TSMC you get the best of both worlds.

VR-focused performance and features
The GeForce GTX 1080 delivers even more performance once you start looking at performance in Virtual Reality (VR). It appears the entire GTX 1080 and Pascal were specifically designed with VR in mind. This includes support for hardware asynchronous computing as well as some of their VR Works software features like simultaneous multi-projection (SMP). NVIDIA claims that with the added hardware and software features on the GeForce GTX 1080 actually bring its VR performance to approximately two times that of the TITAN X, NVIDIA’s next fastest graphics card.

However, in order to accomplish such performance, game developers have to implement NVIDIA’s Game Works proprietary SDK which includes their VR Works that enables the use of features like SMP. As many middleware programs go, the chance that it will be broadly adopted is to be determined, but there’s a good chance that there will be a handful of games that do and those that do could benefit from this performance boost. This is the promise of NVIDIA’s new technology inside of Pascal and the software that accompanies it. Plus, the VR industry is looking for anythingto boost the category, so I expect developers and game engines will support both SMP and Async. Already, VRWorks is integrated into game engines today with features like VR SLI supported.

Hardware Async compute
Prior to Pascal, NVIDIA’s Maxwell architecture supported asynchronous compute in hardware. Improvements to work scheduling in Pascal increases the overall performance in VR and improves the overall user experience allowing both compute and graphics functions to happen as they need to. This feature is extremely important in order to deliver a smooth VR experience and I suspect that NVIDIA knows this and has made sure their architecture supports it. NVIDIA prides themselves in the best possible gaming experience, so it only seems logical that they would want to deliver the best VR experience possible with the GTX 1080.

Exceptional pricing for high-end card
In addition to all of the features in the GeForce GTX 1080 that make it so VR friendly and power-friendly, NVIDIA’s new GPU is also extremely price friendly. The “Founder’s Edition” from NVIDIA comes in at a relatively low $699, which is a solid $400 less than their current top-end card the TITAN X, for more performance. NVIDIA also introduced the Founder’s Edition GTX 1070 that will be $379, which claims approximately TITAN X performance for about 1/3 the price. These new GPUs from NVIDIA deliver on performance, power and price, they are a complete win for virtually any gamer looking to spend more than $300 on a GPU. They will also deliver the best possible performance today, even if NVIDIA’s claims are overstated.

NVIDIA should continue to dominate over $300
NVIDIA is trying to capture the entire segment of the market above $300 that represents the “enthusiast segment”, which is about 20% of the overall GPU unit market volume and where the highest margins and revenue are. The biggest market segment is the “performance” segment which is below $300 and while it represents the most units sold this is where margins are thinner in this segment.
Unless something unforeseen happens, which I don’t think will happen, I believe NVIDIA should continue to dominate in this high ASP, lower units market above $300.

The real fight will be from $149- $299
GPU’s that are more than $150 and less than $300 are the sweet spot for unit volume and unit market share which is why NVIDIA will likely need to extend the GTX 1000 family down to something like a GTX “1060” soon. This will be especially important for NVIDIA because Advanced Micro Device’s upcoming Polaris GPU will likely be squarely aimed at this price segment in order to recapture unit market share and potentially profits. After all, this is Advanced Micro Device’s self-proclaimed “year of graphics”.
If NVIDIA can successfully launch and price a GTX “1060” within this price segment they can defend their market share from Advanced Micro Devices in this price point, even if they effectively own the high-end. NVIDIA’s market share position is strong right now, but they already have a pretty strong command of the high-end and that’s why it’s important that they can launch into the lower price segments as well.
While AMD hasn’t yet launched Polaris, I believe it’s going to be an absolute fight from $149 to $299, so get ready for it.

Wrapping up
The NVIDIA GeForce GTX 1080 and GTX 1070 are absolutely marvels of engineering from a company that knows what gamers want and who has been executing incredibly for the last few years. The company are delivering on the promise of Pascal and 16nm FinFET and they are even doing all of this while bringing down the price of their GPUs compared to the previous generation.

These new cards are important for the PC VR category as they radically improve the price-experience curve. While HTC is delivering and Oculus has been choking, at least we know that higher-end VR will look much better from a graphics perspective.

NVIDIA’s launch of the GTX 1080 has been extremely well received by gamers around the world and those who wish to see high performance and lower cost VR solutions. NVIDIA has without a doubt introduced what appears to be the fastest GPU for VR and it will be interesting to see how it actually stacks up in real world scenarios.

Thankfully, NVIDIA has provided us with a GTX 1080 in order for us to test their claims ourselves. NVIDIA has finally, fully committed themselves to VR and they don’t appear to be stopping any time soon, I expect to see more announcements in the VR space from NVIDIA as they continue to push Pascal through their different product lines.

More from Moor Insights and Strategy


TSMC Update at #53DAC!

TSMC Update at #53DAC!
by Daniel Nenni on 05-31-2016 at 4:00 pm

TSMC is having an interesting year for sure. I was at the TSMC Symposium in Hsinchu last week and everyone was talking about the new 16FFC process. Silicon is out and it is exceeding expectations leading some people (me included) to believe that TSMC 16FFC will be the next TSMC 28nm in regards to popularity. To be clear, 16FFC is currently the “BEST” process node in regards to PPPA (price, performance, power, and area) available today, absolutely.

The proof is in the pudding of course and that pudding will arrive via the iPhone 7 this fall and yes, I am buying an iPhone 7 Pro to match my iPad Pro which I use every day. Netflixing on an iPad Pro at 30,000+ feet, priceless! It should have a “TSMC Inside” sticker on it for sure.

Speaking of FinFETs, we have published 64 FinFET related blogs thus far starting with Tom Dillinger’s three part Introduction to FinFET Technology series. The total views for the SemiWiki FinFET blogs exceeds 500k which is a lot of reading. FinFET blogs also have low bounce rates and high time-on-page numbers which means they are very engaging. Designing with FinFETs is still a hot topic so there are certainly more blogs to come.

At DAC, TSMC pioneered the partner theater allowing the fabless semiconductor ecosystem to shine, and this year it will be no different. You can see the latest TSMC Theater schedule HERE. If you are looking for me on the DAC exhibit floor that would be a good place to start. Or wherever there is free food.

Speaking of free food, TSMC is also very busy with other DAC activities that should be of interest. You can see the agenda HERE and please note that it includes breakfast, lunch, and dinner presentations so you can also find me there enjoying the free food.

As a pre #53DAC “Designing with FinFET” primer you should catch the TSMC and Solido Collaborate for Variation-Aware Design of Memory and Standard Cell at Advanced Process Nodes webinar on Wed, June 1st, 2016 10:00 AM – 11:00 AM PDT. If the time does not work for you, sign up anyway and they will send you a link to the replay:

“Variation effects have an increasing impact on advanced process nodes, and at each, new sources of variation must be considered. Furthermore, increased competition is forcing tighter design margins to make high-performance, low-power, low-cost products. Designers must now do more variation analysis than ever to achieve these tighter margins, using advanced variation-aware technology for speed, accuracy and coverage to deliver competitive chips on schedule.

This webinar will discuss how TSMC and Solido collaborate to offer variation-aware design techniques for memory and standard cell with TSMC advanced processes using Solido’s new Variation Designer 4.”

And don’t forget that SemiWiki will again be hosting a DAC Networking reception on Wednesday night from 6:00pm to 7:00pm in the Trinity Street Foyer. This year we will be giving away copies of our new book “Prototypical”. My beautiful wife and I hope to see you there!


How TSMC Tackles Variation at Advanced Nodes

How TSMC Tackles Variation at Advanced Nodes
by Pawan Fangaria on 05-27-2016 at 12:00 pm

The design community is always hungry for high-performance, low-power, and low-cost devices. There is emergence of FinFET and FDSOI technologies at ultra-low process nodes to provide high-performance and low-power requirements at lower die-size. However, these advanced process nodes are prone to new sources of variation. Moreover, cutting-edge designs with best PPA (Power, Performance, and Area) leave very less design margins.

In such a situation with high variation and low design margin, the designers have to struggle doing more variation analysis, thus impacting design schedule. To achieve a successful design closure in time, it’s important that the variation analysis tool must be robust to provide high performance, accuracy, and coverage.

In 2015, at 52[SUP]nd[/SUP] DAC, Cypress, Applied Micro Circuits, and Microsemi had presented their successful stories about dealing with variations in their designs. They used Solido’s Variation Designer which is scalable over large number of process variables and prioritizes simulations for most-likely-to-fail cases. I had blogged about this at that time, the link is provided below; the blog also contains the links to their video presentations.


Over the year, the Variation Designer is further improved in verification speed, accuracy, and coverage for leading-edge designs with high-performance, low-power, and low-voltage. The Variation Designer platform has a very efficient variation debugging environment. This is the next generation ‘Variation Designer 4’ from Solido.

Solido will be coming up with their new developments in this year’s 53[SUP]rd[/SUP] DAC as well, but before that I wanted to highlight how TSMCand Solido are collaborating to realize variation-aware designs at advanced process nodes.

TSMC and Solido are jointly offering the following free webinar

TSMC and Solido Collaborate for Variation-Aware Design of Memory and Standard Cell at Advanced Process Nodes

Abstract:
Variation effects have an increasing impact on advanced process nodes, and at each, new sources of variation must be considered. Furthermore, increased competition is forcing tighter design margins to make high-performance, low-power, low-cost products. Designers must now do more variation analysis than ever to achieve these tighter margins, using advanced variation-aware technology for speed, accuracy and coverage to deliver competitive chips on schedule. This webinar will discuss on how TSMC and Solido collaborate to offer variation-aware design techniques for memory and standard cell with TSMC advanced processes using Solido’s new Variation Designer 4.


Speakers Jacob Ou, Technical Marketing Manager at TSMC (on left) andKristopher Breen,VP Customer Applications at Solido

Date: June 1, 2016
Time: 10am Pacific
Duration: 55 minutes

Click here to register!

Also read: Moving with Purpose for Certainty


More Articles from Pawan


ARM tests out TSMC 10FinFET – with two cores

ARM tests out TSMC 10FinFET – with two cores
by Don Dingee on 05-25-2016 at 4:00 pm

About 13 months ago, the leak blogs posted news of “Artemis” on an alleged ARM roadmap slide, supposedly a new 16FF ARM core positioned as the presumptive successor to the Cortex-A57. Now, we’re finding out what “Artemis” may actually be, inside a multi-core PPA test chip on TSMC 10FinFET. Continue reading “ARM tests out TSMC 10FinFET – with two cores”


TSMC Leads Again with 3-D Packaging!

TSMC Leads Again with 3-D Packaging!
by Daniel Nenni on 05-24-2016 at 4:00 pm

Continuing to find new ways to extend Moore’s Law, the foundry and technology leader is ready to show off its wafer level system integration prowess with two scalable platforms targeting key growth markets.

CoWoS® (Chip-On-Wafer-On-Substrate) goes after high-performance applications, providing the highest bandwidth and pin count in a large package size. CoWoS is well-suited for diverse markets including graphics, networking and high-performance computing.

InFO, on the other hand, offers an ideal fit for today’s high-volume mobile, consumer and IoT devices that require compactness, integration flexibility and cost-effectiveness. Compared to existing options, InFO delivers a 20% thinner package, 20% performance gain and a 10% improvement in power dissipation. Based on wafer molding and metal process without a substrate, InFO’s reduced thickness and optimized performance make it a superior replacement for traditional Flip Chips.

With molding and metal between the logic die and the package I/Os there is neither an interposer nor a separate package – the metal and molding compound is the package. With its 5-micron metal pitch and no substrate, InFO makes for a very slim package (less than 1mm), reducing the thickness of smartphones and wearables for example. TSMC has also introduced InFO-POP with a DRAM die connected by a new “Through-InFO-Via,” and InFO_S that integrates multiple dies and will be launched by the end of 2016.

The following picture shows a cross-sectional view of an InFO PoP technology platform, with the logic chip at the bottom and a standard, industry-available DRAM package. The technologies are integrated using TIV to produce the thinnest solution in the industry. InFO PoP enables a thinner PoP stack with better routing density, higher operating frequency (Fmax), higher memory bandwidth DRAM and better heat dissipation.

In the critical area of InFO design support, TSMC helped pioneer EDA solutions for congruent IC and package design, including packaging layout and DRC signoff, along with its Open Innovation Platform® (OIP) partners last year. This ensures that InFO designs are fully compliant with TSMC’s packaging design rules and advances the company’s plans to provide a complete InFO design flow for its customers. Through OIP, the company is expanding InFO tool support, including electrical analysis and signoff such as RLC extraction for designers to analyze the parasitic impacts from InFO and its neighboring layers. The analysis of electrical migration and IP drop are also essential to ensure design reliability for the multiple dies on InFO. In addition, TSMC and its ecosystem partners are enhancing physical implementation with inter-die connection and physical signoff with inter-die DRC and LVS solutions.

To serve its customers as high-performance computing and mobile markets accelerate their pace of innovation, TSMC plans to invest not only on the front end silicon side, but the backend technology as well. The company has completed a new facility in Longtan InFO manufacturing and will begin volume production in 2Q16.


What Does an MPW and a Pizza Have in Common?

What Does an MPW and a Pizza Have in Common?
by Daniel Nenni on 05-23-2016 at 12:00 pm

Design starts are critical to the growth of the semiconductor industry so enabling them is a common theme on SemiWiki. One thing we have not covered in detail is multi-project wafer services (MPW) which is the equivalent of ride sharing through the initial mask and wafer process. Larger semiconductor companies already do this internally but what about the rest of the world, especially budget constrained Universities? We need their design starts too!

Let’s face it, the costs of mask sets are increasing with every node. Currently, a 6mm[SUP]2[/SUP] area tile at 28nm can cost more than $100,000 so why not share the ride with someone that has similar requirements? The bigger question is how do I find that special someone? Of course there is an app for that (think Tinder/Uber/AirBnB/etc… for an MPW).

eSilicon MPW Explorer: Fast, Accurate Quotes
Evaluate Options and Get Fast, Accurate Quotes for Multi-Project Wafer Shuttle Services. Quote and Compare: Free, Automated Online Multi-Project-Wafer Quote System Delivers Instant Pricing…

Here are some recent examples:

180nm MPW Sharing Platform:

  • Flavor: TSMC 180nm MS RF GP
  • Metal stack: 1P6M_4x1u (40KA top metal thickness)
  • I/O: 3.3V
  • MiM cap density (if used): 2fF/um[SUP]2[/SUP]
  • Price: $1,000/mm[SUP]2[/SUP]; 5mm[SUP]2[/SUP]minimum area

40nm MPW Sharing Platform

  • Flavor: TSMC 40nm MS RF G
  • Metal stack: 1P10_7x1z1u
  • I/O: 1.8V
  • Price: $7,500/mm[SUP]2[/SUP]; 1mm[SUP]2[/SUP]minimum area

eSilicon even hosts tapeout parties for Universities: MPW Tapeout Parties & eSilicon MPW Team (Arizona State University and Texas A&M January 2016, UCLA February 2016, University of Illinois, Urbana-Champaign March 2016, University of Minnesota April 2016).

For you Millennials, the term tapeout comes from a prehistoric time when we used to stream the design data to magnetic tape reels that were then sent to photomask facilities for processing.

eSilicon Multi-Project Wafer Service FAQ:

What does the multi-project wafer (MPW) price include?
Services included are listed in Appendix B of your MPW quote.

Can I get my MPWs packaged?
Yes, standard MPW die packages are available using our online form for quick cost comparisons. You can also package some of your die and receive the rest in bare die form if you like. If you are interested in other standard or custom IC packages, please contact the eSilicon sales manager assigned to you in your account confirmation email. We can probably help you. Please visit our Multi-Project Wafer Services overview page for an up-to-date list of our online MPW package offering.

What about testing? Can I get my parts tested before delivery?
Yes, testing services are available on request. Please contact the eSilicon sales manager assigned to you in your confirmation email for details.

Can I find out when the various foundries run their MPWs?
Yes, please, access the MPW schedules link on the top menu of the MPW Explorer interface.

Can I cancel an MPW reservation?
Yes, MPW reservations can be canceled. The process and associated fees, if any, are as described in Section 5 of Appendix D of your MPW quote.

Do I need an export certification?
It depends on the application. eSilicon will provide you with a very simple questionnaire to determine what, if anything, you need. This form will soon be online.

On the quote request form, what is “MPW tile size?”
“MPW tile size” is the minimum block size for an MPW order at a specific foundry and technology. For example, at TSMC 65GP, the minimum is 12mm2. Normally, if a customer orders an MPW with a 6.1mm2 die area at TSMC 65GP, they will be invoiced for a 12mm2 die area.

Is there any way to reduce the “MPW tile size”
Yes, eSilicon has developed a worldwide MPW sharing program. We can work with you to find other customers to share the cost of the MPW tile of your choice. Please visit our MPW Sharing Page to see the latest sharing opportunities.

Can eSilicon offer MPWs from other foundries or technologies besides the ones listed?
Yes, we are always working to add new foundries. Contact us at STAR@eSilicon.com with the foundry you’re interested in and we’ll give you the status.

Sign Up Log In

About eSilicon
eSilicon guides customers through a fast, accurate, transparent, low-risk ASIC journey, from concept to volume production. Explore your options online with eSilicon STAR tools, engage with eSilicon experts, and take advantage of eSilicon semiconductor design, custom IP and IC manufacturing solutions through a flexible engagement model. eSilicon serves a wide variety of markets including the communications, computer, consumer, industrial products and medical segments. Get the data, decision-making power and technology you need for first-time-right results. www.esilicon.com


EUV is coming but will we need it?

EUV is coming but will we need it?
by Scotten Jones on 04-12-2016 at 4:00 pm

I have written multiple articles about this year’s SPIE Advanced Lithography Conference describing all of the progress EUV has made in the last year. Source power is improving, photoresists are getting faster, prototype pellicles are in testing, multiple sites around the world are exposing wafers by the thousands and more. The current thinking is that EUV will be ready for production around 2018. All of this is very promising but while we have been waiting for EUV the industry has been moving on and a possible scenario is emerging where by the time EUV is available it won’t be very useful. In the balance of this article I will lay out a possible scenario where changes in device structures and fabrication processes could make EUV largely unnecessary.

My Advanced Lithography Articles summarizing the recent progress of EUV are available here:

There are three major product categories that drive capital equipment purchases in the semiconductor industry today, NAND Flash, DRAM and Logic.

For many years NAND Flash drove the requirement for the latest lithography tools. 2D NAND Flash devices went through lithography shrinks yearly eventually reaching 16nm devices manufactured in high volume with Self Aligned Quadruple Patterning (SAQP), but difficulties with 2D NAND device scaling and the cost of the complex patterning schemes required have brought 2D NAND scaling to an end. Specifically, adjacent cell interference, control to floating gate coupling and the shrinking number of electrons in a cell are just some of the device related issues. The solution to this issue for NAND has been the move to 3D. 3D NAND creates strings of NAND cells vertically with the cells created by alternating layers of material deposited using CVD techniques. The lithography requirements for 3D NAND are relaxed, for example Samsung’s 32-layer part has only one double patterned layer. Scaling is accomplished by adding layers, not by shrinking the photolithography defined dimensions. It is expected that scaling to >100 layers will yield devices with over 1Tb of capacity. 3D NAND has therefore made EUV unnecessary for NAND.

DRAM has followed a path similar to 2D NAND with yearly shrinks and the use of complex multi-patterning schemes. Recently DRAM scaling has slowed due to device scaling issues. DRAM stores values as a charge or absence of charge on a capacitor fabricated in series with an access transistor that controls the capacitor. Access transistors need a relatively long channel length to minimize leakage. This has led to a variety of access transistor structures such as RCAT, SRCAT and Saddle fin. The next step in access transistor scaling is expected to be VCAT but to-date fabrication of the vertical VCAT has been difficult to achieve. In parallel to this the DRAM capacitors need to scale down in horizontal area while maintaining a minimum acceptable capacitance value. Capacitor scaling to-date has involved vertical structures, rough surfaces and high-k dielectrics. Further vertical scaling has been limited by mechanical issues. and there is also a fundamental trade-off between the dielectric constant (k) of a material and band gap. As k increases the band gap decreases leading to leakage problems. Achieving acceptable leakage through the capacitor constrains the materials that can be used. There are some options still available, for example bit line optimization may allow smaller capacitance values to be used and there are rumors of a new film. At present the device scaling issues have moved DRAM away from being a leading candidate for EUV usage. DRAM also appears to be a leading area of Directed Self Assembly (DSA) research.

Longer term a DRAM alternative is needed. Conventional wisdom is that STT MRAM will eventually replace DRAM. To-date MRAM density and therefore cost is not competitive with DRAM (and there are other developmental issues). MRAM cells are fabricated in the metal layers over logic devices opening up the possibility to move to some kind of 3D Structure, possibly similar to the recently disclosed 3D XPoint memory (more on that later).

In the logic space the leading companies, Intel, TSMC, Samsung and Global Foundries are all in production of 16nm/14nm FinFETs. 10nm is expected to start to enter use in late 2016 at the foundries and in late 2017 at Intel. TSMC is currently forecasting that 7nm will be available in late 2017. TSMC is guiding that they will “exercise” EUV at 10nm for 5nm use. Intel is leaving the door open on EUV use at 7nm and assuming they don’t produce 7nm until 2019 or later that would make sense. Global Foundries has said they are developing 7nm based on what they can reasonably do without EUV and EUV would be a possible second generation 7nm cost reduction. All of this lines EUV up for a projected late 7nm node or 5nm node insertion.

Against this backdrop it is interesting to look at the evolution of logic devices. Intel introduced FinFETs at 22nm, shrunk them for their second generation at 14nm and they are guiding that at 10nm the third generation FinFETs will not have new materials. 16nm/14nm at the foundries was the first generation FinFET for all of them, 10nm will be the second generation and 7nm the third generation FinFETs for them (we should note here that from a pitch perspective the foundries 7nm “node” is similar to Intel’s 10nm node). At one time I thought we might start to see FinFETs with high mobility channels by 7nm or possibly even 10nm but due to a variety of challenges achieving high performance with high mobility channels in actual devices and the challenge of changing an existing structure to a new material I am now thinking FinFETs will likely stay with silicon channels until they are replaced by a new device. This leads to the question of when we might see a new devices and what it might look like.

IMEC is one of, if not the leading semiconductor technology research institution in the world. IMEC appears to be settling in on stacked horizontal nanowires as the successor to FinFETs. The devices experts I talk to are also optimistic on this approach. Horizontal nanowires are fabricated by depositing a stack of alternating materials using CVD techniques and then pattering them. This technique can create a stack of multiple nanowires. One really intriguing possibility is for example to create a 4 nanowire stack where 2 wires are NMOS and 2 are PMOS. This would yield a stacked CMOS devices and be equivalent to a node or more of scaling without shrinking the lithographic dimensions. If you take this idea a step further to 8 stacked wires you could have a stack of two CMOS pairs. You could also look at stacking layers while relaxing the horizontal width to scale the device density while taking the pressure off of lithography to provide shrinks. This would be analogous to what has been done with 3D NAND.

Of course we also need to look at when this might happen. My best guess is around 5nm at least for the foundries. With the foundries lining up to not use EUV at 7nm or only late in 7nm, if a 5nm solution emerges that doesn’t need EUV how much of a EUV investment are they likely to make. For Intel I am thinking horizontal nanowires might be a 7nm solution but with Intel now on a 3-year node cadence that would put Intel’s 7nm node at around 2020 likely around when the foundries would be introducing their 5nm nodes.

The picture all this paints is that NAND no longer drives the need for EUV by going to a 3D structure and logic also has the potential to move to a 3D structure with relaxed requirements. DRAM scaling has slowed due to device scaling issues and is a leading DSA candidate, so what will drive the need for EUV?

Intel and Micron recently introduced their 3D XPoint memory architecture. Faster and with better endurance than NAND and cheaper than DRAM, 3D XPoint is positioned to be used as Storage Class Memory – a kind of buffer between DRAM main memory and non-volatile storage such as NAND and hard disc drives. The first 3D XPoint memory has 2 memory layers fabricated in the interconnect stack over a logic circuit that controls the memory. We estimate the memory layers take 2 mask layers each and are a 25nm technology requiring multipattering for each layer. 3D XPoint scaling offers the ability to scale by adding layers and also by shrinking the memory layer pitch. If 3D XPoint is scaled simply by adding memory layers EUV might not be interesting. If 3D XPoint were to begin scaling pitch, EUV would become attractive. With 3D XPoint not expected to be in production until 2017 and then needing to become established in the market it is hard to envision 3D XPoint successfully driving EUV adoption.

This is of course just one possible scenario for the direction of semiconductor technology but clearly while we have been waiting for EUV the industry has been moving forward on other fronts. Multipattering also continues to get better and cheaper. By 2018 when EUV is currently projected to be ready for production it is possible the evolution of semiconductor devices may make it unnecessary.


TSMC and Flex Logix?

TSMC and Flex Logix?
by Daniel Nenni on 03-26-2016 at 7:00 am

There was a lot to learn at the TSMC Technical Symposium last week, in the keynotes for sure but also in the halls and exhibits. Tom Dillinger did a nice job covering the keynotes in his posts Key Take aways from the TSMC Technology Symposium Part 1 and Part 2 but there was something interesting that many people may have missed in the exhibit hall.

As you may know this event is invitation only and that includes the companies who exhibit. To exhibit you must have a formal relationship with TSMC and more importantly with TSMC’s top customers so it interesting to see new companies in the exhibit hall and speculate why they are there.

The most interesting new company exhibiting this year in my opinion was Flex Logix Technologies:

FLEX LOGIX ANNOUNCES PROGRAM FOR FAST-TRACK EVALUATION AND PROTOTYPING
Reconfigurable RTL Enables One Design to Serve Varying Customer Requirements

“Architects, front-end designers and physical design teams all need to become familiar with this new technology for applications from MCU to IOT to Networking and more. Like with any technology, it is best to learn by doing and starting simple,” explained Flex Logix CEO and co-founder Geoff Tate. “This new program allows customers to fully evaluate EFLX in detail and in silicon at very low cost.”

Geoff Tate and Andy Jaros manned the booth (Andy and I worked at Virage Logic together years ago). Talking to both the CEO and VP of sales was a great opportunity to understand the Flex Logix value proposition so here it goes:

More and more companies are trying to build flexibility into their SoC designs. The traditional approach has been to overdesign an SoC or functional block to try and anticipate all possible requirements and simply select an option: blow a fuse, spin a metal mask, or make a bond out option to “personalize” a particular chip for a customer or market application.

The theory goes, with advanced process nodes, gates are “cheap”, so this design philosophy is easily justifiable. But what is not cheap are the mask costs not to mention the engineering and validation cost. And there’s the cost of missing a market window if a spec changes or a customer decides they want to tweak a custom built hardware accelerator because their algorithm changed or they want to modify the pinout due to system constraints.

As market requirements and customer demands are changing even more rapidly, designing SoCs with more flexibility in mind is making more and more financial sense. Even if it uses a few more “cheap” gates, can save money on multiple tape outs, and helps keep up with changing requirements.

It requires a slightly different approach to designing chips of course and Flex Logix has the right idea with their Fast Track program to help architects and designers experiment with adding more flexibility to their projects. Additionally, the ability to have one die that can be retargeted to multiple applications improves ROI.

Additionally, the ability to upgrade features in the field, in system, offers the possibility of a new revenue stream: providing optional upgrades that permit better, faster operation. Often the alternative is to fall back to emulation in software which can suck up a lot of processor bandwidth (not to mention power) that can be used elsewhere.

For more detailed information, Don Dingee is our embedded design expert and he has written about Flex Logix twice thus far. Or you can give Andy a ring, he is always good company for a coffee or lunch.

Creating a better embedded FPGA IP product

Reconfigurable redefined with embedded FPGA core IP

FLEX LOGIX, founded in March 2014, provides solutions for reconfigurable RTL in chip and system designs using embedded FPGA IP cores and software. The company’s technology platform delivers significant customer benefits by dramatically reducing design and manufacturing risks, accelerating technology roadmaps, and bringing greater flexibility to customers’ hardware. Flex Logix recently secured $7.4 million of venture backed capital. It is headquartered in Mountain View, California and has sales rep offices in China, Europe, Israel, Taiwan and Texas. More information can be obtained at http://www.flex-logix.com


10nm SRAM Projections – Who will lead

10nm SRAM Projections – Who will lead
by Scotten Jones on 03-25-2016 at 12:00 pm

At ISSCC this year Samsung published a paper entitled “A 10nm FinFET 128Mb SRAM with Assist Adjustment System for Power, Performance, and Area Optimization. In the paper Samsung disclosed a high density 6T SRAM cell size of 0.040µm[SUP]2[/SUP]. I thought it would be interesting to take a look at how this cell size stacks up to 6T SRAM cells we have seen to-date and some projections for what other companies 10nm 6T SRAM cell sizes might be.

[TABLE] align=”center” border=”1″
|-
| style=”width: 71px” |
| style=”width: 60px” | 45nm
| style=”width: 60px” | 32nm/
28nm
| style=”width: 66px” | 22nm/
20nm
| style=”width: 66px” | 16nm/
14nm
|-
| style=”width: 71px” | Intel
| style=”width: 60px” | 0.3460
| style=”width: 60px” | 0.1710
(32nm)
| style=”width: 66px” | 0.0920
(22nm)
| style=”width: 66px” | 0.0588
(14nm)

|-
| style=”width: 71px” | Samsung
| style=”width: 60px” | 0.3700
| style=”width: 60px” | 0.1490/
0.1200
| style=”width: 66px” | NA
| style=”width: 66px” | 0.0640
(14nm)
|-
| style=”width: 71px” | TSMC
| style=”width: 60px” | 0.2420
| style=”width: 60px” | 0.1270
(28nm)
| style=”width: 66px” | 0.0810
(20nm)

| style=”width: 66px” | 0.0700
(16nm)
|-

6T SRAM cell size versus node (µm[SUP]2[/SUP]).

Looking at this data you can see that at 45nm and 20nm TSMC led and at 28nm Samsung led (the leaders at each node are in bold). At 16nm TSMC chose to take a conservative approach and leverage their 20nm process pitches for their first FinFET resulting in a larger SRAM cell size than would otherwise have been expected. Intel very aggressively scaled their process and took the lead.

I have taken 6T SRAM cell size data for Intel back to 130nm, Samsung back to 90nm and TSMC back to 130nm and plotted SRAM cell size versus node. Using a power law to fit the curves the R[SUP]2[/SUP] values are >0.98 for Intel and TSMC and >0.97 for Samsung clearly indicating a very good fit. Using the resulting equations, I have projected Intel and TSMC 10nm 6T SRAM cell sizes. For Intel I project a 6T SRAM cell of 0.0284µm[SUP]2[/SUP] and for TSMC of 0.0238µm[SUP]2[/SUP].

Assuming TSMC returns to their historical SRAM trends they will once again have the smallest SRAM cell size. This may be optimistic because Intel is expected to have a smaller contacted gate pitch and minimum metal pitch than TSMC at 10nm. In fact, we expect TSMC’s 7nm process to have similar pitches to Intel’s 10nm process. We should note here that TSMC is expected to begin ramping 10nm at the end of 2016 and they are targeting the end of 2017 for a 7nm ramp. With Intel delaying 10nm to 2017 TSMC’s 7nm and Intel’s 10nm may be ramping around the same time.

The bottom line is based on my analysis the Samsung 10nm 6T SRAM cell size looks significantly larger than what I would expect from Intel and TSMC.


Key Takeaways from the TSMC Technology Symposium Part 2

Key Takeaways from the TSMC Technology Symposium Part 2
by Tom Dillinger on 03-22-2016 at 4:00 pm

In Part 1, we reviewed four of the highlights of the recent TSMC Technology Symposium in San Jose. This article details the “Final Four” key takeaways from the TSMC presentations, and includes a few comments about the advanced technology research that TSMC is conducting.
Continue reading “Key Takeaways from the TSMC Technology Symposium Part 2”