DAC2025 SemiWiki 800x100

CyberEDA adds a Transistor-Level Debugger

CyberEDA adds a Transistor-Level Debugger
by Daniel Payne on 06-13-2011 at 6:34 pm

Intro
I met with CK Lee, founder of Cyber EDA at his booth on Monday evening in San Diego. Last year I learned about their new SPICE circuit simulator named PCSIM, this year the new product is called ADDS-Debugger.



Notes

2010 – Announced a debugger

2011 ADDS Debugger – trace at the transistor level your design
– Signal tracing
– Post-layout debug

o Tracing – which signal triggered that net that rose or fall?
o Trace back to a Primary input
o Cross-probe between auto-generated Schematic and the waveforms
o Pricing: $50K per year
– ADDS Wave
o Pricing: $2K per year
– PCSim
o Pricing: $25K per year
o Post-layout simulation speed improved, 3X
o True SPICE simulator, flattened
o Compete with HSPICE, SPectre, ELdo

Customers – Ali (Taiwan)
– Extreme DA

Based: Santa Clara, CA

Employees: 10

Next Year: Double revenues

Sales: Direct mostly, some distributors

Summary
ADDS Debugger reminded me a lot of what Concept Engineering has been offering for several years now in a transistor-level debugger. What makes this different is that you’re inside of a circuit simulation run when you can visualize the netlist as a schematic and see the node voltages and branch currents.

The SPICE circuit simulator market is crowded with many EDA vendors (Synopsys, Mentor, Cadence, Magma, Berkeley, Tanner EDA), so Cyber EDA has to do something special in order to make PCSIM stand out from the crowd (speed, accuracy, capacity, features).


QuickCap for IC Extraction at DAC 2011

QuickCap for IC Extraction at DAC 2011
by Daniel Payne on 06-13-2011 at 6:08 pm

Intro
John and Ralph from Magma gave me an update on QuickCap at DAC on Monday afternoon in their demo suite.

Notes
John Schritz – Sr AE
Ralph Iverson, Ph. D. (wrote QuickCap)

John Schritz
– Digital Signoff, extraction
– QCP: 2.5D RC for full ASIC designs
– QuickCAP NX: 3D field solver
– QCP:

Demo – 1.5 million instance design, 1.59 million nets, fully P&R, three libraries
– QCP: Gate Level (Star RC competitor)
– QuickCapNX: 3D field solver (Raphael competitor)
– QCP TX: Transistor Level (RC XT competitor)
– QCP
o 10X faster than Star RC
o 20X faster with multi-corner extraction
o Accuracy: +-2% of QuickCap NX
o TSMC qualified at 28nm
o Inputs: LEF, DEF, GDS II
o Scaling: can add multiple corners using only 20% run time, can be distributed across more boxes
o Capacity: 50M instance designs, memory efficiency
o Accuracy (vs Star RC)
o Example: TSMC 40nm, 38K nets
o 1 corner at 5 hrs, 12 corners at 8.4 hours (27X faster than Star RC), using 3.3GHZ CPU, 148GB and 12 CPUs
o One license per four cores (after that add a multi-core to add up to 32 cores)
o 16 Synopsys licenses for a runtime of 60 minutes, vs 2 licenses in 34 minutes
o ½ the run time, ¾ the HW, 1/8 the licenses (compared to Synopsys with 80% market share)
o 3D on demand (add QuickCap NX), name the nets you want best accuracy on
– Tekton – a STA tool
o Reads SPEFS
o Fast STA in minutes
o Concurrent MM/MC
o Accurate with SPICE integration
– Old flow
o Implement, output a GDS II
o Extract, output SPEF
o STA
– New Flow
o Implement
o QCP and Tekton
– QuickCap NX: Industry Golden 3D Field Solver
o Same run time independent on
o Three methods:
 Finite Element (Slowest)
 Boundary Integral (Slow, what Mentor 3D XACT uses)
 QuickCap (Fastest)
o Field-solver accuracy (within 1% of Silicon)
o Good for library characterization, custom, analog
o Flow Inputs: ITF or iRCS from foundary, GDS II layout
o Industry standard solver at IDMs, Foundries
o Edge effects
o Qualified by TSMC
– QCP TX – transistor level extraction
o Millions of transistor capacity
o To be released in 6 months
o For: Custom design, memory, AMS
o CCI – Calibre Connectivity Interface, stress parameters, well proximity
– QCP Demo
o 18 million nets extracted per hour on 8 threads

Summary
Star RC (Synopsys) has plenty of competition from Magma, Mentor and Cadence for IC extraction. I also learned more about 3D Field Solvers on Tuesday at the Pavilion Panel session.


HSPICE gets Faster, better Convergence

HSPICE gets Faster, better Convergence
by Daniel Payne on 06-13-2011 at 5:53 pm

Hany El Hak – Product Marketing Manager

Frederik Iverson – AE

Scott Wetch – HSPICE Architect

HSPICE – 5 years ago convergence was not so good, while 95% of analog circuits today converge out of the box, no options are required.

Synopsys AMS Portfolio – wide range of tools
– Custom Designer: IC schematic and layout tools
– HSPICE – circuit simulation
– CustomSim – full chip circuit simulation
– IC Validator/Star RC – extraction, DRC, LVS

HSPICE – Golden SPICE standard for about 30 years now, the same tool the foundry used to create their cells
– Used in IC, AMS, PCB for SI

Issues – Long run times, could be days to cover all of the corners required, more runs

Improvements in HSPICE – 5X faster than 2007 on a single core, averaged over 100’s of circuits
– Precision Parallel, 7X faster when using 8 cores compared to a single core
– 10X more capacity, about 10 million elements
– New Analysis added: High Frequency, Statistical Eye Diagrams, Transient Noise, Loop Stability
– Convergence, 95% out of the box, no settings required
– Distributed Processing (MC, Corners, Sweeps) – distribute over the network (17X on 20 CPUs)

HSPICE Precision Parallel – run times reduced from days to hours
– Up to 7X faster on 8 cores (use –hpp option)
– PLL runs now reduced to 4 hours instead of days
– PLL example with 7K MOS and RC, 12.5 hours (148 hours, competitor)
– Clock Tree with 10M elements, 7 hours (108 hours, competitor)
– Sigma Delta Converter, 7.7 hours (16 hours, competitor)

HSPICE Distributed Processing – divide and conquer (MC, corners with .ALTER, sweeps)
– 10 CPUs at 8.7X faster
– 20 CPUs at 17.3X faster (some overhead to collect all that data)

Post-layout – selective net back-annotation (use parasitic only where needed)
– Check and find only the active nets for extraction (automatic or manually identified)
– Apply parasitics only to critical nets that are identified

Transient Noise Analysis – Include noise in time domain simulations (about 2X to 3X slower than transient)
– Full nonlinear analysis of noise effects in the time domain
– All devices are taken into account (thermal noise, channel noise, flicker noise)
– Today a single CPU, in September see the parallel version

Custom Designer (Schematics and Layout)
– HSPICE integrated within

Frederick Iverson, demo of Duty Cycle Corrector (40nm node, Used in IP group for USB 3.0, 450 analog designers at SNPS)
– Custom Designer uses OA for a db. Any circuit simulator can be added to Custom Designer.
– Normal simulation is 5 minutes to complete, Precision Parallel completes in about 1 minute
– Command line has Tcl, so it’s easy to save and re-run commands.
– Si2 is showing how to run tools with many languages: Tcl, Perl, Ruby, etc.
– Plot of simulation results shown in WaveView tool, measurement tool to show % duty cycle.
– HSPICE uses one license for two threads, so use it at no extra cost
– Back annotate a full netlist or a partial netlist (used a DSPF file from Star RC)
– Names in HSPICE are the pre-layout names even with back-annotated values
– Transient noise demonstrated, the output does show jitter, wave view shows jitter versus time, histogram shows standard deviation on jitter values

Summary
HSPICE has to continually improve in order to stay current with Berkeley DA, Eldo, Spectre and FineSIM circuit simulators.


Physical IP Group at ARM

Physical IP Group at ARM
by Daniel Payne on 06-13-2011 at 5:45 pm

After lunch on Monday I met with John Heinlin, Ph.D. – VP Marketing of Physical IP Division

Back in the day I knew the founders of Artisan (VLSI Libraries) when we worked together at Silicon Compilers (Mark Templeton, John Malecki, Scott Becker).

Q: Do you favor any EDA tools for creating your IP?
A: No, we don’t really endorse a specific EDA vendor tool or flow.

Q: What’s new at ARM for 2011?
A: Just started Process Optimization Packs. Linking IP that is tuned for ARM cores.

Artisan – Generic cells. Used to wait for a node to become stable.

ARM – Focused on leading edge cells, now at 28nm nodes. Now doing test chips quite early to tweek and optimize cells, as requirements into the process.

2010 – 32nm last year announced.

2011 – 28nm IP is now ready as soon as announced. TSMC, SMIC, Samsung, IBM, GlobalFoundries (CP), TSMC.

Q: How to reach Power Performance and Area for your design?
A: We have a User Guide on how to get best results.

Q: Any preference for PDKs?
A: Support whatever PDK is available, no preference.

Synopsys – renewed tools agreement for a period of time.

Cell Library – about 1,000 cells to choose from, with about 100 different functions. Simple gates, flip-flops (power saving modes)

Q: What is your royalty model?
A: Royalty Model – same as always, no upfront costs. Foundries pay for library development.

Memory compilers cells – Standard RAMS, then lower VDD RAMs for an extra price. Typically 5 to 7 compilers per node.

Foundries – Some are offering their own libraries.

ARM Artisan IP (re-using)

Memory Compilers – Virage (Synopsys) has a piece of it. ARM invests heavily in these at leading edge nodes. Area efficient nodes too.

SOI – French based (SOISIC), acquired about 5 years ago, used at IBM.

Physical – GP IOs, 40nm Low Power. Will add DDR libraries soon. Have controller, PHy and IO in one piece.

Q: What do you think of Intel’s TriGate?
A: FinFET – At 20nm planar CMOS still works, 14nm then FinFET looks viable.

Q: What is success for ARM?
A: Success – We think rich solutions across a wide range of foundries at 28nm now, 20nm soon (test chips with Samsung).

Q: How do you use RDR?
A: RDR – We’ve been dealing that for a few nodes now (20nm needs double patterning). We work closely with foundries for all nodes.

Common Platform – ARM is the IP you get.

Q: Where do I find out more?
A: Visit our SOC Design Community


FineSIM adds RF Analysis plus new Tcl Circuit Checks

FineSIM adds RF Analysis plus new Tcl Circuit Checks
by Daniel Payne on 06-13-2011 at 4:58 pm

At DAC I spent time in the Magma FineSIM demo suite on Monday morning.

Greg Curtis – Product Director, Custom Design Business Unit

– Talus for Digital Design
– FineSim does: SPICE, FastSPICE, Characterization
– Flows Demoed at DAC: High Performance Core, SOC, ASIC/ASSP, AMS, Memory
– What’s New in FineSim?o RF
o TCL circuit checks
– ADC design trendso Parasitics dominate performance now
o Sensitivity to noise and cross talk
o Operating at low power
o Requires more SPICE, more corners, more analysis
– Cell phone trendso 10+ radios per phone
o Maximum battery life
– FineSim SPICEo Multi-cpu and multi-machine capability
o Standard netlist inputs (HPSICE, ELdo, Spectre)
o ER and IR drop analysis
o Co-simulate with Verilog and VHDL
o New:
 Tcl based circuit checks (interactive simulation)• Customized check that you write to check currents, voltages, timing
• Set breakpoints
• Demo: find all outputs above 2.6V
• Demo: At 5th rising edge, wait 25ns, capture signals, check sum of currents, trigger if >50uA used
• Could write Tcl code to calculate the jitter of a PLL real time during simulation
• Technology could traverse a netlist similar to Calibre PERC, but not there yet
 RF Analysis (Harmonic Balance, Periodic AC Analysis, Periodic Noise, Periodic Transfer, Oscillator Phase Noise)• Multi-CPU capability
• Due in November 2011
• Beta in September 2011
• A new option to FineSim SPICE
• Demo of a 9 stage ring oscillatoro One CPU run first, 25 seconds
o Four CPUs run next, 12 seconds
o FineWave used to show the waveform results
o AC wave, phase noise plot shown
 EM/IR Analysis• Titan is being used for EM/IR Analysis
• Demo of NAND gates in a layout and schematic
• Titan uses the Analog Simulation Environment (AVE)
• EM/IR analysis results viewed on top of the layout using color codes, layout zoomed in when clicking on a specific EM/IR result
• Capacity is a few million MOS devices
o 1.5X Faster than Berkeley (PLL)
o 1.4X faster than Cadence APS (Sigma Delta Modulator)
o 5X faster than Eldo
– FineSim PROo Multi-rate engine, hierarchical processing (but not hierarchical simulation)
o Same features as FineSim HSPICE
o Capacity example had only 4.6M devices
o 7.9X to 16X faster than NanoSim, Hsim,XA (under 2M MOS devices)
Improvements:
– 2011.04 release versus 2010.08o 1CPU, 7.5% less memory. 1.23X run time improvement
Foundry Support
– TSMC, GlobalFoundries, TowerJazz, Lfoundry

Customer Usage:
– 12 of top 20 Semiconductor companies using FineSIM
– Analog Bits, 10X speed up wit FineSim
– ESNUG user: 3X to 10X over HSPICE

Anand Ganesan – Senior Product Engineer (demonstrations)

Summary
FineSIM catches up to HSPICE, Eldo and Spectre in simulating RF circuits.

The Tcl Circuit Check looks similar to the technology in Mentor’s Calibre PERC. Let’s see if they create a product to detect ESD best practices like PERC does.


Nimbic (formerly Physware) – 3D Field Solver in the Cloud or Desktop

Nimbic (formerly Physware) – 3D Field Solver in the Cloud or Desktop
by Daniel Payne on 06-13-2011 at 12:57 pm

I met with Bala Vishwanath, CMO at Nimbic on Monday morning. They had just announced a $6.9M round of venture capital which is something that you rarely hear about in EDA these days, especially during a slow economic recovery.

Intro

Physware – served the package and board markets, co-design challenges (can add IC noise sources). As complexity increased customers wanted more capacity in compute, how about unlimited capacity using the cloud ?

What’s New
Parallelization – both distributed and multi-core. The cloud can handle this configuration, always accurate with full 3D field solving. Product name is nCloud.

Still have enterprise tools on your desktop or on a compute farm.

Cloud – can pay by the hour, or mixed with enterprise.
SaaS – pay by the hour.

Pricing models – by the hour, subscribe annually on the cloud. If Enterprise is 100% , then the could would be double the cores to 8 for the same price.

Security – Even Nimbic cannot see the data. Each HW is separated per client, secure channels used with RSA key pairs in transit (data at rest, on dedicated HW, encrypted Disk, key lives in the instance, all the virtualized layers come from Amazon, then we add on top of that).

Private Cloud – sure that could be used. The Amazon API has been broadly used.
Cloud machines – they reside in Virginia, but could be located anywhere they want.
Managers – can setup monthly budgets to not exceed
Venture financing – $6.9M in Series B, existing Investor and one new one added.

Runing Ubuntu Linux in the cloud right now.

Plan to add more of a flow management system.

How long is your data there? Not archival duration. Results stay there as long as you want. Don’t consider it archival. S3 = Simple Storage Service.
Chile, development (Founder from Chile)

Partnership release with Amazon to be coming.
Nimbic GUI (nWave) – running on Windows, 10 Amazon machines with 8 cores each. GUI also runs on Linux.

Summary
Nimbic re-invents itself by offering a 3D Field solver in the Cloud and securing new venture funding. The trend of EDA companies offering their tools in the Cloud continues at DAC 2011.


The Secret of Analog Design

The Secret of Analog Design
by Paul McLellan on 06-09-2011 at 5:15 pm

Everybody knows that digital designers run on pizza and soda, what one might describe as poor food and weak drinks. At DAC in San Diego I discovered a restaurant that gave away the secret to analog design. And you thought it was a good layout editor and a good circuit simulator. But it turns out that the secret to analog is good food and strong drinks.


Magma, ARM, GLOBALFOUNDRIES

Magma, ARM, GLOBALFOUNDRIES
by Daniel Payne on 06-06-2011 at 11:07 am

Introduction
Monday morning at DAC I attended the breakfast presentation from Magma, ARM and GLOBALFOUNDRIES. The 28nm node is ready for business using Magma tools and ARM libraries.

During breakfast I met Karim Arabi, Ph.D. from QualComm. He’s a senior director of engineering in San Diego and wanted to learn more about the 28nm node and how Magma tools could be used in a flow.

Notes

Rod Metcalf – Magma, 28nm Reference Flow Development, using the Talus flow manager for the entire IC design flow. We ran a testcase to ensure silicon validation. Goal is to use the Magma flow on a 28nm Global Foundries process and get 1st silicon success.

– Use the existing libraries which are characterized at specific corners, or create your own corners.
– They used an OpenCore design as their reference example.
– ARM supplied the 28nm libraries, standard cells, memories. Multi Vt libraries were used.
– Talus from Magma: The RTL-to-GDS II tool flow. Offers high capacity, silicon proven at 28nm, high speed designs.
– Low power technologies used, CPF and UPF.
– Talus Flow Manager: a way to capture a complete tool flow. Analysis results are in HTML, easy to read or email complete with links for more details. Pie charts, timing violations.
– Talus Flow Manager – the reference flow provides correct setups for all tools needed, much less work for users to

GLOBALFOUNDRIES – Our largest competitor reports their test chips as tape-outs, we don’t count those as tape-outs.

Summary: Flow of Magma tools and ARM libraries on the GLOBALFOUNDRIES 28nm process are ready to go now, proven, validated.

Jim Ballingall, Ph.D. VP Marketing – GLOBALFOUNDRIES uses high K metal Gate (HKMG). Photo of a quad-core CPU with a GPU from AMD producing 500Gflops performance (beats Sandy Bridge from Intel).

– Ramp of HKMG process is going well. Gate first HKMG approach (Intel uses Gate last HKMG). Intel is using FUD, don’t believe it.
– Market requirements for foundry customers.
o 28nm SLP (Super low power) – low mask count, no stress engineering for lowest costs
o 28nm HPP (High performance process) – uses stressors, >3GHz performance,
– Gate first can be 10 to 20% smaller std cells with Gate First compared to Gate Last approach
– CPU speed versus cost: SLP is lower clock speed and lower cost, while HPP is higher speed and higher cost

Global Solutions – an ecosystem with EDA tool vendors, mask making, assembly. A successful program.
– Design Kits for 28nm SLP are ready now, HPP coming soon.
– Investment of $5.4B in 2011 for GLOBALFOUNDRIES.

Common Platform Partners – IBM, Samsung, ST (Fab synch agreement)

Dresden Fab 1 – plan for 1 million wafers per year at 45nm and below nodes

Fab 8 in NY – shell is complete, equipment moving in, on schedule. 60K wafers/month.

Multi Program Wafers (MPW) – run every quarter, fully subscribed, allow lower costs.

20nm – In development now, partnered with IBM. Results due in 2012/2013. First shuttles started in 2011 Q4.

Q&A

Q: What about fin FET?
A: Looks like a 14nm technology to us.

Q: Is 20nm Gate Last?
A: Yes, that’s true. We choose that for fewer design rules. Gate First was good down to 28nm. At 20nm litho dominates and we choose Gate Last.

Q: What about DFT?
A: We partner with companies like SynTest for DFT tools, they can be integrated into our Talus Flow Manager like any Magma tool.

Largest DAC Banner
Yes, it’s the number 1 EDA company, Synopsys:


Sunday Night at DAC

Sunday Night at DAC
by Daniel Payne on 06-05-2011 at 7:23 pm

San Diego Arrival
It’s another picture perfect day in San Diego as I arrived and checked into the Hyatt. The view from the 40th floor looked magnificent, with the Convention Center just a few minutes away in the distance:

Registration

Check in at DAC is quite automated and it took only a minute to receive my official badge with an Independent Media sticker.

EDAC Party
Tonight is the kick-off event first at 6PM sponsored by EDAC and it’s the annual networking event of the year for us in EDA.

Here I met dozens of former co-workers and EDA clients.

Kathryn Kranen (CEO of Jasper) tried valiantly to address the crowd however the din of networking was just too great. Next year I would try just projecting the key messages on a wall, and let them cycle through instead of live speeches.

Gary Smith
Next up is the 7PM event from Gary Smith EDA. I’m really hoping that these two events are in different rooms so that we may actually get to hear over the roar of networking in the back of the room.

We enjoyed a separate meeting room at Gary Smith’s DAC event on Sunday night. 4G signal coverage was not available with my CLEAR device, so no tweeting this year. Others nearby had zero cell-phone signal as well.

Mary Olsson – 3D packaging. Qualcomm is a big user of TSV and 3D packaging.

Current State of Technology
– TSV: not really cheaper, faster or smaller
– Whatever happened with Multi Chip Modules (MCM)?
– 3 fabs are ready at 200nm now, 300 mm ready in 5 years (too much stress and cracking)
– Today: Mostly memory stacking, wafer stacked SOC, image sensors

3D TSV
– Mostly research and University oriented still
– Bleeding Edge Drivers: Intel, Samsung, Qualcomm, TI, STM, Altera
– Xilinx, Nokia, Nvidia, Broadcom, Cisco
– Tessera, Tezzaron, eSilicon

Standard Drivers : GSA, JEDEC, Si2 Open3D

TSMC Revenue by Technology – 1st quarter numbers, 65% of revenue for 90nm to 40nm nodes, mostly standard packages
– 35% of revenue from above 135nm nodes and larger

Top IDMs and Foundries: Where is their capacity?
– Sweet spot is between 45nm and 60nm
– Intel/Samsung vs everybody else
– Qualcomm driving the bleeding edge of 3D packaging technology

TSMC Foundry Drivers
– 48% communications
– 23% consumer
– 18% industrial
– 11% industrial

PCs – shipped 379M units in 2010

iPad (Tablets) – 136m units in 1Q2011

2015 Time frame – next big push from emerging nations to start consuming personal electronics

What is true 3D/TSV?
– Today we see 2.5 and 3D TMV (stacked die for memory, SOC, SiP, Flip Chip, PoP, Cu pillar bump, many via types

AMKOR photos of molded via, package on package techniques

EDA – Sigrity (Orbit10 Planner), Micro Magic (Max 3D), Apache (RedHawk, Sentinel)

What to look for – OSATs, eSilicon, Tessera, Tezzaron

RF challenges

3D Summary

Read the GSA Guide (Herb Reiter)

Follow the money – OSATs spend money on copper wire bonders, new test equipment

Watch those hiring – Intel, Qualcomm

Gary Smith

EDA 2Q 2011 Forecaast
$4896 2011
$5325 2012
$5671 2013
$5885 2014
$6612 2015

The Semiconductor Infrastructure
– The semi cycle: VC funding, EDA, embedded sw, compute power, process r&d, semi equipment, foundry, packaging
EDA – tools to enable design (at a cost that allows operating at a profit)

Costs – Design companies cost more than EDA tools (embarrassed by low price of EDA tools)

Cost of SOC Design – Keep the cost of a new SOC below $25M will attract VCs again.
– Costs above $50M hurts even IDMs

ITRS Cost Chart for 2010

Is EDA affordable?

Design Teams – 100 to 200 engineers
– For 104 million gates (should be about 30 HW design engineers, costing $18.7m)
– 160 SW engineers cost $56M, total of $75M

Cost Curves for HW Design Teams (Needs to be kept under $25M)

Ideal Number of Blocks: 5
– Most designs have 25 to 35 blocks (more blocks slows design process down)

What is Block Size?
– 100 million gate (90 million gates on platform, 10 million left to design)
– Number of engineers per block 6 engineers

EDA Tools Handle gates?
– 4 million minimum, 20 million needed

At 22nm – need 88m gates capacity
At 16nm – need 177m gates
At 11nm – need 354m gates

R&D Engineers – do you know how design uses the EDA tools?

How to Follow Moores Law?
– Productivity tools, keep close to design engineers
– Get involved in sw productivity

EDA History
– Cadence as technology leader 1990 to 1997
– 2004 to 2007 Fister’s Folly as
– 2003 SNPS is #1 technology after Avant! Acquisition
– 2008 and 2009 SNPS as #1 tech leader
o Mentor and Magma still close in technology
o Who handles 22nm design? Large or small companies?

Summary
The mood at DAC this year is optimistic, there’s talk of hiring and people are eager for the Keynote address on Monday and the flurry of events planned. I’ll be blogging each day as I visit EDA vendors that offer transistor-level tools like: SPICE, DRC, LVS, 3d extraction, Custom IC layout tools, and more. On Tuesday I’ll be at the Panel Session on “3D Extraction: Coming to a Design Near You?