Semiconductor Stocks and the AI Wave: Separating Signal from Hype

A rigorous look at the semiconductor investment landscape: NVIDIA's moat, the TSMC bottleneck, HBM dynamics, and how to value chip companies in an AI cycle.

Tech Talk News Editorial · 8 min read
#semiconductors #investing #ai #nvidia #tech-stocks

NVIDIA's run has made everyone a semiconductor analyst, which means the easy money has probably already been made. The stock went from interesting to consensus, and consensus trades are rarely where you generate alpha. The more interesting questions right now are about what comes next: custom silicon from hyperscalers, the inference versus training shift, and the memory bottleneck that most investors aren't paying enough attention to.

Semiconductors are the most important industrial sector of the next decade. Most investors don't understand them well enough to size positions confidently. That's worth fixing. Let's walk through the full picture.

The Semiconductor Value Chain: Four Distinct Businesses

"Chipmaker" describes four structurally different businesses, each with different margins, capital intensity, and competitive dynamics. Treating them as interchangeable is how you end up with a confused portfolio.

  • Fabless design companies (NVIDIA, AMD, Qualcomm, Broadcom) design chips and outsource manufacturing. Asset-light, high gross margins (50-70%), but entirely dependent on foundries for production capacity. The intellectual property is in the architecture. The competitive moat is in design talent and software ecosystem.
  • Integrated Device Manufacturers (IDMs) (Intel, Samsung, Texas Instruments) both design and manufacture chips. High capital intensity, cyclically variable margins, but control over manufacturing timelines and yields. Intel's struggle to maintain process leadership while carrying $25B+ in annual capex is the defining case study of the IDM model's challenges.
  • Pure-play foundries (TSMC, GlobalFoundries, SMIC) manufacture chips for fabless customers. Enormous capital intensity ($30-40B annual capex for TSMC), but relatively stable demand over cycles because they serve all fabless customers. TSMC earns 50%+ gross margins at leading-edge nodes where it has no peer.
  • EDA, IP, and materials companies (Synopsys, Cadence, ASML, Applied Materials) provide the design tools, photolithography equipment, and process materials that the entire supply chain runs on. ASML's EUV monopoly is arguably the highest-quality business model in the sector: one supplier for the equipment that enables every advanced chip in the world.

NVIDIA's Moat: What It Is and What Threatens It

NVIDIA's competitive position is frequently described as "GPU dominance," but that understates the actual source of the moat, which is CUDA. The CUDA programming model and its associated libraries (cuDNN, NCCL, Thrust), framework integrations (PyTorch, TensorFlow), and developer tooling have been accumulating since 2007. Every major AI research lab, every hyperscaler ML platform, and virtually every production AI system has been built on CUDA primitives. The switching cost isn't the hardware. It's the accumulated codebase, the retraining of engineers, and the loss of CUDA-specific optimizations.

The realistic threats to NVIDIA's position are not from AMD ROCm or Intel oneAPI in the near term. Despite genuine technical progress, software adoption lags hardware performance improvements by 3-5 years. The credible threat vectors are:

  • Custom silicon from hyperscalers: Google TPUv5, Amazon Trainium/Inferentia, Microsoft Maia, and Meta's MTIA are purpose-built accelerators that cut NVIDIA out of the economics of specific workloads. These chips won't be sold externally, but they reduce NVIDIA's total addressable market as hyperscalers move a fraction of their training and inference workloads off H100/H200 clusters.
  • Inference market economics: training is a small fraction of total GPU-hours. Inference at production scale is the much larger market. Inference workloads are more amenable to custom silicon because the model weights are fixed and the computation graph is predictable. NVIDIA's H100 is over-provisioned for many inference tasks, a gap that AMD, Groq, and Cerebras are targeting aggressively.
  • Export control escalation: the A800/H800 China market, already restricted, could be further tightened. China represented roughly 20-25% of NVIDIA's data center revenue before restrictions. Incremental tightening reduces the addressable market for a meaningful revenue segment.

NVIDIA's moat is real and durable, but investors pricing the stock at 30x forward earnings are pricing in years of dominance across both training and inference markets. The inference market will be more competitive than the training market, and inference is where AI compute demand will compound most aggressively over the next decade.
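One way to see what "30x forward earnings" embeds is a simple reverse exercise: pick a holding period, an exit multiple, and a target return, and solve for the earnings growth required. This is a back-of-envelope sketch; the entry multiple comes from the text above, while the exit multiple, horizon, and target return are illustrative assumptions, not forecasts.

```python
# Reverse-multiple exercise (all numeric inputs are hypothetical).
# Paying `entry_pe` today with a re-rating to `exit_pe` after `years`
# requires earnings growth `g` to still deliver `target_return`.

def required_earnings_cagr(entry_pe, exit_pe, years, target_return):
    """Annual earnings growth needed so price CAGR equals target_return.

    Price_0 = entry_pe * E_0 ;  Price_T = exit_pe * E_0 * (1+g)^T
    (1 + target)^T = (exit_pe / entry_pe) * (1+g)^T  ->  solve for g.
    """
    return ((1 + target_return) ** years * entry_pe / exit_pe) ** (1 / years) - 1

# Hypothetical inputs: 30x entry, de-rating to 20x over 5 years, 10%/yr target.
g = required_earnings_cagr(entry_pe=30, exit_pe=20, years=5, target_return=0.10)
print(f"required earnings CAGR: {g:.1%}")  # roughly 19% per year
```

The point of the exercise: multiple compression alone forces the earnings bar well above the target return, which is what "pricing in years of dominance" means in arithmetic terms.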

The TSMC Bottleneck and Geopolitical Risk

TSMC manufactures approximately 90% of the world's most advanced logic chips (5nm and below) and 60%+ of total advanced logic capacity. This concentration creates a systemic dependency that geopolitical risk has made impossible to ignore. The Taiwan Strait scenario, ranging from economic coercion to military blockade, is no longer a tail risk that sophisticated investors can dismiss. Defense and intelligence analysts place the probability of some form of Chinese action against Taiwan within 10 years in the 10-30% range. The economic consequences of a TSMC disruption would dwarf the COVID supply chain crisis.

TSMC's geographic diversification strategy (Arizona fabs for N4 and N2, Japan fab for N12, Germany fab for automotive-grade nodes) is moving faster than originally planned, driven by customer pressure and government incentives. But the Arizona fabs are operating at higher cost than Taiwan operations. TSMC management has acknowledged 20%+ manufacturing cost premiums for US production. The global chip supply is getting more geographically diverse. It's not getting cheaper.

Investors holding TSMC (TSM) are explicitly accepting geopolitical concentration risk in exchange for exposure to the best secular growth story in technology infrastructure. The correct framework is position sizing, not avoidance: TSM at 5% of a technology portfolio is different from TSM at 25%.
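The sizing logic can be made concrete with back-of-envelope expected-loss arithmetic. The 10-30% probability range comes from the text above; the assumed drawdown in a disruption scenario is a hypothetical input for illustration only.

```python
# Back-of-envelope portfolio impact of a TSMC disruption scenario.
# The 10-30% event probabilities come from the article; the assumed
# drawdown on the position is a hypothetical input.

def expected_portfolio_hit(weight, event_prob, drawdown):
    """Expected portfolio loss contribution from a single tail event."""
    return weight * event_prob * drawdown

assumed_drawdown = 0.60  # hypothetical loss on TSM if the scenario hits

for weight in (0.05, 0.25):       # the 5% vs 25% positions from the text
    for prob in (0.10, 0.30):     # low and high ends of the probability range
        hit = expected_portfolio_hit(weight, prob, assumed_drawdown)
        print(f"weight {weight:.0%}, prob {prob:.0%}: expected hit {hit:.2%}")
```

Under these assumptions the 5% position costs the portfolio 0.3-0.9% in expectation, while the 25% position costs 1.5-4.5%, which is the quantitative version of "position sizing, not avoidance."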

Memory Markets: HBM, DRAM, and NAND Cycles

Memory semiconductors (DRAM, NAND flash) are the most cyclical segment of the semiconductor market. The AI wave has introduced a new memory tier, High Bandwidth Memory (HBM), that has dramatically changed the economics for Samsung, SK Hynix, and Micron. HBM is more supply-constrained than the market appreciates, and that matters.

HBM is 3D-stacked DRAM that sits directly on the GPU package via through-silicon vias, providing 5-10x the memory bandwidth of conventional GDDR6 at a significantly higher average selling price. The H100 GPU ships with 80 GB of HBM3; the newer H200 moves to 141 GB of HBM3e. At current production volumes, SK Hynix supplies approximately 50% of all HBM capacity and earns roughly 3x the ASP of conventional DRAM for each GB shipped. This has structurally improved SK Hynix's revenue mix and gross margins, partially decoupling the company from the commodity DRAM cycle.
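The mix-shift effect on blended pricing is easy to sketch. The ~3x ASP premium comes from the text; the normalized base ASP and the HBM bit shares below are hypothetical inputs, not company data.

```python
# Sketch of how an HBM mix shift lifts blended DRAM ASP, using the
# ~3x ASP premium cited above. Base ASP and bit shares are hypothetical.

def blended_asp(base_asp, hbm_premium, hbm_bit_share):
    """Bit-weighted average ASP per GB across conventional DRAM and HBM."""
    return base_asp * (1 - hbm_bit_share) + base_asp * hbm_premium * hbm_bit_share

base = 1.0  # normalize conventional DRAM ASP to 1.0 per GB

for share in (0.0, 0.05, 0.15):
    print(f"HBM bit share {share:>3.0%}: "
          f"blended ASP {blended_asp(base, 3.0, share):.2f}x baseline")
```

Even a modest HBM bit share moves the blended ASP meaningfully (15% of bits at a 3x premium lifts the blend 30%), which is the mechanism behind the margin decoupling described above.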

The NAND flash market remains deeply cyclical and oversupplied following an aggressive 2022 production ramp. Recovery timelines depend on AI data center storage attachment rates and consumer electronics demand recovery, both of which are improving but from a deeply depressed base. Micron's NAND segment remains a drag on overall margins even as its HBM business outperforms.

The AI Accelerator Market: Training vs Inference

The AI accelerator market isn't monolithic. Training and inference have fundamentally different compute requirements, which drives fundamentally different competitive dynamics. The distinction is critical for understanding where NVIDIA's position is strong versus where it's vulnerable.

Training requires massive parallelism, high-precision arithmetic (BF16/FP32), enormous memory bandwidth (for moving model parameters during backward passes), and fast inter-GPU interconnects for distributed training across thousands of GPUs. NVIDIA's NVLink and NVSwitch interconnect fabric is a genuine competitive advantage here. Competing solutions require rethinking cluster topology from the ground up.

Inference is a different workload: fixed weights, lower-precision arithmetic (INT8, FP8, INT4), latency sensitivity rather than throughput maximization, and often smaller model sizes for edge deployments. This workload profile is far more amenable to purpose-built silicon. Groq's LPU (Language Processing Unit) reports token-generation throughput roughly an order of magnitude above H100-class systems on some benchmarks. Cerebras's wafer-scale engine eliminates memory bandwidth bottlenecks for large models. These companies aren't competing with NVIDIA for training clusters. They're building the inference infrastructure layer for production AI deployment. That's a large and growing market.
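The precision point is worth quantifying: weight memory shrinks linearly with bytes per parameter, which is why low-precision inference fits on cheaper, smaller-memory silicon. The model size below is a hypothetical example; the bytes-per-parameter figures are standard for these formats.

```python
# Why lower precision matters for inference: weight memory footprint
# of a hypothetical 70B-parameter model at common numeric formats.

BYTES_PER_PARAM = {"FP32": 4.0, "BF16": 2.0, "INT8": 1.0, "INT4": 0.5}

def weight_footprint_gb(n_params, bytes_per_param):
    """Gigabytes needed just to hold the model weights."""
    return n_params * bytes_per_param / 1e9

n = 70e9  # hypothetical model size
for fmt, b in BYTES_PER_PARAM.items():
    print(f"{fmt:>5}: {weight_footprint_gb(n, b):6.0f} GB of weights")
```

At FP32 the weights alone (280 GB) exceed any single accelerator's memory; at INT4 (35 GB) they fit comfortably, which is one concrete reason training-grade hardware is over-provisioned for many inference tasks.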

CHIPS Act and Domestic Fab Economics

The CHIPS and Science Act allocated $52.7B for semiconductor manufacturing and research in the US, with $39B in direct fabrication subsidies. Intel received $8.5B in grants and up to $11B in loans. TSMC's Arizona operations received $6.6B. Samsung's Texas fab received $6.4B. The policy rationale is strategic (reducing supply chain vulnerability), but the economics are challenging.

US chip manufacturing costs run 30-50% higher than Taiwan or Korea operations due to labor costs, permitting timelines, and less mature supplier ecosystems. Government subsidies offset these premiums, but only partially. The realistic outcome: the US develops leading-edge manufacturing capability for strategic supply assurance rather than cost-competitive commercial production. For investors, CHIPS Act beneficiaries gain protected cash flows for domestic government and defense contracts. Commercial margins remain under pressure.

Valuation Framework: Valuing Through the Cycle

Semiconductors are still cyclical. I want to be direct about this: the AI narrative has not changed the fundamental cyclicality of the industry. Demand cycles, supply ramps, overshoots, and corrections are baked into the structure of the business. Standard P/E ratios are nearly useless at cycle peaks and troughs. The professional approach is to value on normalized earnings, meaning what the business earns on average across a full semiconductor cycle, rather than current-period earnings.
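The normalized-earnings approach can be sketched in a few lines. The EPS series below is hypothetical, shaped to mimic a full cycle (ramp, peak, trough, recovery); the point is the mechanic, not the numbers.

```python
# Through-cycle normalized P/E sketch. The EPS series is hypothetical,
# shaped like a full semiconductor cycle (ramp, peak, trough, recovery).

def normalized_pe(price, eps_history):
    """Price over average EPS across a full cycle."""
    norm_eps = sum(eps_history) / len(eps_history)
    return price / norm_eps

eps_cycle = [4.0, 6.5, 8.0, 3.0, 1.5, 5.0]  # hypothetical full-cycle EPS
price = 90.0

print(f"spot P/E on peak EPS: {price / max(eps_cycle):.1f}x")
print(f"normalized P/E:       {normalized_pe(price, eps_cycle):.1f}x")
```

The gap between the two numbers is the whole argument: at a cycle peak the spot multiple (11x here) looks cheap, while the normalized multiple (19x) tells you what you are actually paying for average earning power.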

For pure-play equipment companies (ASML, Applied Materials, Lam Research), EV/Sales on mid-cycle revenue is the most stable metric because capital equipment demand is a leading indicator. These companies typically trade at 6-12x mid-cycle sales. Below 6x has historically been a strong entry point.
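The same mid-cycle logic applied to EV/Sales can be written as a simple screen. The 6-12x band and the below-6x entry rule come from the text; the EV and revenue figures are hypothetical placeholders, not data for any named company.

```python
# Mid-cycle EV/Sales screen for equipment names, using the 6-12x band
# described above. EV and revenue figures are hypothetical placeholders.

def ev_to_midcycle_sales(enterprise_value, revenue_history):
    """EV over average revenue across the cycle (a mid-cycle proxy)."""
    mid_cycle_rev = sum(revenue_history) / len(revenue_history)
    return enterprise_value / mid_cycle_rev

revenues = [18.0, 25.0, 30.0, 21.0, 26.0]  # hypothetical, $B per year
ev = 140.0                                  # hypothetical enterprise value, $B

ratio = ev_to_midcycle_sales(ev, revenues)
if ratio < 6:
    verdict = "below 6x (historically a strong entry point)"
elif ratio <= 12:
    verdict = "within the typical 6-12x band"
else:
    verdict = "above 12x (rich versus history)"
print(f"EV / mid-cycle sales: {ratio:.2f}x -> {verdict}")
```

Averaging revenue over the cycle rather than using trailing-twelve-month sales is what keeps the metric stable when current-period revenue sits at a peak or trough.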

For fabless AI chip companies in the current cycle, the relevant question is whether AI compute demand will sustain the current capex super-cycle long enough to justify current valuations. Microsoft, Google, Amazon, and Meta have collectively committed $300B+ in 2025 capex guidance, a significant portion of which goes to AI infrastructure. The capital is allocated. The risk is whether AI monetization justifies sustained infrastructure investment, or whether the capex cycle peaks in 2025-2026 as capacity catches up to near-term demand.

The automotive and IoT semiconductor tailwinds are real and underappreciated relative to the AI narrative. A modern EV contains $600-900 of semiconductor content versus $400-500 for an internal combustion vehicle. ADAS systems, software-defined vehicle architectures, and charging infrastructure are secular demand drivers that are uncorrelated with AI training budgets. NXP Semiconductors, Infineon, and ON Semiconductor offer semiconductor exposure with more predictable demand curves and less AI-cycle concentration risk.
