XM은(는) 미국 국적의 시민에게 서비스를 제공하지 않습니다.

AI's next feat will be its descent from the cloud



<html xmlns="http://www.w3.org/1999/xhtml"><head><title>RPT-BREAKINGVIEWS-AI's next feat will be its descent from the cloud</title></head><body>

The author is a Reuters Breakingviews columnist. The opinions expressed are her own.

By Robyn Mak

HONG KONG, Oct 2 (Reuters Breakingviews) -It's been two years since ChatGPT made its public debut, kicking off a rush to invest in generative artificial intelligence. The frenzy has lifted valuations for startups like OpenAI, inventor of the chatbot, as well as technology titans whose cloud computing platforms train and host the models that enable these services. The current boom is already showing signs of strain. AI's next phase of growth may be in the palm of your hand.

So-called generative AI, where a model creates new content based on the data it’s trained on, today largely exists in the cloud. OpenAI, for example, uses Microsoft's MSFT.O Azure platform to train and run its large language models (LLMs). Anyone with an internet connection can make a query on ChatGPT using Azure's data centres around the world. But as models get larger and more complex, so does the infrastructure to train them and handle queries from users.

The result is a scramble to build bigger and more powerful data centres. OpenAI and Microsoft, for example, are in talks for a data centre project set to launch in 2028 that's projected to cost a whopping $100 billion, according to The Information.

All in all, Google owner Alphabet GOOGL.O, Microsoft and Meta Platforms META.O, which owns Instagram and Facebook, are forecast to spend a combined $160 billion in capital expenditures next year, per LSEG data, three-quarters more than in 2022. Most of that will go toward purchasing Nvidia's NVDA.O coveted $25,000 graphic processor units (GPU) and other related infrastructure to train models. The $3 trillion company's CEO Jensen Huang predicts investment in data centres will double to $2 trillion over the next four or five years.

These sums raise awkward questions about how sustainable this level of spending is, and whether chatbots and other applications can bring in enough revenue to generate a positive return on such staggering investments. Companies are also grappling with the challenge of finding land to house new data centres and the securing sufficient electricity supplies to power and cool the chips. Big Tech's dominance of LLMs and cloud computing is also attracting regulatory scrutiny. Last year, Microsoft, Amazon AMZN.O and Google accounted for 58% of global AI server procurement, Morgan Stanley analysts reckon.

These factors explain the latest tech buzzword: “edge AI”. This phrase refers to algorithms and models that run on smartphones or personal computers at the edge of a network rather than a centralised server farm. This approach has several advantages over cloud-based AI. Users will get responses on their devices in real time, without the need for a high-speed internet connection. Their personal data would also stay on the device, rather than being transmitted to a server owned by a third party. And given the ubiquity of handsets and PCs, adoption could be rapid. Analysts at UBS reckon nearly 50% of smartphones, roughly 583 million units, will have generative AI capabilities by 2027, up from just 4% in 2023.

The biggest hurdle is technological: today's devices do not have the computing power, energy and memory bandwidth to run a large model such as OpenAI's GPT-4, which contains an estimated 1.8 trillion parameters. Even Facebook's relatively smaller LLAMA models, with 7 billion parameters, would require an additional 14 gigabytes of temporary storage to work on a phone. Apple’s latest iPhone 16 only comes with 8GB of such random access memory (RAM).

Even so, there are reasons to be optimistic. Companies and developers are increasingly turning to smaller models which are customised for specific tasks. They require less data and effort to train - Google's self-described "lightweight" Gemma architecture contains as little as 2 billion parameters - and are typically open-source and free to use. And because of their highly-specialised nature, smaller models often outperform their larger and more generalised counterparts, with fewer errors.

Besides, most contemporary day-to-day use cases for AI, such as photo-editing tools and personal assistants, probably won't require large models. Some smartphones already boast live translation and real-time transcription functions. And it makes sense for cloud providers to shift basic AI functions to the edge, freeing up powerful data centres for more complex tasks.

At the same time, makers of semiconductors and other components are cramming more processing power and memory into a phone or PC. Research firm Yole Group forecasts the proportion of smartphones that can support an LLM with 7 billion parameters will grow to 11% this year, up from 8% last year. Leading chipmakers such as Taiwan's TSMC 2330.TW and South Korea's Samsung Electronics 005930.KS and SK Hynix 000660.KS are pioneering new methods such as advanced packaging in semiconductors, whereby they stack multiple chips into one "chiplet". That allows them to build even more powerful processors without having to shrink chip circuitry in order to squeeze in more transistors. One former TSMC executive predicted that within a decade, this technology could lead to a "multichiplet" containing more than 1 trillion transistors.

For investors, edge AI has the potential to mint more winners. So far, shareholders have assumed that most of the gains from AI will accrue to the biggest tech firms with the deepest pockets, as well as Nvidia and a handful of startups. Yet AI tools could prompt consumers to upgrade to newer and more sophisticated smartphones and personal computers. UBS analysts forecast combined sales in the two markets will surpass $700 billion by 2027, up 14% from this year. Brands from Apple to Lenovo 0992.HK – as well as their suppliers - all stand to benefit.

In semiconductors, Nvidia's advanced GPUs will still dominate. But other chip firms like Qualcomm QCOM.O and MediaTek 2454.TW should also gain. The Taiwanese group is set to unveil its latest chipset that can support large models next month; executives expect revenue from its flagship mobile products can grow 50% this year.

As with the cloud-based variety, the success of edge AI will depend on coming up with compelling applications which users think are worth paying for. If that happens, the next big thing in AI will be found in smaller models and smaller devices.

Follow @mak_robyn on X


Graphic: AI is turbo-charging Big Tech's capital expenditures https://reut.rs/4enVndf

Graphic: Edge AI will drive smartphone and PC sales https://reut.rs/3BsxOBp


Editing by Peter Thal Larsen and Aditya Srivastav

</body></html>

면책조항: XM Group 회사는 체결 전용 서비스와 온라인 거래 플랫폼에 대한 접근을 제공하여, 개인이 웹사이트에서 또는 웹사이트를 통해 이용 가능한 콘텐츠를 보거나 사용할 수 있도록 허용합니다. 이에 대해 변경하거나 확장할 의도는 없습니다. 이러한 접근 및 사용에는 다음 사항이 항상 적용됩니다: (i) 이용 약관, (ii) 위험 경고, (iii) 완전 면책조항. 따라서, 이러한 콘텐츠는 일반적인 정보에 불과합니다. 특히, 온라인 거래 플랫폼의 콘텐츠는 금융 시장에서의 거래에 대한 권유나 제안이 아닙니다. 금융 시장에서의 거래는 자본에 상당한 위험을 수반합니다.

온라인 거래 플랫폼에 공개된 모든 자료는 교육/정보 목적으로만 제공되며, 금융, 투자세 또는 거래 조언 및 권고, 거래 가격 기록, 금융 상품 또는 원치 않는 금융 프로모션의 거래 제안 또는 권유를 포함하지 않으며, 포함해서도 안됩니다.

이 웹사이트에 포함된 모든 의견, 뉴스, 리서치, 분석, 가격, 기타 정보 또는 제3자 사이트에 대한 링크와 같이 XM이 준비하는 콘텐츠 뿐만 아니라, 제3자 콘텐츠는 일반 시장 논평으로서 "현재" 기준으로 제공되며, 투자 조언으로 여겨지지 않습니다. 모든 콘텐츠가 투자 리서치로 해석되는 경우, 투자 리서치의 독립성을 촉진하기 위해 고안된 법적 요건에 따라 콘텐츠가 의도되지 않았으며, 준비되지 않았다는 점을 인지하고 동의해야 합니다. 따라서, 관련 법률 및 규정에 따른 마케팅 커뮤니케이션이라고 간주됩니다. 여기에서 접근할 수 있는 앞서 언급한 정보에 대한 비독립 투자 리서치 및 위험 경고 알림을 읽고, 이해하시기 바랍니다.

리스크 경고: 고객님의 자본이 위험에 노출 될 수 있습니다. 레버리지 상품은 모든 분들에게 적합하지 않을수 있습니다. 당사의 리스크 공시를 참고하시기 바랍니다.