Компания XM не предоставляет услуги резидентам Соединенных штатов Америки.

AI's next feat will be its descent from the cloud



<html xmlns="http://www.w3.org/1999/xhtml"><head><title>BREAKINGVIEWS-AI's next feat will be its descent from the cloud</title></head><body>

The author is a Reuters Breakingviews columnist. The opinions expressed are her own.

By Robyn Mak

HONG KONG, Oct 2 (Reuters Breakingviews) -It's been two years since ChatGPT made its public debut, kicking off a rush to invest in generative artificial intelligence. The frenzy has lifted valuations for startups like OpenAI, inventor of the chatbot, as well as technology titans whose cloud computing platforms train and host the models that enable these services. The current boom is already showing signs of strain. AI's next phase of growth may be in the palm of your hand.

So-called generative AI, where a model creates new content based on the data it’s trained on, today largely exists in the cloud. OpenAI, for example, uses Microsoft's MSFT.O Azure platform to train and run its large language models (LLMs). Anyone with an internet connection can make a query on ChatGPT using Azure's data centres around the world. But as models get larger and more complex, so does the infrastructure to train them and handle queries from users.

The result is a scramble to build bigger and more powerful data centres. OpenAI and Microsoft, for example, are in talks for a data centre project set to launch in 2028 that's projected to cost a whopping $100 billion, according to The Information.

All in all, Google owner Alphabet GOOGL.O, Microsoft and Meta Platforms META.O, which owns Instagram and Facebook, are forecast to spend a combined $160 billion in capital expenditures next year, per LSEG data, three-quarters more than in 2022. Most of that will go toward purchasing Nvidia's NVDA.O coveted $25,000 graphic processor units (GPU) and other related infrastructure to train models. The $3 trillion company's CEO Jensen Huang predicts investment in data centres will double to $2 trillion over the next four or five years.

These sums raise awkward questions about how sustainable this level of spending is, and whether chatbots and other applications can bring in enough revenue to generate a positive return on such staggering investments. Companies are also grappling with the challenge of finding land to house new data centres and the securing sufficient electricity supplies to power and cool the chips. Big Tech's dominance of LLMs and cloud computing is also attracting regulatory scrutiny. Last year, Microsoft, Amazon AMZN.O and Google accounted for 58% of global AI server procurement, Morgan Stanley analysts reckon.

These factors explain the latest tech buzzword: “edge AI”. This phrase refers to algorithms and models that run on smartphones or personal computers at the edge of a network rather than a centralised server farm. This approach has several advantages over cloud-based AI. Users will get responses on their devices in real time, without the need for a high-speed internet connection. Their personal data would also stay on the device, rather than being transmitted to a server owned by a third party. And given the ubiquity of handsets and PCs, adoption could be rapid. Analysts at UBS reckon nearly 50% of smartphones, roughly 583 million units, will have generative AI capabilities by 2027, up from just 4% in 2023.

The biggest hurdle is technological: today's devices do not have the computing power, energy and memory bandwidth to run a large model such as OpenAI's GPT-4, which contains an estimated 1.8 trillion parameters. Even Facebook's relatively smaller LLAMA models, with 7 billion parameters, would require an additional 14 gigabytes of temporary storage to work on a phone. Apple’s latest iPhone 16 only comes with 8GB of such random access memory (RAM).

Even so, there are reasons to be optimistic. Companies and developers are increasingly turning to smaller models which are customised for specific tasks. They require less data and effort to train - Google's self-described "lightweight" Gemma architecture contains as little as 2 billion parameters - and are typically open-source and free to use. And because of their highly-specialised nature, smaller models often outperform their larger and more generalised counterparts, with fewer errors.

Besides, most contemporary day-to-day use cases for AI, such as photo-editing tools and personal assistants, probably won't require large models. Some smartphones already boast live translation and real-time transcription functions. And it makes sense for cloud providers to shift basic AI functions to the edge, freeing up powerful data centres for more complex tasks.

At the same time, makers of semiconductors and other components are cramming more processing power and memory into a phone or PC. Research firm Yole Group forecasts the proportion of smartphones that can support an LLM with 7 billion parameters will grow to 11% this year, up from 8% last year. Leading chipmakers such as Taiwan's TSMC 2330.TW and South Korea's Samsung Electronics 005930.KS and SK Hynix 000660.KS are pioneering new methods such as advanced packaging in semiconductors, whereby they stack multiple chips into one "chiplet". That allows them to build even more powerful processors without having to shrink chip circuitry in order to squeeze in more transistors. One former TSMC executive predicted that within a decade, this technology could lead to a "multichiplet" containing more than 1 trillion transistors.

For investors, edge AI has the potential to mint more winners. So far, shareholders have assumed that most of the gains from AI will accrue to the biggest tech firms with the deepest pockets, as well as Nvidia and a handful of startups. Yet AI tools could prompt consumers to upgrade to newer and more sophisticated smartphones and personal computers. UBS analysts forecast combined sales in the two markets will surpass $700 billion by 2027, up 14% from this year. Brands from Apple to Lenovo 0992.HK – as well as their suppliers - all stand to benefit.

In semiconductors, Nvidia's advanced GPUs will still dominate. But other chip firms like Qualcomm QCOM.O and MediaTek 2454.TW should also gain. The Taiwanese group is set to unveil its latest chipset that can support large models next month; executives expect revenue from its flagship mobile products can grow 50% this year.

As with the cloud-based variety, the success of edge AI will depend on coming up with compelling applications which users think are worth paying for. If that happens, the next big thing in AI will be found in smaller models and smaller devices.

Follow @mak_robyn on X


Graphic: AI is turbo-charging Big Tech's capital expenditures https://reut.rs/4enVndf

Graphic: Edge AI will drive smartphone and PC sales https://reut.rs/3BsxOBp


Editing by Peter Thal Larsen and Aditya Srivastav

</body></html>

Похожие активы


Последние новости

Dollar firm as war widens in Middle East

A
C
E
E
G
N
U
W

Top Economic Events to November 28

C
L
S
S

Valero Benicia, California Refinery Reports Flaring Due to System Maintenance


Heavy oil discount tightens as new trade cycle begins


South Korea inflation cools more than expected as rate cut talk grows

Правовая оговорка: Компании группы XM Group предоставляют только услуги по исполнению сделок и доступ к нашей торговой онлайн-среде, в которой пользователи могут просматривать и (или) пользоваться материалами, доступными на вебсайте либо доступными по ссылкам с данного сайта на другие. Предоставление доступа к онлайн-среде не меняет сути предоставляемых услуг и не расширяет их. Такой доступ и пользование материалами предоставляются с учетом:(i) «Условий и положений»; (ii) «Предупреждений о рисках» и (iii) полного текста «Правовой оговорки». Следовательно, подобные материалы предоставляются лишь в качестве информации общего характера. В частности, просим Вас иметь в виду, что материалы, содержащиеся в нашей торговой онлайн-среде, не являются ни просьбой осуществить какие-либо транзакции на финансовых рынках, ни предложением к осуществлению подобных транзакций. Торговля на любом финансовом рынке подразумевает большой риск потери Вашего капитала.

Все материалы, опубликованные в нашей торговой среде, предоставляются только в образовательных или информационных целях и не содержат (и не должны рассматриваться как содержащие) финансовых, инвестиционных или торговых рекомендаций, а также информации о стоимости наших услуг по предоставлению доступа к рынкам, либо предложения или содействия в проведении транзакций по какому-либо финансовому инструменту или по незапрашиваемым финансовым услугам по отношению к Вам.

Любые материалы на данном вебсайте, созданные третьими лицами, а также материалы, подготовленные XM, такие как мнения экспертов, новости, исследования, анализ, котировки и другая информация, а также ссылки на сторонние сайты предоставляются в виде «как есть», как рыночная информация общего характера, и не являют собой рекомендации по инвестициям. Принимая во внимание то, что любые материалы рассматриваются как инвестиционное исследование, Вам следует учесть и принять тот факт, что никакие материалы не подготавливались и не предназначались к использованию в соответствии с правовыми нормами, способствующими независимости инвестиционных исследований. Следовательно, материалы следует рассматривать как материалы рекламного характера согласно соответствующим законам и правовым нормам. Рекомендуем Вам прочесть и уяснить для себя положения наших «Уведомления о субъективном инвестиционном исследовании» и «Предупреждения о рисках» в отношении приведённой выше информации. С этими документами можно ознакомиться здесь.

Предупреждение о риске. Вы рискуете потерять свой капитал. Торговля маржинальными продуктами подходит не всем инвесторам. Пожалуйста, ознакомьтесь с нашим Предупреждением о рисках.