You can buy them new for somewhat reasonable prices. What people should really look at is used 1080 Tis on eBay. They're going for less than $150 and still run plenty of games perfectly fine. It's the budget PC gaming deal of the century.
Not in my country lol. Getting used cards was already the norm before; for a while you could literally only get used ones for a good price on AliExpress.
And now our government imposed a 100% tax on anything from China, so it's really just not affordable.
Meanwhile, I don't think I've played more than 30 minutes on my PS5 this year and it's June, and I have definitely not played any minutes on the 1080 sitting in my PC...
Oh fuck, scratch that, I may have played about 2 hours of Dune: Spice Wars.
I think it's largely the chip manufacturers, but ARM is still making money on licensing fees; Nvidia's new AI chip (with an integrated 72-core ARM CPU), for example.
ARM is in the perfect place where, if a company using their architecture succeeds, they get tons of money, and if the company fails, they lose nothing.
I mean, if LLM/diffusion-type AI is a dead end and the extra investment happening now doesn't lead anywhere beyond that, then yes, the bubble will likely burst.
But this kind of investment could create something else. We'll see. I'm 50/50 on its potential myself; I think it's more likely that a lot of loud-talking con artists will soak up all the investment and deliver nothing.
Bubbles have nothing to do with the technology; the tech is just a tool to build the hype. The bubble will burst regardless of the success of the tech. At most, success will slightly delay the burst, because what's bursting isn't the tech, it's the financial structures around it.
It's looking like a dead end. The content that can be fed into the big LLMs has already been fed in. New content is a combination of actual humans and stuff generated by LLMs, so it runs into an ouroboros problem where it just eats its own output.
Unfortunately you've completely switched your entire economy to shovels for this thing and also all your warehouses are full of shovels for this thing and most of your assets are tied up in shovels for this thing. And now suddenly, this thing stops.
I doubt it. Regardless of the current stage of machine learning, everyone is now tuned in and pushing the tech. Even if LLMs turn out to be mostly a dead end, everyone investing in ML means that the ability to do LOTS of floating-point math very quickly, without the heaviness of CPU operations, isn't going away any time soon. Which means Nvidia is sitting pretty.
See Sun Microsystems after the dot-com bubble burst. They produced a lot of the servers that dot-com companies were using at the time, shriveled up afterwards, and were eventually absorbed by Oracle.
Why did Oracle survive the same period? Because they latched onto a traditional Fortune 500 market and haven't let go to this day.
As far as I understand, the GPUs that LLMs use aren't exactly interchangeable with your regular GPU. Also, no one needs that many GPUs for any traditional use cases.
The worst one is probably Apple. They just announced "Apple Intelligence", which is just ChatGPT, whose maker OpenAI is largely backed by Microsoft. Figure that one out.
Well, most of the requests are handled on-device with their own models. If it's going to ChatGPT for something, it will ask for permission and only then use ChatGPT.
So Apple Intelligence isn't all ChatGPT. I think this deserves to be mentioned, as a lot of the processing will be on-device.
Also, I believe part of the deal is that ChatGPT can save nothing and Apple are anonymising the requests too.
Not true. Most if not all requests are handled by Apple's own models, on device or on their own servers. When it does use OpenAI, you need to give it permission each time.
That's just not true. Most requests are handled on-device. If the system decides a request should go to ChatGPT, the user is prompted to agree, and no data is stored on OpenAI's servers. Plus, all of this is opt-in.
I don't think it's a problem, more like a situation. You are not doing anything wrong or stupid, just interested in something new and promising and have the resources to pursue it. Good for you, may you find gold.
Getting ROCm to work properly is like herding cats.
You need a custom implementation for the specific operating system, the driver version must be locked and compatible (especially with a Workstation / WRX card, where the Pro drivers are especially prone to breaking), you need the specific dependencies compiled for your variant of hipBLAS, or ZLUDA if that doesn't work, then you need ONNX transition graphs, only to find out PyTorch doesn't support ONNX unless it's 1.2.0, which breaks another dependency of X-Transformers, which then breaks because that version of hipBLAS is incompatible with the older version of Python, and...
Inhales
And THEN, MAYBE, it'll work at 85% of the speed of CUDA, if it doesn't crash first with an arbitrary error such as CUDA_UNIMPLEMENTED_FUNCTION_HALF.
You get the picture. On Nvidia it's: click, open, CUDA working? Yes? Done. You don't spend 120 hours fucking around and recompiling for your specific use case.
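For reference, the basic sanity check looks the same from Python on either stack, since PyTorch's ROCm builds expose the GPU through the regular torch.cuda API. A minimal sketch, assuming nothing beyond a working install:

```python
# Minimal PyTorch GPU sanity check; works on both CUDA and ROCm builds,
# because the ROCm build reuses the torch.cuda API.
import torch

print("PyTorch:", torch.__version__)
print("GPU visible:", torch.cuda.is_available())
print("HIP/ROCm build:", torch.version.hip is not None)

if torch.cuda.is_available():
    print("Device:", torch.cuda.get_device_name(0))
    # Tiny matmul to confirm the kernels actually run, not just enumerate.
    x = torch.randn(1024, 1024, device="cuda")
    print("Matmul OK:", float((x @ x).sum()))
```

The difference between the two vendors is everything that has to happen before this script prints True.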
Also, you need a supported card. I have a potato going by the name of RX 5500, which is not on the supported list. That leaves me the choice between three ROCm versions:
1. An age-old prebuilt. It generally works but occasionally crashes the graphics driver, unrecoverably so: Linux tries to re-initialise everything, that fails, and it needs a proper reset. I do need to tell it to pretend I have a different card (see the sketch after this list).
2. A custom build, which I fished out of a Docker image I found on the net because I can't be arsed to build that behemoth myself. It's dog-slow, because it uses all generic code and no specialised kernels.
3. A newer prebuilt, any of them. Works fine for some, or should I say very few, workloads (mostly just BLAS stuff); otherwise it simply hangs, presumably because they updated the kernels and they now use instructions my card doesn't have.
#1 is what I'm actually using. I can deal with a random crash every other day to every other week or so.
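The "pretend I have a different card" bit is the usual HSA_OVERRIDE_GFX_VERSION trick: spoof a supported gfx ISA so ROCm loads kernels built for another card. Rough sketch below; the version string is only a placeholder, you pick whatever supported ISA is closest to your card's, and whether it holds up is luck:

```python
# Hypothetical example of spoofing the gfx version so ROCm treats an
# unsupported card as a supported one. The value "10.3.0" is a placeholder,
# not a recommendation for the RX 5500 specifically.
import os
os.environ["HSA_OVERRIDE_GFX_VERSION"] = "10.3.0"  # must be set before the ROCm runtime loads

import torch  # imported after the override so it takes effect

if torch.cuda.is_available():
    print("ROCm sees:", torch.cuda.get_device_name(0))
else:
    print("No GPU visible; the override didn't help.")
```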
It really would not take much work for them to offer a fourth version: one that's not "supported-supported" but "we're making sure this thing runs": current ROCm code, using kernels written for other cards if they happen to work, and generic code otherwise.
Seriously, ROCm is making me consider Intel cards. Price/performance is decent, there's plenty of VRAM (at least for the class), and apparently their API support is actually great. I don't need CUDA or ROCm after all; what I need is PyTorch.
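To be fair, if PyTorch is the actual requirement, the code can be written to not care which vendor it lands on. A rough sketch, assuming a recent PyTorch where Intel GPUs show up as the xpu backend (older setups needed intel_extension_for_pytorch instead); pick_device is just an illustrative name:

```python
# Backend-agnostic device selection: CUDA/ROCm first, then Intel XPU, then CPU.
import torch

def pick_device() -> torch.device:
    if torch.cuda.is_available():  # covers both CUDA and ROCm builds
        return torch.device("cuda")
    if hasattr(torch, "xpu") and torch.xpu.is_available():  # Intel GPUs
        return torch.device("xpu")
    return torch.device("cpu")

device = pick_device()
model = torch.nn.Linear(128, 128).to(device)
x = torch.randn(32, 128, device=device)
print(device, model(x).shape)
```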
I think it's in the pipeline. AMD has bought Xilinx, which builds FPGAs and already had some AI-specific cores in their processors. I believe they're developing that further and integrating it into their GPUs now.