Downside is that it won’t work if my internet or power is out
If it doesn’t work when your internet is out, then it’s not local.
Did you really just try to excuse and downplay a company claiming full ownership and rights over all users’ data?
Most of that is in the kernel anyways.
Ya, I was a fairly big Nvidia defender for the past few years. I used a ton of their stuff for my last job, and genuinely didn’t have any issues with them (all Linux systems, gaming and AI workloads).
But their recent actions have really soured my view of them.
It is true. Almost all tech news outlets covered it.
Well, right after that Nvidia amended their license agreements stating that you cannot use CUDA with any translation layers.
The project you’re thinking of is ZLUDA.
Are all of the top 500 supercomputers, all the computers at CERN, the servers at Google, Facebook, Amazon, Azure, Netflix, eBay, etc., just a lab?
India, the country with the largest population, has a 15% desktop Linux market share.
Additionally, these surveys are highly inaccurate. They are at best a conservatively lowballed figure. Linux installations don’t send a ping to a server anywhere to count the install, and there’s no other mechanism within the Linux ecosystem itself to gauge or count installs. Most computers used for Linux are also sold with Windows pre-installed, which means there’s no clean way to use sales figures either.
All that leaves is the browser user agent when visiting select websites that track and share the number of unique visitors that identify as Linux.
I did the math a few months ago in a different discussion (not on Lemmy) and my math at the time came up to about 50 million desktop Linux users, and that was using the “official” reported numbers of 3.x% at the time.
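For anyone who wants to sanity-check that figure, here’s the back-of-envelope version (the ~1.5 billion desktop PCs in use worldwide is my assumption, not something from the surveys):

```python
desktop_pcs = 1_500_000_000  # assumed number of desktop PCs in use worldwide
linux_share = 0.033          # the ~3.x% "official" reported figure at the time

print(f"{desktop_pcs * linux_share / 1e6:.0f} million desktop Linux users")
# -> ~50 million
```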
That also ignores that the Stack Overflow developer survey puts desktop Linux at over 50% for personal use, and (IIRC) about 47% for professional use.
But let’s be honest
You can’t be honest if you boil a very large, very diverse, and very technical landscape, one with more variations and caveats than the English language, down to a single percentage.
Also, in case anyone is wondering, the Stack Overflow numbers didn’t include WSL. If you do include that then desktop Linux usage was over 70% for personal use.
OK mman, dont pop a vein over this
That’s incredibly rude. At no point was I angry or enraged. What you’re trying to do is minimize my criticism of your last comment by intentionally making it seem like I was unreasonably angry.
I was going to continue with you in a friendly manner, but screw you. You’re an ass (and also entirely wrong).
A lot of what you said is true.
Since the TPU is a matrix processor instead of a general purpose processor, it removes the memory access problem that slows down GPUs and CPUs and requires them to use more processing power.
Just no. Flat out no. Just so much wrong. How does the TPU process data? How does the data get there? It needs to be shuttled back and forth over the bus. Doing this for a 1080p image worth of data several times a second is fine. An uncompressed 1080p image is about 8MB. Entirely manageable.
Edit: it’s not even 1080p, because the image would get resized to the input size. So again, 300x300x3 for the last model I could find.
/Edit
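To put rough numbers on the transfer sizes (back-of-envelope only):

```python
# Data shuttled over the bus per inference, roughly.
bytes_1080p = 1920 * 1080 * 4  # uncompressed RGBA frame: ~8.3 MB
bytes_input = 300 * 300 * 3    # a typical resized detector input: ~270 KB

print(f"1080p frame:   {bytes_1080p / 1e6:.1f} MB")
print(f"Resized input: {bytes_input / 1e3:.0f} KB")
```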
Look at this repo. You need to convert the models using the TFLite framework (TensorFlow Lite), which is designed for resource-constrained edge devices. The max resolution for input size is 224x224x3. I would imagine it can’t handle anything larger.
https://github.com/jveitchmichaelis/edgetpu-yolo/tree/main/data
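For reference, a minimal sketch of what that conversion looks like (MobileNetV2 is just an example model here, and you’d still need Google’s separate edgetpu_compiler step to target the Coral itself):

```python
import tensorflow as tf

# A small pretrained model at the kind of input size these devices expect.
model = tf.keras.applications.MobileNetV2(input_shape=(224, 224, 3), weights="imagenet")

# Convert with the TFLite converter; default optimizations enable quantization.
converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]

with open("model.tflite", "wb") as f:
    f.write(converter.convert())
```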
Now look at the official model zoo on the Google Coral website.
Not a single model is larger than 40MB. Whereas LLMs start at well over a gig for even smaller (and inaccurate) models. The good ones start at about 4GB and I frequently run models at about 20GB. The size in parameters really makes a huge difference.
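The size is basically parameter count times bytes per parameter, so as a rough sketch (the quantization levels are my assumptions):

```python
# Approximate on-disk model size: parameters x bytes per parameter.
def size_gb(params: float, bytes_per_param: float) -> float:
    return params * bytes_per_param / 1e9

print(f"{size_gb(10e6, 1):.2f} GB")   # a Coral-sized ~10M-parameter model: ~0.01 GB
print(f"{size_gb(4e9, 1):.0f} GB")    # a 4B-parameter LLM, 8-bit quantized: ~4 GB
print(f"{size_gb(13e9, 1.5):.0f} GB") # a 13B LLM at a mid-level quantization: ~20 GB
```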
You likely/technically could run an LLM on a Coral, but you’re going to wait on the order of double-digit minutes for a basic response, if not way longer.
It’s just not going to happen.
when comparing apples to apples.
But this isn’t easy to do, and it’s impossible in some cases.
Historically, Nvidia has done better than AMD in gaming performance because there are just so many game-specific optimizations in the Nvidia drivers, whereas AMD didn’t do the same.
On the other hand, AMD historically had better raw performance in scientific calculation tasks (pre-deeplearning trend).
Nvidia has had a stranglehold on the AI market entirely because of their CUDA dominance. But hopefully AMD has finally bucked that trend with their new ROCm release that is a drop-in replacement for CUDA (meaning you can just run CUDA-compiled applications on AMD with no changes).
Also, AMD’s new MI300X AI processor is (supposedly) wiping the floor with Nvidia’s H100 cards. I say “supposedly” because I don’t have $50k USD to buy both cards and compare myself.
Ya, that just solidifies that you don’t know how to use the word.
How does using a certain operating system equate to “someone who annoys others by correcting small errors”?
I’m not sure you know how to use that word.
And you can add as many TPUs as you want to push it to whatever level you want
No you can’t. You’re going to be limited by the number of PCIe lanes. But putting that aside, those Coral TPUs don’t have any memory. Which means for each operation you need to shuffle the relevant data over the bus to the device for processing, and then back and forth again. You’re going to be doing this thousands of times per second (likely much more), and I can tell you from personal experience that running AI like that is painfully slow (if you can get it to even work that way in the first place).
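A back-of-envelope for why that shuffling kills you (all three numbers below are assumptions, and this ignores per-transfer latency, which makes it even worse):

```python
# Time to stream a model's weights over the bus for one forward pass.
layer_mb = 50   # assumed size of one layer's weights, in MB
layers   = 80   # layers shuttled per forward pass
bus_gb_s = 0.5  # assumed effective USB 3 / PCIe x1 throughput, in GB/s

seconds = layers * (layer_mb / 1e3) / bus_gb_s
print(f"~{seconds:.0f} s per forward pass, before any compute at all")
# An LLM does one forward pass per generated token, so minutes add up fast.
```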
You’re talking about the equivalent of buying hundreds of dollars of groceries, then getting it all home 10km away on foot, carrying only whatever fits in your pockets, over multiple trips.
What you’re suggesting can’t work.
ATI cards (while pretty good) are always a step behind Nvidia.
Ok, you mean AMD. They bought ATI like 20 years ago now and that branding is long dead.
And AMD cards are hardly “a step behind” Nvidia. This is only true if you buy the 24GB top card of the series. Otherwise you’ll get comparable performance from AMD at a better value.
Plus, most distros have them working out of the box.
Unless you’re running a kernel older than 6.x, every distro will support AMD cards. And even then, you could always install the proprietary blobs from AMD and get full support on any distro. The kernel version only matters if you want to use the FOSS kernel drivers for the cards.
Two* GPUs? Is that a thing? How does that work on a desktop?
I’ve been using two GPUs in a desktop for 15 years now. One AMD and one Nvidia (although not lately).
It really works just the same as a single GPU. The system doesn’t really care how many you have plugged in.
The only difference you have to care about is specifying which GPU you want a program to use.
For example, if you had multiple Nvidia GPUs you could specify which one to use from the command line with:
CUDA_VISIBLE_DEVICES=0
or the first two with:
CUDA_VISIBLE_DEVICES=0,1
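You can do the same from inside a program too. A minimal PyTorch sketch (PyTorch is just my example of a CUDA-backed library):

```python
import torch

# List the GPUs the system exposes.
for i in range(torch.cuda.device_count()):
    print(i, torch.cuda.get_device_name(i))

# Pin a tensor (and by extension a model) to the second GPU.
# Assumes at least two GPUs are present.
x = torch.randn(1024, 1024, device="cuda:1")
```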
Anyways, you get the idea. It’s a thing that people do and it’s fairly simple.
getting a few CUDA TPUs
Those aren’t “CUDA” anything. CUDA is a parallel processing framework by Nvidia and for Nvidia’s cards.
Also, those devices are only good for inferencing smaller models for things like object detection. They aren’t good for developing AI models (in the sense of training). And they can’t run LLMs. Maybe you can run a smaller model under 4B, but those aren’t exactly great for accuracy.
The best you could hope for is to run a very small instruct model trained on very specific data (like robotic actions) that doesn’t need accuracy in the sense of “knowledge accuracy”.
And completely forget about any kind of generative image stuff.
Are CUDAs something that I can select within pcpartpicker?
I’m not sure what they were trying to say, but there’s no such thing as “getting a couple of CUDAs”.
CUDA is a framework that runs on Nvidia hardware. It’s the hardware that will have “CUDA cores” which are large amounts of low power processing units. AMD calls them “stream processors”.
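A quick way to see that CUDA is software sitting on top of Nvidia hardware (again using PyTorch purely as an example of a CUDA-backed library):

```python
import torch

# Only True on a machine with an Nvidia GPU plus CUDA drivers installed.
print(torch.cuda.is_available())
if torch.cuda.is_available():
    print(torch.cuda.get_device_name(0))
```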
You could also completely forego the GPU and get a couple of CUDAs for a fraction of the cost.
What is this sentence? How do you “get a couple of CUDAs”?
Arch is great when you’re somewhat experienced with Linux. Otherwise I recommend an Arch + QoL distro like EndeavourOS.
I’m a developer who’s been using Linux for well over a decade and a half, and I use EndeavourOS because it just adds a level of ease.