Nestlé USA Names Martin Thompson as Chief Executive Officer and U.S. Market Head

theunknownmuncher@lemmy.world · 3 days ago

Desktop computer mainly, sometimes a laptop. Tablets are painful to use IMO

theunknownmuncher@lemmy.world · 9 days ago

Great news!

theunknownmuncher@lemmy.world · 10 days ago

You gun down 1 (one) CEO in the street, and Musk starts carrying a 4 year old at chest level everywhere he goes

theunknownmuncher@lemmy.world · 16 days ago

You can overwrite the model by reusing the same name instead of creating one under a new name, if it bothers you. Either way, the LLM model file is not duplicated.

theunknownmuncher@lemmy.world · 17 days ago

What I am talking about is when layers are split across GPUs. I guess this is loading the full model into each GPU to parallelize layers and do batching

theunknownmuncher@lemmy.world · 17 days ago

Agreed.

theunknownmuncher@lemmy.world · 17 days ago

Because they were banned.

I didn’t make this claim, no

theunknownmuncher@lemmy.world · 17 days ago

Because they would just be inactive rather than banned? You’re the one claiming that bans have occurred, which would be a major censorship issue for lemmy…

theunknownmuncher@lemmy.world · edit-2 17 days ago

Can you try setting the num_ctx and num_predict using a Modelfile with ollama? https://github.com/ollama/ollama/blob/main/docs/modelfile.md#parameter
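For what it's worth, here is a minimal sketch of what such a Modelfile could look like, generated from Python so the values are easy to tweak. The base model name `llama3.2`, the values 8192 and 512, and the helper `render_modelfile` are placeholders made up for illustration; only `FROM`, `PARAMETER`, `num_ctx`, and `num_predict` come from the Modelfile docs linked above.

```python
def render_modelfile(base_model: str, params: dict) -> str:
    """Return Modelfile text: one FROM line, then one PARAMETER line per setting."""
    lines = [f"FROM {base_model}"]
    for name, value in params.items():
        lines.append(f"PARAMETER {name} {value}")
    return "\n".join(lines) + "\n"


# Hypothetical values; pick a context length the model actually supports.
modelfile_text = render_modelfile("llama3.2", {"num_ctx": 8192, "num_predict": 512})
print(modelfile_text)
```

Save the printed text as `Modelfile` and build it with `ollama create my-model -f Modelfile`, where `my-model` is whatever name you want to give it.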

theunknownmuncher@lemmy.world · edit-2 17 days ago

Are you using a tiny model (1.5B-7B parameters)? ollama pulls a 4-bit quant by default. It looks like vllm does not use quantized models by default, so this is likely the difference. Tiny models are impacted more by quantization.

I have no problems with changing num_ctx or num_predict

theunknownmuncher@lemmy.world · 17 days ago

Um… modlog is public. Where’s your evidence?

theunknownmuncher@lemmy.world · 17 days ago

Models are computed sequentially (the output of each layer is the input into the next layer in the sequence) so more GPUs do not offer any kind of performance benefit
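A toy latency model of that argument, in case it helps. It assumes the layers are split in order across GPUs and a single request is processed at a time; it deliberately ignores batching and other parallelism schemes. The layer count, per-layer time, hand-off cost, and the function name are all invented for illustration.

```python
def single_request_latency(layer_times_ms, num_gpus, transfer_ms=0.0):
    """Latency of one forward pass when layers are split in order across GPUs.

    Each layer needs the previous layer's output, so the layers run one after
    another no matter which GPU holds them. Splitting only adds hand-off cost.
    """
    handoffs = max(num_gpus - 1, 0)  # one hand-off per GPU boundary
    return sum(layer_times_ms) + handoffs * transfer_ms


layer_times_ms = [2.0] * 32  # hypothetical: 32 layers at 2 ms each
one_gpu = single_request_latency(layer_times_ms, num_gpus=1)
four_gpus = single_request_latency(layer_times_ms, num_gpus=4, transfer_ms=0.5)
print(one_gpu, four_gpus)
```

Under these assumptions the four-GPU split is never faster for one request; what it buys you is room for a model that does not fit on a single card.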

theunknownmuncher@lemmy.world · edit-2 17 days ago

Ummm… did you try /set parameter num_ctx # and /set parameter num_predict #? Are you using a model that actually supports the context length that you desire…?
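If you are calling ollama over its HTTP API rather than the interactive prompt, the same two settings can go in the request's `options` field. A rough sketch that only builds the request body and does not send it; the model name, prompt, values, and the helper `build_generate_request` are assumptions for illustration.

```python
import json


def build_generate_request(model: str, prompt: str, num_ctx: int, num_predict: int) -> str:
    """Build the JSON body for a POST to Ollama's /api/generate endpoint."""
    body = {
        "model": model,
        "prompt": prompt,
        "stream": False,
        # Per-request overrides, same names as the Modelfile parameters.
        "options": {"num_ctx": num_ctx, "num_predict": num_predict},
    }
    return json.dumps(body)


request_body = build_generate_request("llama3.2", "Hello", num_ctx=8192, num_predict=512)
print(request_body)
```

On a default local install the body would be POSTed to `http://localhost:11434/api/generate`; adjust the host and port if yours differ.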

theunknownmuncher@lemmy.world · 18 days ago

Notice how all those bot accounts that were so active leading up to the election have completely vanished from the internet now? Yeah.

theunknownmuncher@lemmy.world · edit-2 19 days ago

It’s not. I can run the 2.51-bit quant.

theunknownmuncher@lemmy.world · 19 days ago

Tell that to my home rig currently running the 671b model…

theunknownmuncher@lemmy.world · edit-2 19 days ago

Hawley’s statement called DeepSeek “a data-harvesting, low-cost AI model that sparked international concern and sent American technology stocks plummeting.”

data-harvesting

???

It runs offline… using open-source software that provably does not collect or transmit any data…

It is low-cost and out-competes American technology, though, true

theunknownmuncher@lemmy.world · 19 days ago

this is deepseek-v3. deepseek-r1 is the model that got all the media hype: https://huggingface.co/deepseek-ai/DeepSeek-R1

theunknownmuncher@lemmy.world · 20 days ago

https://archive.org/details/www.cdc.gov_en_all_novid_2025-01 there’s a zim

theunknownmuncher@lemmy.world · 20 days ago

Trump certainly sounded like he wanted it to be that way with Ivanka

theunknownmuncher@lemmy.world · 3 months ago

Nestlé USA Names Martin Thompson as Chief Executive Officer and U.S. Market Head

theunknownmuncher@lemmy.world · edit-2 4 months ago

Nestlé USA Names Martin Thompson as Chief Executive Officer and U.S. Market Head

That was easy
