Oh and I typically get 16-20 tok/s running a 32b model on Ollama using Open WebUI. Also I have experienced issues with 4-bit quantization for the K/V cache on some models myself so just FYI
Oh and I typically get 16-20 tok/s running a 32b model on Ollama using Open WebUI. Also I have experienced issues with 4-bit quantization for the K/V cache on some models myself so just FYI
It really depends on how you quantize the model and the K/V cache as well. This is a useful calculator. https://smcleod.net/vram-estimator/ I can comfortably fit most 32b models quantized to 4-bit (usually KVM or IQ4XS) on my 3090’s 24 GB of VRAM with a reasonable context size. If you’re going to be needing a much larger context window to input large documents etc then you’d need to go smaller with the model size (14b, 27b etc) or get a multi GPU set up or something with unified memory and a lot of ram (like the Mac Minis others are mentioning).
This would be a great potential improvement in UX for streaming sports feeds for sure - not having to navigate web pages and start / manage streams manually etc. Does anyone know if this is possible for sites serving these streams like FirstRowSports or StreamEast etc?
Hopefully these improvements will become available to other Nvidia GPU architectures like Ada and Ampere in the future as well.
Yeah I use voyager pretty much exclusively on my iPhone so maybe I should request a feature like that there? Seems like it would be something that many people would appreciate. Not sure why I end up seeing posts with -10, -15 votes… Those are generally trash haha
True; I think I used LineageOS or similar back when I was still in Android but if you’re not in the 0.01% who do have a custom Android OS installed it seems like a privacy focused map app is still of limited use potentially.
This looks like it has come a long way, but since this is a privacy community I have to ask: Realistically, whether you are on iOS or Android, isn’t it likely Google or Apple are still tracking your location much of the time directly from the OS?
Based on the differences in color for each handle it makes me wonder if the one for not washing your hands is a different material. Maybe an antimicrobial metal like a copper alloy.
Definitely read this title and thought “Oh great some positive climate change news” assuming they meant ‘plays’ as in ‘potential courses of action’, but upon clicking the article I realized they meant ‘plays’ as in live theater productions…
As a ‘front page of the internet’ it has been a pretty great replacement for me as it’s where I go each day to just see what’s going on. However, due to the smaller size you do lose a lot of the activity in more niche communities and the sheer volume of posts/comments compared to Reddit. That’s the biggest downside. Still, you also lose the incessant ads/bad UI/UX decisions and ever accelerating late stage capitalism driven enshittification so that’s a big plus.
Tend to start with top (day) for my subs and then switch to scaled once I get down to posts that are below 100 upvotes or so to see more posts from smaller communities that can’t make that ‘top’ cut.
Furiously hammers X to doubt
Not exactly what I was thinking of but still worth a sub!
TBH This might be a good enough idea to merit a whole community of people just posting singular cool screenshots of games they are playing. Could be a cool low-effort visual way to document what folks are into at the moment. Kind of like a visual version of those ‘what are you playing’ weekly threads that used to be everywhere.
Been having fun and happy to see a gameplay balance patch so soon. That said, the technical side of this game is what really needs work and according to everything I’ve been seeing from Digital Foundry and others there’s some serious low hanging fruit that could improve the frame rate and pacing that is still pretty poor on all systems. Hopefully they bring some attention to that side of things soon. Game is certainly playable in the current state in my opinion but would be much more enjoyable if it actually stuck to something close to 60 FPS in most situations on XSX/PS5.
Maybe 1-3 times a day. I find that the newest version of ChatGPT (4o) typically returns answers that are faster and better quality than a search engine inquiry, especially for inquiries that have a bit more conceptualization required or are more bespoke (i.e give me recipes to use up these 3 ingredients etc) so it has replaced search engines for me in those cases.
Just jumping in to say that c/lemmytellyousomething should definitely be the name of an advice / self-help community on here
Okay a quick search tells me this is short for Single-root input/output virtualization right? Can you explain why that would be advantageous in a GPU?
Looks like it now has Docling Content Extraction Support for RAG. Has anyone used Docling much?