vluz

vluz@kbin.social · 6 months ago

Messing around with system python/pip and newly installed versions till all was broken and then looking at documentation.
This was way back on the 00’s and I’m still ashamed on how fast completely I messed it up.

vluz@kbin.social · 7 months ago

Just figured out there are 10 places called Lisbon dotted around the US, according to the search.

vluz@kbin.social · 8 months ago

Me with four open cli terminals righ now:
https://i.kym-cdn.com/photos/images/original/001/617/650/91a.jpg

vluz@kbin.social · 9 months ago

Got one more for you: https://gossip.ink/
I use it via a docker/podman container I’ve made for it: https://hub.docker.com/repository/docker/vluz/node-umi-gossip-run/general

vluz@kbin.social · 10 months ago

I got cancelled too and chose Hetzner instead. Will not do business with a company that can’t get their filters working decently.

vluz@kbin.social · 11 months ago

Not close enough for V.A.T.S.

vluz@kbin.social · 1 year ago

Lovely! I’ll go read the code as soon as I have some coffee.

vluz@kbin.social · 1 year ago

I do SDXL generation in 4GB at extreme expense of speed, by using a number of memory optimizations.
I’ve done this kind of stuff since SD 1.4, for the fun of it. I like to see how low I can push vram use.

SDXL takes around 3 to 4 minutes per generation including refiner but it works within constraints.
Graphics cards used are hilariously bad for the task, a 1050ti with 4GB and a 1060 with 3GB vram.

Have an implementation running on the 3GB card, inside a podman container, with no ram offloading, 1 vcpu and 4GB ram.
Graphical UI (streamlit) run on a laptop outside of server to save resources.

Working on a example implementation of SDXL as we speak and also working on SDXL generation on mobile.
That is the reason I’ve looked into this news, SSD-1B might be a good candidate for my dumb experiments.

vluz@kbin.social · 1 year ago

Oh my Gwyn, this comment section is just amazing.

vluz@kbin.social · 1 year ago

Goddammit! Don’t tell that one, I use it to impress random people at parties.

vluz@kbin.social · 1 year ago

Not joking, although I understand it seems very silly at face value.
Dark Souls 3 PvP specifically SL60+6 at gank town (after pontiff).
It used to be my go-to wind down after a work day.
It made me smile and actually relaxed me enough to go to bed and sleep, especially after a hard day.

vluz@kbin.social · 1 year ago

HateLLM will be a smash. /s

vluz@kbin.social · 1 year ago

Well done!

vluz@kbin.social · 1 year ago

That’s wonderful to know! Thank you again.
I’ll follow your instructions, this implementation is exactly what I was looking for.

vluz@kbin.social · edit-2 1 year ago

Absolutely stellar write up. Thank you!

I have a couple of questions.
Imagine I have a powerful consumer gpu card to trow at this solution, 4090ti for the sake of example.
- How many containers can share one physical card, taking into account total vram memory will not be exceeded?
- How does one virtual gpu look like in the container? Can I run standard stuff like PyTorch, Tensorflow, and CUDA stuff in general?

vluz@kbin.social · edit-2 1 year ago

Oof, pop-culture references are hard and I had not considered that at all.
Thanks for the examples, I’ll have a think on how to deal with those.

My only insight is one you already had.
Test at least the comment before, and then use the output to dampen or amplify the final result.
Sorry for being no help at all.

--

My project is very basic but I’ll post it here for any insight you might get out of it.
I teach Python in a variety of settings and this is part of a class.

The data used is from Kaggle: https://www.kaggle.com/competitions/jigsaw-toxic-comment-classification-challenge/
The original data came from Wikipedia toxic comments dataset.
There is code too from several users, very helpful for some insight into the problem.

Data is dirty and needs clean up so I’ve done so and posted result on HF here:
https://huggingface.co/datasets/vluz/Tox

Model is a very basic TensorFlow implementation intended for teaching TF basics.
https://github.com/vluz/ToxTest
Some of the helper scripts are very wonky, need fixing before I present this in class.

Here are my weights after 30 epochs:
https://huggingface.co/vluz/toxmodel30

And here is it running on a HF space:
https://huggingface.co/spaces/vluz/Tox

vluz@kbin.social · edit-2 1 year ago

deleted by creator

vluz@kbin.social · edit-2 1 year ago

While designing a similar classifier, I’ve considered the idea of giving it the whole thread as “context” of sorts.
Not just the parent comment, the whole thread up to original post.

I’ve abandoned the idea.
A comment must stand on it’s own, and it would put limits on results, the way I was planning to do it.
I might be very wrong, your insight into this would be very helpful.

My original idea was to go recursively trough the thread and test each comment individually.
Then I would influence the actual comment results with the combined results of it’s parents.
No context during inference, just one comment at a time.

For example consider thread OP->C1->C2->C3.
My current model takes milliseconds per test with little resources used.
It would be ok up to very large threads but would contain a limit to save on answer time.
I want to determine if Comment 3 is toxic in the context of C2, C1, and OP.
Test C3, test C2, test C1, test OP. Save results.
My current model gives answer in several fields (“toxic”, “severe toxic”, “obscene”, “threat”, “insult”, and “identity hate”)
The idea was to then combine the results of each into a final result for C3.

How to combine? Haven’t figure it out but it would be results manipulation instead of inference/context, etc.

Edit: Is there any way you can point me at examples difficult to classify? It would be a nice real world test to my stuff.
Current iteration of model is very new and has not been tested in the wild.

vluz@kbin.social · 1 year ago

Don’t trust brave, never will.

vluz@kbin.social · 1 year ago

Just pip install mscandy -U