appvoid/palmer-005-core
appvoid
AI & ML interests
Recent Activity
Organizations
appvoid/palmer-005-core
Next release will be above most 1b models while being 1/3 the size
Be careful... all I'm saying.
I guess the reason is slow is because llama.cpp is not optimized...
Correct! It's causal modeling (for now) with a char level tokenizer with only 8 tokens.
The model learns by looking for relationships of sequences for a single token, so the only way it learns is literally nudging weights towards a generalized solution using pure sequences.
In short, it learns to learn.
Will the be any app to.. convert the dots to something meaningful?
Not yet, I'm focusing on getting the core right first. But once the model is general enough, I don't see why not. Though you might need to finetune it for your use case.
Correct! It's causal modeling (for now) with a char level tokenizer with only 8 tokens.
The model learns by looking for relationships of sequences for a single token, so the only way it learns is literally nudging weights towards a generalized solution using pure sequences.
In short, it learns to learn.
It's already decent at some tasks, with next version coming in a few weeks.
appvoid/dot
The first model proudly trained from scratch on "physical" reasoning instead of chunky language tokens was published.
if you need raw power though slow, rwkv 0.4b has you covered, if you need something in between choose lfm2 350m
indeed ๐ค
The first project, as far as I know, that focuses purely on few-shot prompting results rather than zero-shot like usually done with decoder-only transformer models. This model excels at few-shot tasks compared to most 0.6b and even bigger models. It also outperforms the base model on some popular language modeling benchmarks.
appvoid/arco-3
Try it yourself!
Do you have your raspberry pis and phones ready for this new model yet?
New model, new architecture, more power:
Since I'm unable to post for about 11 hours, I will post it here: https://huggingface.co/appvoid/arco-3