[IDEA] Scaling inference-time with complexity

Smorty [she/her] · 5 months ago

The Hobbyist@lemmy.zip · 5 months ago

What you are describing on a high level is what o1 does. But where you are mistaken is when you say:

This thought is not human-interpretable, but it is much more efficient than the pre-output reasoning tokens of o1, which uses human language to fill its own context window with.

What makes those reasoning tokens more efficient? They are just tokens, like all the others, and equally complex/simple to generate. Yes, they allow for more reflection before an output is presented, but the process is the same.

Also, they would all need to fit in the same context, because otherwise you would prevent the model from actually reasoning on them while it iterates on its thoughts.
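
To put rough numbers on both points, here is a minimal sketch, assuming a plain autoregressive model where every generated token costs one forward pass. The function name, the token counts, and the context limit are made up for illustration, and the sketch ignores prompt processing and caching.

```python
def generation_cost(prompt_tokens, reasoning_tokens, answer_tokens, context_limit):
    """Return (forward_passes, fits_in_context) for one generation.

    Assumption: one forward pass per generated token, whether it is a
    reasoning token or an answer token.
    """
    # Reasoning tokens are generated the same way as answer tokens.
    forward_passes = reasoning_tokens + answer_tokens
    # Everything has to sit in the same context window to be attended to.
    total_tokens = prompt_tokens + reasoning_tokens + answer_tokens
    return forward_passes, total_tokens <= context_limit


# Hypothetical numbers: same prompt and answer, two reasoning lengths.
short = generation_cost(100, 500, 50, context_limit=1024)
long = generation_cost(100, 1000, 50, context_limit=1024)
```

Under these assumptions, the kind of token does not change the per-step cost. Only the number of steps and the total length matter.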

Smorty [she/her] · 5 months ago

I imagine that a model would be held back by the format of human readable text.

Human text uses some concepts that are mostly unimportant to an AI, sentence syntax and grammar rules being examples. I think that letting the AI “define its own way of thinking” instead of telling it to think in human language would lead to more efficient thought processes. It would be similar to embeddings: a bunch of numbers representing a specific topic in these tokens. Not human-readable, but useful for the model.
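
A toy sketch of the difference, under heavy assumptions: `forward` is a made-up stand-in for one model step, and the two-dimensional embeddings are hypothetical. It is not how o1 or any real model works. It only shows that thinking in text squeezes each step through a discrete token, while thinking in vectors carries the full state forward.

```python
import math

# Hypothetical 2-dimensional embeddings for a tiny vocabulary.
EMBED = {"yes": [1.0, 0.0], "no": [0.0, 1.0], "maybe": [0.7, 0.7]}


def forward(x):
    """Stand-in for one model step: maps an input vector to a hidden vector."""
    a, b = x
    return [math.tanh(0.9 * a + 0.3 * b), math.tanh(0.2 * a + 0.8 * b)]


def nearest_token(h):
    """Decode a hidden vector to the token whose embedding has the largest dot product."""
    return max(EMBED, key=lambda t: sum(hi * ei for hi, ei in zip(h, EMBED[t])))


def think_in_text(x, steps):
    """Human-readable reasoning: every step is decoded to a token and re-embedded."""
    trace = []
    for _ in range(steps):
        h = forward(x)
        token = nearest_token(h)
        trace.append(token)
        x = EMBED[token]  # only the chosen token's embedding is carried forward
    return trace, x


def think_in_latent(x, steps):
    """Latent reasoning: the hidden vector is fed back in without decoding."""
    for _ in range(steps):
        x = forward(x)  # the full vector is carried forward, unreadable to humans
    return x


start = EMBED["maybe"]
text_trace, text_state = think_in_text(start, steps=3)
latent_state = think_in_latent(start, steps=3)
```

Both loops call `forward` the same number of times, so the per-step cost is identical in this sketch. What differs is how much information survives between steps.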

As far as I know, o1 writes a big document on what it will do, how it will do it, and some reflection as well. My approach, however, would allow the model to think of things on the fly while it is writing the text.

You are right in that it would have to fit into the context window. As far as I can tell, the output from the o1 model doesn’t remember what the big thought document says. With my approach, the model would keep all its thoughts in mind while it is writing, as they are literally part of its message, just unreadable by humans.
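
One way to picture this is a context that interleaves readable tokens with opaque thought vectors. This is a sketch only: the class names, the example vectors, and the limit are hypothetical. The model would see every item, while a human would only see the text.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass
class TextToken:
    text: str  # shown to the reader


@dataclass
class ThoughtToken:
    vector: List[float]  # opaque to humans, kept in context for the model


ContextItem = Union[TextToken, ThoughtToken]


def visible_text(context: List[ContextItem]) -> str:
    """What a human reader sees: the text tokens only."""
    return " ".join(item.text for item in context if isinstance(item, TextToken))


def fits(context: List[ContextItem], limit: int) -> bool:
    """Thoughts count against the context window just like text tokens."""
    return len(context) <= limit


# Hypothetical message with thoughts mixed in while writing.
message = [
    TextToken("The"),
    ThoughtToken([0.1, -0.4]),
    TextToken("answer"),
    ThoughtToken([0.9, 0.2]),
    TextToken("is"),
    TextToken("42"),
]
shown = visible_text(message)
```

Here the four visible words take up six context slots, which is the price of keeping the thoughts around.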

Am I missing something here? If so, please point it out.
