LLMs develop their own understanding of reality as their language abilities improve

Hackworth@lemmy.world · 9 days ago

It’s probably a vision model (like this) with custom instructions that direct it to focus on those factors. It’d be interesting to see the instructions.

Hackworth@lemmy.world · 14 days ago

To help blind drivers, no. To help AI, yes.

Hackworth@lemmy.world · edit-2 27 days ago

I think it’s more likely a compound sigmoid (don’t Google that). LLMs are composed of distinct technologies working together. Each time scaling has hit its inflection point for one of them, we’ve pivoted implementations to get back on track. Notably, context windows are no longer an issue. But the most recent pivot came just this week, allowing for a huge jump in performance. There are more promising stepping stones coming into view. Is the exponential curve just a series of sigmoids stacked too close together? In any case, the article’s correct - just adding more compute to the same exact implementation hasn’t enabled exponential scaling.
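
A rough sketch of the stacked-sigmoid idea, in Python. Everything in it is an illustrative assumption rather than anything from the article: the names `logistic` and `stacked_sigmoids`, the choice of 12 curves, and each curve being twice as tall as the one before it. Each curve saturates on its own, but the sum grows by a roughly constant factor per step, which is what an exponential looks like from a distance.

```python
import math


def logistic(x: float) -> float:
    """Numerically stable logistic function; saturates at 0 and 1."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def stacked_sigmoids(t, steps=12, spacing=1.0, ratio=2.0, steepness=6.0):
    """Sum of `steps` logistic curves (all parameters are hypothetical).

    Curve k is centered at k * spacing and has height ratio ** k, so each
    new "technology" plateaus higher than the one before it.
    """
    return sum(
        ratio ** k * logistic(steepness * (t - k * spacing))
        for k in range(steps)
    )


if __name__ == "__main__":
    # Print the value and the growth factor over one step of spacing.
    for t in range(3, 9):
        growth = stacked_sigmoids(t + 1) / stacked_sigmoids(t)
        print(f"t={t}: value={stacked_sigmoids(t):9.2f}, growth per step={growth:.3f}")
```

Away from the first and last curves, the growth factor stays close to `ratio`. Once the last sigmoid saturates, the sum flattens, which corresponds to running out of pivots.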

Hackworth@lemmy.world · 1 month ago

There used to be very real hardware reasons that upload had much lower bandwidth. I have no idea if there still are.

Hackworth@lemmy.world · 1 month ago

https://www.unitree.com/g1

Hackworth@lemmy.world · 1 month ago

Yeah, but they encourage confining it to a virtual machine with limited access.

Hackworth@lemmy.world · 1 month ago

Logic and Path-finding?

Hackworth@lemmy.world · 2 months ago

Shithole country.

Hackworth@lemmy.world · edit-2 2 months ago

Yeah, using image recognition on a screenshot of the desktop and directing a mouse around the screen with coordinates is definitely an intermediate implementation. Open Interpreter, Shell-GPT, LLM-Shell, and DemandGen make a little more sense to me for anything that can currently be done from a CLI, but I’ve never actually tested em.

Hackworth@lemmy.world · edit-2 2 months ago

I was watching users test this out and am generally impressed. At one point, Claude tried to open Firefox, but it was not responding. So it killed the process from the console and restarted. A small thing, but not something I would have expected it to overcome this early. It’s clearly not ready for prime time (by their repeated warnings), but I’m happy to see these capabilities finally making it to a foundation model’s API. It’ll be interesting to see how much remains of GUIs (or high level programming languages for that matter) if/when AI can reliably translate common language to hardware behavior.

Hackworth@lemmy.world · 2 months ago

Can I blame Trump on 9/11 or something?

Hackworth@lemmy.world · 2 months ago

Aren’t they in Macy’s now? Wait, is Macy’s still a thing?

Hackworth@lemmy.world · 2 months ago

The next generation?

Hackworth@lemmy.world · edit-2 2 months ago

In its latest audit of 10 leading chatbots, compiled in September, NewsGuard found that AI will repeat misinformation 18% of the time

70% of the instances where AI repeated falsehoods were in response to bad actor prompts, as opposed to leading prompts or innocent user prompts.
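
For scale, a back-of-the-envelope calculation in Python. It assumes the 18% figure is measured over all responses and the 70% figure over the responses that repeated a falsehood. The variable names are mine, not NewsGuard's.

```python
# Figures quoted above.
repeat_rate = 0.18        # share of all responses that repeated misinformation
bad_actor_share = 0.70    # share of those repeats that came from bad actor prompts

# Assumption: both percentages refer to the same pool of responses.
bad_actor_repeats = repeat_rate * bad_actor_share
other_repeats = repeat_rate * (1.0 - bad_actor_share)

print(f"Repeats from bad actor prompts: {bad_actor_repeats:.1%} of all responses")
print(f"Repeats from leading or innocent prompts: {other_repeats:.1%} of all responses")
```

Under that reading, roughly 12.6% of all responses repeated a falsehood for a bad actor prompt, and roughly 5.4% did so for a leading or innocent one.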

Hackworth@lemmy.world · 2 months ago

To be clear, it’ll be 10-30 years before AI displaces all human jobs.

Hackworth@lemmy.world · 3 months ago

an eight-year-old girl was among those killed

Hackworth@lemmy.world · edit-2 3 months ago

Calling what attention transformers do memorization is wildly inaccurate.

*Unless we’re talking about semantic memory.

Hackworth@lemmy.world · 3 months ago

It honestly blows my mind that people look at a neural network that’s even capable of recreating short works it was trained on without having access to that text during generation… and choose to focus on IP law.

Hackworth@lemmy.world · 3 months ago

The issue is that next to the transformed output, the not-transformed input is in use in a commercial product.

Are you only talking about the word repetition glitch?

Hackworth@lemmy.world · 3 months ago

How do you imagine those works are used?

Hackworth@lemmy.world · edit-2 4 months ago

LLMs develop their own understanding of reality as their language abilities improve

Hackworth@lemmy.world · 4 months ago

A.I. groks 66%-76% faster with data augmentation strategies.

Hackworth@lemmy.world · 5 months ago

Posit: In the future, generative A.I. will be thought of as the unconscious part of a general A.I.'s mind.

Hackworth@lemmy.world · edit-2 6 months ago

The Future of Large Language Model Pre-training is Federated