Hacker News

sebg
The Prompt() Function: Use the Power of LLMs with SQL motherduck.com

delichon5 hours ago

  FROM hn.hacker_news
  LIMIT 100
"Oops I forgot the limit clause and now owe MotherDuck and OpenAI $93 billion."

domoritz5 hours ago

I love the simplicity of this. Hurray for small models for small tasks.

korkybuchek5 hours ago

Interesting -- is there any impact from LLM outputs not being deterministic?

drdaeman5 hours ago

SQL functions can be non-deterministic just fine. E.g. SQL:2003 grammar defines DETERMINISTIC | NOT DETERMINISTIC characteristic for CREATE FUNCTION. Or, e.g. PostgreSQL has IMMUTABLE | STABLE | VOLATILE clauses.

korkybuchek4 hours ago

Nice, TIL. Thanks!

xnx5 hours ago

Aren't LLM outputs deterministic given the same inputs?

simonw3 hours ago

Not at all. Even the ones that provide a "seed" parameter don't generally 100% guarantee you'll get back the same result.

My understanding is that this is mainly down to how floating point arithmetic works. Any performant LLM will be executing a whole bunch of floating point arithmetic in parallel (usually on a GPU) - and that means that the order in which those operations finish can very slightly affect the result.

darkteflonan hour ago

Funny wrinkle here: unless I’ve misread the OpenAI API docs[1], the recently added prompt caching feature cannot be explicitly disabled and automatically applies to all input prompts over 1024 tokens for a ~few minutes.

It seems to be possible to work around it by mixing up the very start of your prompt (e.g., with an iteration number), but it’s messed up some of our workflows which rely on running multiple hits with the same prompt to gather a consensus output.

Would be great if they let us disable it.

[1]: https://platform.openai.com/docs/guides/prompt-caching

korkybuchek4 hours ago

They are not, necessarily. Especially when using commercial providers who may change models, finetunes, privacy layers, and all kinds of other non-foundational-model things without notice.

hn-front (c) 2024 voximity
source