Topics created by drikanis@mstdn.ca | Postcall.pub

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

D

drikanis@mstdn.ca

@drikanis@mstdn.ca

0

Topics

D

i'm really just... totally done with work.
Watching Ignoring Scheduled Pinned Locked Moved Uncategorized
2

0 Votes

2 Posts

0 Views

D

the trash keeps piling up and rotting, promising younger colleagues end up leaving for greener pastures, and i'm stuck here pulling levers to help maintain the illusion that this bloated corpse of a company is still worth a damn.
D

So now "Copilot" in the context of Microsoft can mean:
Watching Ignoring Scheduled Pinned Locked Moved Uncategorized
1

0 Votes

1 Posts

0 Views

No one has replied
D

yay more mandatory training on AI... /s
Watching Ignoring Scheduled Pinned Locked Moved Uncategorized
4

0 Votes

4 Posts

0 Views

D

apparently the metrics used to evaluate llm-based systems don't come from anything grounded in reality. they just pass the prompt and response pairs to an llm and ask it to evaluate them. usually the llm doing the evaluation is the same one being evaluated.so much of this feels entirely unscientific. engineers are treating llms as these magical infallible black boxes without understanding their specific strengths and limitations. it's the ultimate hammer and now literally every problem looks like a nail.it's also very comical that the examples they are using in the lab produce incredibly generic and horribly useless responses but the llm is scoring them very high in all metrics.