Current world of software development doesnt make much sense to me. I started this journey when I was in a dark state of mind. Anxiety and depression were on my back 24/7. Then 8 years ago I decided to switch professions - I got an internship position. I got to learn everything from scratch - the feeling of learning was really exuberant for me. Internship was with a web agency that made ecommerce
When you have 5 unrelated questions, should you pack them into one message to the LLM, or send 5 requests simultaneously? Which is faster? Splitting into multiple independent parallel requests is almost always faster. This isn't a gut feeling — it's determined by the underlying inference mechanism of LLMs. Let's walk through the reasoning from first principles. To understand this problem, you firs