E-Solutions
USA
This position is within a project with one of the foundational LLM companies. The goal is to assist these foundational LLM companies in enhancing their Large Language Models. One way we help these companies improve their models is by providing them with high-quality proprietary data. This data serves two main purposes: first, as a basis for fine-tuning their models, and second, as an evaluation set to benchmark the performance of their models or competitor models. What does day-to-day look like: • Design multi-turn conversations that simulate real interactions between users and AI assistants using apps like calendar, email, maps, and drive. • Emulate both the user and the assistant, including the assistant's tool calls (only when corrections are needed). • Carefully select when and how the assistant uses available tools, ensuring logical flow and proper usage of function calls. • Craft dialogues that demonstrate natural language, intelligent behavior, and contextual...