Experiment - Function calling in LLMs
Problem: You have a large language model (LLM) that can generate text but cannot reliably call external functions for specific tasks such as calculations or data retrieval.
Current Metrics: Function call success rate: 60%, Response accuracy after function call: 70%
Issue: The LLM often generates incorrect or incomplete function call requests, which lowers both the call success rate and the accuracy of responses that depend on those calls.
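
One way to make "incorrect or incomplete function call requests" concrete is to check each model-generated request against a declared tool specification before executing it. The sketch below is only an illustration under several assumptions: the request is a JSON object of the form {"name": ..., "arguments": {...}}, and the TOOLS registry, its function names (calculate, get_weather), and its parameters are all hypothetical. It measures structural validity only, which may not be how the 60% success rate above was measured.

```python
import json

# Hypothetical tool registry: names and parameter types are illustrative only.
TOOLS = {
    "calculate": {"required": {"expression": str}, "optional": {}},
    "get_weather": {"required": {"city": str}, "optional": {"units": str}},
}


def validate_call(raw: str) -> tuple[bool, str]:
    """Check a model-generated call against TOOLS.

    Assumes the call is a JSON object: {"name": ..., "arguments": {...}}.
    Returns (is_valid, reason).
    """
    try:
        call = json.loads(raw)
    except json.JSONDecodeError as exc:
        return False, f"invalid JSON: {exc.msg}"
    if not isinstance(call, dict):
        return False, "call must be a JSON object"

    name = call.get("name")
    if not isinstance(name, str) or name not in TOOLS:
        return False, f"unknown function: {name!r}"

    args = call.get("arguments")
    if not isinstance(args, dict):
        return False, "arguments must be a JSON object"

    spec = TOOLS[name]
    allowed = {**spec["required"], **spec["optional"]}

    # Incomplete request: a required argument is absent.
    missing = sorted(set(spec["required"]) - set(args))
    if missing:
        return False, f"missing arguments: {missing}"

    # Incorrect request: an unknown argument or a value of the wrong type.
    for key, value in args.items():
        if key not in allowed:
            return False, f"unexpected argument: {key!r}"
        if not isinstance(value, allowed[key]):
            return False, f"wrong type for {key!r}"
    return True, "ok"


def success_rate(raw_calls: list[str]) -> float:
    """Fraction of calls that pass validation (0.0 for an empty list)."""
    if not raw_calls:
        return 0.0
    return sum(validate_call(r)[0] for r in raw_calls) / len(raw_calls)
```

Running validate_call on every request before dispatch separates malformed calls from calls that were well formed but produced a wrong final answer. The returned reason string could also be fed back to the model as a retry hint, though whether that improves the metrics above would need to be tested.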