[Paper] Cross-Task Benchmarking and Evaluation of General-Purpose and Code-Specific Large Language Models
Large Language Models (LLMs) have revolutionized both general natural language processing and domain-specific applications such as code synthesis, legal reasoni...