A data-augmented model routing framework for efficient LLM deployment in edge–cloud environments

Publication
Journal of Supercomputing