GSA's architecture lead for AI services said standardized evaluation helps users identify the right models for their needs.