The move will help enterprises reduce inference costs and improve efficiency as they scale AI applications in production, ...