I/O-aware LLM inference reduces latency on GPUs by optimizing CPU-GPU interactions
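The core idea behind I/O-aware inference is to keep the GPU busy while parameters or activations move over PCIe. Below is a minimal sketch assuming a PyTorch-style setup: pinned host memory and a dedicated CUDA copy stream let the next layer's weights transfer while the current layer computes. The function name and flat layer structure are illustrative assumptions, not the technique described in the linked post.

```python
import torch

def overlapped_forward(layers_cpu, x):
    """Apply a stack of linear weights, prefetching each layer's weights
    from pinned CPU memory on a separate CUDA stream so the copy of
    layer i+1 overlaps the matmul of layer i."""
    copy_stream = torch.cuda.Stream()
    layers_cpu = [w.pin_memory() for w in layers_cpu]  # enable async copies
    with torch.cuda.stream(copy_stream):
        next_w = layers_cpu[0].to("cuda", non_blocking=True)
    for i in range(len(layers_cpu)):
        torch.cuda.current_stream().wait_stream(copy_stream)  # copy i done
        w = next_w
        if i + 1 < len(layers_cpu):
            with torch.cuda.stream(copy_stream):  # start prefetching layer i+1
                next_w = layers_cpu[i + 1].to("cuda", non_blocking=True)
        x = x @ w  # this matmul overlaps the in-flight copy
    return x

x = torch.randn(4, 256, device="cuda")
layers = [torch.randn(256, 256) for _ in range(8)]
print(overlapped_forward(layers, x).shape)  # torch.Size([4, 256])
```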
LLMs are driving important advances in research and development today. There has been a significant shift in research objectives and ...
Implementation of speculative and contrastive decoding. Large language models are composed of billions of parameters (weights). For each word they generate, ...
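As background for the speculative-decoding half of that post: a small draft model proposes several tokens cheaply, and the large target model verifies them all in a single forward pass, accepting the longest agreeing prefix. The sketch below is a greedy toy version; the `target`/`draft` callables and the per-token stand-in models are assumptions for illustration, not the post's implementation.

```python
import torch

@torch.no_grad()
def speculative_decode(target, draft, ids, k=4, max_new=16):
    """Greedy speculative decoding sketch. `target` and `draft` map a
    (1, T) token tensor to (1, T, vocab) logits."""
    end = ids.shape[1] + max_new
    while ids.shape[1] < end:
        # 1) The cheap draft model proposes k tokens autoregressively.
        prop = ids
        for _ in range(k):
            nxt = draft(prop)[:, -1].argmax(-1, keepdim=True)
            prop = torch.cat([prop, nxt], dim=-1)
        # 2) The large target model scores all k proposals in ONE pass.
        tgt = target(prop)[:, ids.shape[1] - 1 : -1].argmax(-1)
        drafted = prop[:, ids.shape[1]:]
        # 3) Accept the longest agreeing prefix; on a mismatch the
        #    target's own token at that position comes "for free".
        n_ok = int((tgt == drafted).long().cumprod(-1).sum())
        accept = drafted[:, :n_ok]
        if n_ok < k:
            accept = torch.cat([accept, tgt[:, n_ok : n_ok + 1]], dim=-1)
        ids = torch.cat([ids, accept], dim=-1)
    return ids

# Toy per-token "models" just to make the sketch runnable end to end.
V = 50
torch.manual_seed(0)
E, Wt, Wd = torch.randn(V, V), torch.randn(V, V), torch.randn(V, V)
target = lambda x: E[x] @ Wt
draft = lambda x: E[x] @ Wd
print(speculative_decode(target, draft, torch.tensor([[1, 2, 3]])).shape)
```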
Medprompt, a runtime steering strategy, demonstrates the potential to guide general-purpose LLMs to achieve cutting-edge performance in specialized domains such ...
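One of Medprompt's components is a choice-shuffling ensemble: the same multiple-choice question is asked several times with the answer options reordered, and the final answer is a majority vote mapped back to the original option indices. A minimal sketch, where `ask_model` is a hypothetical stand-in for the actual LLM call:

```python
import random
from collections import Counter

def choice_shuffle_ensemble(ask_model, question, options, n_votes=5, seed=0):
    """Majority-vote over several shuffled presentations of the options."""
    rng = random.Random(seed)
    votes = []
    for _ in range(n_votes):
        order = list(range(len(options)))
        rng.shuffle(order)
        shuffled = [options[i] for i in order]
        picked = ask_model(question, shuffled)  # index into `shuffled`
        votes.append(order[picked])             # map back to original index
    return Counter(votes).most_common(1)[0][0]

# Toy usage with a fake model that always picks a particular option.
demo = lambda q, opts: opts.index("aspirin")
best = choice_shuffle_ensemble(demo, "Which drug ...?",
                               ["aspirin", "ibuprofen", "placebo"])
print(best)  # 0, the original index of "aspirin"
```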
Today at AWS re:Invent 2024, we are excited to announce a new feature for Amazon SageMaker inference endpoints: the ability ...
This post is co-written with Abhishek Sawarkar, Eliuth Triana, Jiahong Liu and Kshitiz Gupta from NVIDIA. At re:Invent 2024, we ...
The new efficient multi-adapter inference feature of Amazon SageMaker unlocks exciting possibilities for customers using fine-tuned models. This capability integrates ...
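Multi-adapter inference typically pairs one frozen base model with many small LoRA adapters, selected per request, so the expensive base computation is shared across tenants. Below is a minimal sketch of that idea for a single linear layer, with made-up shapes and an illustrative adapter registry, not the SageMaker implementation:

```python
import torch

d, r = 64, 8
base_W = torch.randn(d, d)  # frozen base weight, shared by everyone
adapters = {                # adapter_id -> low-rank (A, B) pair
    "customer-a": (torch.randn(d, r) * 0.01, torch.randn(r, d) * 0.01),
    "customer-b": (torch.randn(d, r) * 0.01, torch.randn(r, d) * 0.01),
}

def forward(x, adapter_id, alpha=16):
    """LoRA-style forward: y = x W + (alpha/r) x A B. Only the cheap
    low-rank update differs per adapter; the base path is shared."""
    A, B = adapters[adapter_id]
    return x @ base_W + (alpha / r) * (x @ A @ B)

x = torch.randn(2, d)
print(forward(x, "customer-a").shape)  # torch.Size([2, 64])
```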
The rapid growth in the size of AI models has brought with it significant computational and environmental challenges. Deep learning ...
As demand for generative AI continues to grow, developers and enterprises are looking for more flexible, cost-effective, and powerful accelerators ...
This paper was accepted into the Efficient Natural Speech and Language Processing (ENLSP) Workshop at NeurIPS 2024. Tensor parallelism provides ...
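For context, tensor parallelism shards a layer's weight matrix across devices so each device computes only a slice of the output. The sketch below simulates a column-parallel linear layer on one machine, with `torch.cat` standing in for the all-gather collective; it illustrates the baseline scheme, not the paper's contribution.

```python
import torch

def column_parallel_linear(x, W, n_dev=4):
    """Split W column-wise across n_dev 'devices' (list entries here);
    each shard computes a slice of the output, then the slices are
    concatenated, which stands in for an all-gather."""
    shards = W.chunk(n_dev, dim=1)     # each device holds d_out/n_dev columns
    partial = [x @ w for w in shards]  # runs in parallel on real hardware
    return torch.cat(partial, dim=-1)

x, W = torch.randn(2, 64), torch.randn(64, 128)
assert torch.allclose(column_parallel_linear(x, W), x @ W, atol=1e-5)
```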
The use of large language models (LLMs) has revolutionized artificial intelligence applications, enabling advances in natural language processing tasks such ...