PyPI Supply Chain Attack Hits LiteLLM, Google Launches Gemini API Pricing Tiers, OpenAI Codex Goes Pay-As-You-Go — AI Evening Update for April 4, 2026

Supply Chain API Pricing
KEY POINTS

PyPI Supply Chain Attack Hits LiteLLM, Gemini API Pricing Tiers Land, OpenAI Codex Goes Pay-As-You-Go

  • PyPI + LiteLLM — Malicious package injects credential-harvesting code; affected users must rotate API keys.
  • Gemini API — Published tiers: 2.5 Flash-Lite $0.10 input / $0.40 output; Flash $0.30/$2.50; 3.1 Flash-Lite Preview $0.25/$1.50; 3.1 Pro Preview $2–$4 / $12–$18 per million tokens. Batch API typically ~50% off.
  • OpenAI Codex — Pay-as-you-go removes upfront commitment; opens Codex to smaller teams.
  • Theme — Supply chain security and API monetization evolve in parallel as AI tooling scales.

April 4, 2026 evening highlights three stories: a PyPI supply chain attack targeting LiteLLM, new Gemini API pricing tiers, and OpenAI Codex moving to pay-as-you-go.

PyPI Supply Chain Attack Hits LiteLLM

Malicious package highlights AI-tooling supply chain fragility

Security Community · April 4, 2026

A supply chain attack compromised LiteLLM via a malicious PyPI package, injecting credential-harvesting code into downstream installations. Security teams issued mitigation guidance and advised affected users to rotate API keys for all providers including Anthropic ($3/$15 per million tokens on Sonnet 4.6), OpenAI, and Google (Gemini 2.5 Flash at $0.30/$2.50).

Tech Analysis

AI developer tooling depends on deep chains of open-source packages. Expect signed, reproducible builds, tighter dependency pinning, and runtime policy layers like Vercel’s deploy-time guardrails to become baseline.


Google Launches Gemini API Pricing Tiers

Differentiated cost-performance bands for every workload profile

Google · April 4, 2026

Google laid out Gemini API pricing with distinct cost-performance bands. Documented rates per million tokens: Gemini 3.1 Pro Preview input $2.00 (≤200k tokens) or $4.00 (>200k), output $12.00–$18.00; Gemini 3.1 Flash-Lite Preview input $0.25, output $1.50; Gemini 2.5 Flash input $0.30, output $2.50; Gemini 2.5 Flash-Lite input $0.10, output $0.40. Batch API typically reduces costs by roughly 50%. Free-tier Google Search Grounding allows 500 requests per day shared across Flash and Flash-Lite.

Tech Analysis

Tiered pricing reflects enterprise maturity: production workloads need SLA guarantees; background analytics tolerate variable latency. Anthropic and OpenAI should be expected to respond with similar priority tiers beyond Anthropic’s existing $3/$15 Sonnet 4.6 rate.


OpenAI Codex Goes Pay-As-You-Go

Removing upfront commitment opens Codex to smaller teams

OpenAI · April 4, 2026

OpenAI moved Codex to pay-as-you-go pricing, removing the upfront commitment that had limited adoption among smaller teams. Developers can now access Codex on standard API terms, competing directly with Cursor Composer 2 ($0.50–$1.50 per million input tokens) and GitHub Copilot Pro ($20/month) among small and mid-sized teams.

Tech Analysis

Pay-as-you-go lowers the friction for CI/CD integration experiments and pressures both Claude Code and Cursor at the entry tier. Expect coding-agent pricing to converge toward a low-floor, high-ceiling structure tied to agent task duration.

Related

Sources

AI Biz Insider · AI Trends · aibizinsider.com


AI Biz Insider에서 더 알아보기

구독을 신청하면 최신 게시물을 이메일로 받아볼 수 있습니다.

AI Biz Insider에서 더 알아보기

지금 구독하여 계속 읽고 전체 아카이브에 액세스하세요.

계속 읽기

AI Biz Insider에서 더 알아보기

지금 구독하여 계속 읽고 전체 아카이브에 액세스하세요.

계속 읽기