adam bien's blog

lightmetal: GPU LLM Inference From a Single Java 25 JAR 📎

GPU LLM inference on Apple Silicon, packaged as one Java 25 executable JAR, zero dependencies. lightmetal binds a Metal-enabled libllama.dylib through the Foreign Function & Memory API and runs Mistral- and Gemma-architecture GGUF models locally.

Build it with zb, point it at a GGUF, prompt it:

zb build
java --enable-native-access=ALL-UNNAMED -jar zbo/lightmetal.jar \
     -model ~/models/Mistral-Medium-3.5-128B-UD-Q5_K_XL-00001-of-00003.gguf \
     -prompt "What is Java?"

Add -serve and the same JAR exposes an Anthropic-compatible POST /v1/messages and an OpenAI-compatible POST /v1/chat/completions. xisting clients (zsmith, vibe) only need a base URL switch — the loaded GGUF wins, the model field is accepted and ignored.

Embedding into another Java app needs no compile-time dependency. lightmetal.jar registers a BinaryOperator via META-INF/services:

var generator = ServiceLoader.load(BinaryOperator.class).iterator().next();
var response  = generator.apply("/path/to/model.gguf", "What is Java?");

Just Java 25, llama.cpp, FFM, Metal — and a GGUF on disk.

Reflection in Java 25, Java vs. AI Careers, jfrdoc on zSmith, airails.dev Refactoring--Questions and Topics for the 147th airhacks.tv 📎

Questions and topics for the 2026.07/147th edition of airhacks.tv:

In the Java 25 / JUnit 5 era, is reflection obsolete, or only for designing libraries/APIs rather than your own code? (Simon Richter)
Should I continue building my career in Java, or move toward AI given the current market and MNC layoffs? (Fanib)
jfrdoc on zSmith (Rıdvan)
airails.dev refactoring (java-conventions)
Time machine: 100 episodes back (47th episode):
Database Authentication: SSH vs. username / password, Microservices with JSF frontend, How to modularize WARs, Dealing with denormalized databases, Java EE authentication (Active Directory), Identity preservation and audits in DB, JAX-RS authentication and principal delegation to EJB / CDI, Development in intranet environment, Reducing coupling between JavaScript components, Developers and operations -- their roles in the future, DeltaSpike project review, Impact of 3rd party libraries on build performance, Unpredictable, long running transactions, Propagating principals from JAX-RS to EJBs, Why it is a bad idea to resend a password on each request?, Dealing with security in credit card processing

Any questions left? Ask now: gist.github.com/AdamBien/33ece0569d45516d95163859aaaf8e1d and get the answers at the next airhacks.tv. Some questions are also answered with a short video: 60 seconds or less with Java

Ask questions during the show via twitter mentioning me: https://twitter.com/AdamBien (@AdamBien),using the hashtag: #airhacks or built-in chat at: airhacks.tv. You can join the Q&A session live each first Monday of month, 8 P.M at airhacks.tv

JAZ, Copilot SDK, and Why LLMs Write Better Java--airhacks.fm podcast 📎

Subscribe to airhacks.fm podcast via: spotify| iTunes

The #400 airhacks.fm episode with Bruno Borges (@brunoborges) about:

JVM tuning for containers with JAZ, building agentic systems with the Copilot SDK and the Microsoft Agent Framework, and grounding LLMs against Java specifications for hallucination-free code generation.

is available for download.

Summer of 26: Events, Conferences and Workshops 📎

Devoxx Poland 2026: Token-Efficient, Well-Crafted Java #livecoding
conference talk Devoxx Poland 2026 Krakow, Poland 18 June 2026
https://devoxx.pl/talk?id=15451
VibeKode Conference: How To Write Great Code with LLMs #vibeless
conference talk VibeKode Conference Munich, Germany 24 June 2026
https://vibekode.it/speaker/adam-bien/
Spec-Driven Java Development at LLM Speed[online event]
online workshop 2 July 2026
https://workshops.adam-bien.com
LLM-Assisted Web Components: No Frameworks, No Dependencies. Just Web Standards[online event]
online workshop 16 July 2026
https://workshops.adam-bien.com
Entwickler Summit 2026: Fewer Tokens, Better Apps: With Standards and Java
conference talk Entwickler Summit 2026 Berlin, Germany 17 September 2026
https://entwickler.de/entwickler-summit/
airhacks.tv Questions and Answers #livestream[online event]
live streaming show first monday of the month, 8pm CET
https://www.meetup.com/airhacks/

The in-person edition of airhacks is back at MUC Airport: "Architect-Grade Java with LLMs" airhacks.university

GlassFish, Corretto, Apple openJDK and Why Standards Beat Hype--airhacks.fm podcast 📎

Subscribe to airhacks.fm podcast via: spotify| iTunes

The #399 airhacks.fm episode with Arun Gupta (@arungupta) about:

RMI/CORBA, J2EE, GlassFish, Sun Grid, Amazon Corretto, the Apple openJDK, JetBrains and how normative Java specifications enable reliable LLM code generation

is available for download.

From CDI TCK to Quarkus MCP Server--airhacks.fm podcast 📎

Subscribe to airhacks.fm podcast via: spotify| iTunes

The #398 airhacks.fm episode with Martin Kouba (@martunek) about:

From CDI TCK, specifications and Weld to ArC and the MCP server in Quarkus

is available for download.

Finding Patterns: From Middleware to Modern AI--airhacks.fm podcast 📎

Subscribe to airhacks.fm podcast via: spotify| iTunes

The #397 airhacks.fm episode with Prof. Dr. Michael Stal /in/drstal about:

discovering patterns in middleware, writing the POSA book, Java adoption at Siemens, and the limits of LLMs and AGI

is available for download.

JAX Conference (German) Talk: Business Logic First: Escaping Java’s Bloat Addiction 📎

Java projects are still plagued by outdated habits: excessive layers, unnecessary abstractions, more YAML/XML than code, tests for obvious things like getters, "Stats and Ticket-d riven Development" and systems where everything is configurable but nothing varies, leaving business logic buried under unnecessary complexity. It’s time to focus on code again. This session will demonstrate how modern Java 25 and standards can help us to reduce unnecessary complexity. The focus is on shipping real business logic with greater simplicity and maintainability.

146th airhacks tv: Rust, Java 25, AI Agents, BCE, Web Components, zunit, zb 📎

2025.06, the 146th airhacks.tv episode is available:

Two Time Machines: Revisiting the 45th Episode After 9 Years, Java 25 Scripting With Zero Dependencies, Java vs. Kotlin in 2025, Web Components and Standards Without Frameworks, Multiple Datasources With Different Permissions for AI Agents, Connection Pools and JPA Cache in Serverless Environments, Why XA Transactions Are a Bad Idea, JavaFX Revival With Official Oracle Support, EJBs vs. CDI Pooling and Jakarta EE 12 Native Pooling Plans, BCE Design and AI Rails for AI Agents, Jakarta Data and Delta Spike Evolution, Docker Swarm vs. Kubernetes vs. OpenShift for Java Developers, Server Push With WebSockets and SSE in the MCP Era, Role-Based Access With MicroProfile JWT and @RolesAllowed, Scala vs. Rust vs. Java 25 Analysis, Why LLMs Understand Java Better Than Other Languages, zsmith Agent Harness in Pure Java 25, LangChain4j With Quarkus and Native Compilation, Zero-Dependency JUnit Runner Under 1 Second, JAX Frankfurt, AirHacks Munich Airport Workshop December 2026, Summer AirHacks Online, JCon Cologne, Devoxx Greece, JavaOne, Devoxx London, Geekon Krakow

From Manchester to Mountain View: Binary Translators, JVMs, and Android--airhacks.fm podcast 📎

Subscribe to airhacks.fm podcast via: spotify| iTunes

The #396 airhacks.fm episode with Ian Rogers about:

Binary translators, JVMs, and the Android Runtime, traced from a ZX Spectrum in Manchester to Linux kernel performance work at Google.

is available for download.

Rust, SSE, WebSockets, BCE, JPA History, LLMs-Questions and Topics for the 146th airhacks.tv 📎

Questions and topics for the 2026.05/1416h edition of airhacks.tv:

Opinion about the Rust programming language (asked by @Pscheidl on LinkedIn)
Time machine: the 46th airhacks.tv from January 2018
Application metrics and monitoring in Java EE, ServerSockets vs higher-level alternatives, SSE, WebSockets, DAO anti-patterns, JPA historisation, SLSB tuning, Docker, Kubernetes, OpenShift and more
Time machine: the 45th airhacks.tv from December 2017
Kotlin in Java EE, JasperReport alternatives, custom REST headers, two persistence units in CDI, JPA internals, JWT authentication, Web Components, offline-first PWAs and more
airhacks workshops return to Munich Airport
conference report

Any questions left? Ask now: gist.github.com/AdamBien/4c2a1c099321bac29ecb6ebc262ae196 and get the answers at the next airhacks.tv. Some questions are also answered with a short video: 60 seconds or less with Java

Live Coding at Devoxx Greece: Your Java Code Is Your LLM Prompt 📎

LLMs know Java's normative APIs, specifications, and implementations. That changes how you write code. Code structure, naming conventions, and adherence to standards directly affect what LLMs generate. Well-structured, zero-dependency Java produces more consistent, more predictable LLM output and scales to big projects. Starting from a blank project, each step demonstrates how code organization, standard API usage, and consistent patterns influence LLM-assisted development. The same principles that make code readable for developers also make it processable for LLMs. No slides. No theory. Just code. Questions welcome at any time.

Migrating Ruby Monoliths to Java, Agentic AI Foundation and MCP-airhacks.fm podcast 📎

Subscribe to airhacks.fm podcast via: spotify| iTunes

The #395 airhacks.fm episode with Manik Surtani (@maniksurtani) about:

From JBoss Cache and Infinispan to migrating Ruby on Rails monoliths to Java microservices at Square, co-designing MCP with Anthropic, building the Goose coding agent, and founding the Agentic AI Foundation

is available for download.

"hello, world" Java 25 Script In 4 Lines 📎

Java 25 source-file mode turns a single file into an executable command. No compilation, no build tool, no .java extension. Here is how to create a zhello script:

Create a file named zhello (no extension) with the shebang #!/usr/bin/env -S java --source 25
Add an instance void main() method that calls IO.println("hello, world")
Make it executable: chmod +x zhello
Install it system-wide: sudo cp zhello /usr/local/bin/

The entire script:

#!/usr/bin/env -S java --source 25

void main() {
    IO.println("hello, world");
}

145th airhacks tv: BCE, airails.dev, Zero-Dependency Agents, and Java 25 Scripts 📎

The 145th airhacks.tv episode 2026.04 is available:

BCE pattern from 1992: top-level packages named by context not by layer, at most 3 sub-packages: Boundary, Control, Entity, @Transactional belongs only on the Boundary — one button push equals one use case equals one transaction, Hibernate Validator skipped in favor of custom Control-layer validation for precise 400-error handling, zero-dependency MCP server by Mr. Aldo from France: built with BCE and zero-dependency principles, Agent Smith live demo: zero-dependency Java 25 agent framework in ~130KB single JAR, transcriber agent with episodic memory and agent delegation, GPU Llama / TornadoVM: Apple Metal support for running Mistral and DevStral locally, plans to run Agent Smith without cloud, Z ecosystem: ZDate, ZJDocFind, ZUnit single-file test runner in ~300 lines with parallel execution, ZB build tool — all zero-dependency Java 25 scripts, LLMs understand standards better than frameworks — enables lean code generation without external dependencies, Enterprise Fire April 1st skill: converting Hello World to 73 classes with all patterns, hexagonal architecture appears for the first time on AirHacks, upcoming AirHacks Live summer workshops on spec-driven Java development and front-ends without dependencies

Do you have any more questions? See you at: airhacks.live

Apache PLC4X, Industrial Protocol Drivers, and the JDBC of Industrial Automation--airhacks.fm podcast 📎

Subscribe to airhacks.fm podcast via: spotify| iTunes| YouTube

The #394 airhacks.fm episode with Christofer Dutz (christofer-dutz/) about:

Apache PLC4X as the JDBC of industrial automation, native protocol drivers versus OPC UA, founding ToddySoft for commercially supported open source industrial products, Apache IoTDB and TsFile for time series storage, and Industry 4.0 use cases on the shop floor.

is available for download.

airails.dev Skills, @Transactional in BCE, Hibernate Validator, Zero-Framework MCP Server, GPULLama3 and zSmith Agent News--145th airhacks.tv: 📎

Topics for the 145th airhacks.tv:

airails.dev microprofile-server skill: @Transactional in the boundary vs. control layer for bulk processing and synchronous API calls
BCE architecture in Jakarta Faces applications - how to structure them
Local inference with Tornado / Java GPULLama3
airails.dev microprofile-server skill: why never use quarkus-hibernate-validator? Alternatives to manual validation
MCP server with Zero-framework / BCE architecture by Aldo Lushklja: zdtp-mcp, blog post
Agent zSmith news
zeeds (Java Zero-Dependency Seed Scripts) news

See you every first Monday of the month at https://airhacks.tv 8pm CET (UTC+1:00). Show is also announced at: meetup.com/airhacks.

Ask questions during the show via twitter mentioning me: https://twitter.com/AdamBien (@AdamBien),using the hashtag: #airhacks or built-in chat at: airhacks.tv. You can join the Q&A session live each first Monday of month, 8 P.M at airhacks.tv

Early 2026: Upcoming Conferences, Workshops and Events 📎

JCON Europe 2026: Livecoding: Creating Beautiful Java Code With LLM and Agents
conference talk JCON Europe 2026 Cologne, Germany 21 April 2026
https://schedule.jcon.one/2026/speakers
Devoxx Greece 2026: Your Java Code Is Your LLM Prompt
conference talk Devoxx Greece 2026 Athens, Greece 23 April 2026
https://m.devoxx.com/events/dvgr26/talks/13551/session-by-adam-bien
JAX, Mainz: Business Logic First: Escaping Java's Bloat Addiction
conference talk JAX, Mainz Berlin, Germany 5 May 2026
https://jax.de/serverside-enterprise-java/business-logic-first-java/
JAX, Mainz: Live Coding Production Java: Incremental Development with LLMs
conference talk JAX, Mainz Berlin, Germany 5 May 2026
https://jax.de/big-data-machine-learning/java-incremental-development-llm-java/
Geecon: The 50x Developer: Java's Unfair Advantage in the Age of AI Agents
conference talk Geecon Krakow, Poland 14th May 2026
https://2026.geecon.org/speakers/info.html?id=1062
VibeKode Conference: How To Write Great Code with LLMs #vibeless
conference talk VibeKode Conference Munich, Germany 24 June 2026
https://vibekode.it
Spec-Driven Java Development at LLM Speed[online event]
online workshop 2 July 2026
https://workshops.adam-bien.com
LLM-Assisted Web Components: No Frameworks, No Dependencies. Just Web Standards[online event]
online workshop 16 July 2026
https://workshops.adam-bien.com
airhacks.tv Questions and Answers #livestream[online event]
live streaming show first monday of the month, 8pm CET
https://www.meetup.com/airhacks/

AWS Infrastructure as Code: CloudFormation Origins, CDK Stacks, and Terraform Trade-offs--airhacks.fm podcast 📎

Subscribe to airhacks.fm podcast via: spotify| iTunes|

The #393 airhacks.fm episode with Thorsten Höger (@hoegertn) about:

Migrating a German bank to AWS in 2012, the evolution from CloudFormation JSON to CDK, declarative state management, Terraform trade-offs, CDK stacks as atomic deployment units, regulated industries and compliance, and the CDK Book.

is available for download.

Green Java with Quarkus: Performance Benchmarks, SBOM, and Serverless Architecture--airhacks.fm podcast 📎

Subscribe to airhacks.fm podcast via: spotify| iTunes|

The #392 airhacks.fm episode with Holly Cummins (@holly_cummins) about:

Quarkus energy efficiency benchmarks, greener Java, serverless SnapStart, JVM tuning, SBOM generation, and cheese fondue

is available for download.

Java 25 Script Files with Classpath in the Shebang 📎

Java 25 script files (JEP 512: Compact Source Files and Instance Main Methods and JEP 458: Launch Multi-File Source-Code Programs) are self contained, but can specify a classpath in the shebang.

The Log.user, for example, is located in zcl.jar and can referenced with the following shebang:


#!/usr/bin/java --class-path=zcl.jar  --source 25
void main() {
  Log.user("Hello, duke");
}

zunit: Zero-Dependency Java 25 Test Runner 📎

zunit is a zero-dependency, single-file Java 25 Script test runner. It discovers *Test.java source files and runs each directly via java --source 25. No compilation step, no JUnit, no build tool required — just run zunit.

A GreeterTest.java example:


void main() {
    var greeting = Greeter.greet("World");
    assert Objects.equals("Hello, World!", greeting) : "expected 'Hello, World!' but got: " + greeting;
}

Each test is a self-contained Java source script with a void main() method. Any thrown exception or non-zero exit code indicates test failure. Each test gets its own JVM — full execution isolation with zero shared state.

Run it with:

zunit to auto-detect and run all tests,
zunit -cp:zbo/app.jar for an explicit classpath
zb && zunit to build with zb first.

zunit is available from: github.com/AdamBien/zunit

The corresponding airails.dev skill is available from: github.com/AdamBien/airails/tree/main/java/zunit

Formal Methods, Functional Programming, and Securing the Java Ecosystem--airhacks.fm podcast 📎

Subscribe to airhacks.fm podcast via: spotify| iTunes|

The #391 airhacks.fm episode with Brian Vermeer (@BrianVermeer) about:

Haskell and pure functional programming, building temperature sensor monitoring systems, enterprise service-based architecture and JavaServer Faces, Snyk's origins as an NPM dependency scanner, supply chain security and expansion to Java, static code analysis and container scanning and AI flow analysis, security as part of the development lifecycle, vibe coding risks and MCP server toxic flows, modern Java simplicity vs legacy enterprise verbosity, JBang for Java scripting, Java developers thinking about production readiness from the start

is available for download.

Live from W-JAX (German): Hardcore Serverless Java - This is the Way 📎

The talk covers how to build Java applications that auto-scale, are highly available, and run without traditional infrastructure overhead.