How Spring talks to databases

Before writing repositories and entity classes, it helps to understand what's happening in the layers below. Spring Boot makes database access feel effortless, but there's a real stack underneath. Knowing it saves you when things go wrong, and they will.

JDBC: the foundation

JDBC (Java Database Connectivity) is the raw Java API for talking to a relational database. Here's what a basic query looks like in plain JDBC:

Java
Kotlin
Groovy

Raw JDBC query
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.PreparedStatement;
import java.sql.ResultSet;

Connection conn = DriverManager.getConnection(
    "jdbc:postgresql://localhost:5432/sakila", "user", "password"
);

PreparedStatement stmt = conn.prepareStatement(
    "SELECT film_id, title FROM film WHERE release_year = ?"
);
stmt.setInt(1, 2006);

ResultSet rs = stmt.executeQuery();
while (rs.next()) {
    int id       = rs.getInt("film_id");
    String title = rs.getString("title");
    System.out.println(id + ": " + title);
}

rs.close();
stmt.close();
conn.close();

Raw JDBC query
import java.sql.DriverManager

val conn = DriverManager.getConnection(
    "jdbc:postgresql://localhost:5432/sakila", "user", "password"
)

val stmt = conn.prepareStatement(
    "SELECT film_id, title FROM film WHERE release_year = ?"
)
stmt.setInt(1, 2006)

val rs = stmt.executeQuery()
while (rs.next()) {
    val id    = rs.getInt("film_id")
    val title = rs.getString("title")
    println("$id: $title")
}

rs.close()
stmt.close()
conn.close()

Raw JDBC query
import java.sql.DriverManager

def conn = DriverManager.getConnection(
    'jdbc:postgresql://localhost:5432/sakila', 'user', 'password'
)

def stmt = conn.prepareStatement(
    'SELECT film_id, title FROM film WHERE release_year = ?'
)
stmt.setInt(1, 2006)

def rs = stmt.executeQuery()
while (rs.next()) {
    def id    = rs.getInt('film_id')
    def title = rs.getString('title')
    println "$id: $title"
}

rs.close()
stmt.close()
conn.close()

It works. But notice what you're managing yourself: the connection lifecycle, the parameter binding, the ResultSet iteration, and closing everything in the right order. Forget to close a connection and you've got a resource leak. Forget a try/catch and a checked SQLException blows up at the worst possible moment. Do this for every query in an application and you'll spend more time on plumbing than on business logic.

This is the problem Hibernate was built to solve.

Hibernate: ORM on top of JDBC

Hibernate is an ORM (Object-Relational Mapper). Its job is to bridge the gap between the relational world (tables, rows, foreign keys) and the object-oriented world (classes, instances, references). Instead of writing SQL and mapping ResultSet columns to fields yourself, you describe the mapping once with annotations, and Hibernate handles the rest.

Film.java
@Entity
@Table(name = "film")
public class Film {

    @Id
    @Column(name = "film_id")
    private Integer filmId;

    @Column(name = "title")
    private String title;

    @Column(name = "release_year")
    private Integer releaseYear;
}

With this in place, Hibernate loads a Film object from the database without you writing a line of SQL. It generates the query, executes it via JDBC, and maps the result back to the object automatically.

The key annotations you'll see throughout this section:

Annotation	What It Does
`@Entity`	Marks the class as a JPA-managed entity (maps to a table)
`@Table`	Specifies the table name (optional if it matches the class name)
`@Id`	Marks the primary key field
`@Column`	Maps a field to a specific column name
`@ManyToOne` / `@OneToMany`	Maps relationships between tables
`@EmbeddedId`	Marks a composite primary key

Hibernate still uses JDBC under the hood. It manages connection pooling, translates your object operations into SQL, and executes them through the same java.sql.* API you saw above. You just never have to touch it directly.

SessionFactory and Session

Hibernate's own API revolves around SessionFactory (created once at startup) and Session (one per unit of work). You'll see these names in older code and Hibernate documentation. In a Spring Boot project you'll rarely interact with them directly, Spring wraps them in a higher-level abstraction.

JPA: the standard specification

Hibernate was so widely adopted that the Java community standardized its ideas into a specification: JPA (Jakarta Persistence API). JPA defines the annotations (@Entity, @Id, @Column, and so on) and a core API — the EntityManager — that any compliant ORM must implement.

Hibernate is the most common JPA implementation, but others exist (EclipseLink, OpenJPA). Your code targets the jakarta.persistence.* interfaces, not Hibernate directly.

EntityManager is JPA's main handle for database operations:

Java
Kotlin
Groovy

FilmRepository.java
import jakarta.persistence.EntityManager;
import jakarta.persistence.PersistenceContext;

@Repository
public class FilmRepository {

    @PersistenceContext
    private EntityManager em;

    public Film findById(Integer id) {
        return em.find(Film.class, id);
    }

    public void save(Film film) {
        em.persist(film);
    }
}

FilmRepository.kt
import jakarta.persistence.EntityManager
import jakarta.persistence.PersistenceContext

@Repository
class FilmRepository {

    @PersistenceContext
    private lateinit var em: EntityManager

    fun findById(id: Int): Film? =
        em.find(Film::class.java, id)

    fun save(film: Film) =
        em.persist(film)
}

FilmRepository.groovy
import jakarta.persistence.EntityManager
import jakarta.persistence.PersistenceContext

@Repository
class FilmRepository {

    @PersistenceContext
    EntityManager em

    Film findById(Integer id) {
        em.find(Film, id)
    }

    void save(Film film) {
        em.persist(film)
    }
}

No ResultSet, no connection management, no checked exceptions. But you're still writing repository classes by hand for every entity. Spring Data JPA eliminates that too.

Spring Data JPA: repositories without boilerplate

The repository pattern

The Repository Pattern is an abstraction layer that sits between your application and your data storage. Think of it as a translator. Your application asks for what it wants in its own language, like "give me all the films released in 2024," and the repository handles the messy details of turning that request into actual database operations.

You don't need to know if the data lives in PostgreSQL, MongoDB, or some external API. The repository handles that complexity so your business logic stays clean and focused on what it does best.

Declaring a repository

Spring Data JPA sits on top of JPA and Hibernate. Instead of implementing a repository class yourself, you declare an interface and Spring generates the implementation at runtime.

FilmRepository.java
public interface FilmRepository extends JpaRepository<Film, Integer> {
    List<Film> findByReleaseYear(Integer year);
}

No class, no EntityManager, no SQL. Spring sees the interface, wires up the implementation, and findByReleaseYear generates the correct query from the method name automatically.

Beyond the basics

JpaRepository provides more than simple CRUD operations:

Pagination and sorting capabilities: Handle large datasets without loading everything into memory.
Dynamic query generation from method names: Name your method findByTitleContainingAndYearGreaterThan and Spring Data JPA figures out the SQL for you.
Custom query support: Use the @Query annotation when you need complex operations that method names can't express.
Transaction management: Spring handles the begin/commit/rollback dance automatically.

Schema migrations

Your schema will change. A new feature needs a column, a performance fix requires an index, a refactor renames a table. How you manage those changes matters as much as the code itself.

Code-first

The entity class is the source of truth. You add a field to the entity, and on next startup Hibernate reads it and issues the ALTER TABLE against the database, when spring.jpa.hibernate.ddl-auto is set to update or create.

The schema follows the code. This works well when your team owns the database entirely and you're moving fast on a greenfield project. The downside: Hibernate's schema updates aren't always reversible, you get no migration history, and a bad ddl-auto setting in production can cause data loss.

Database-first

The database is the source of truth. Schema changes are explicit SQL scripts, written and reviewed before anything runs. A migration tool applies them in order and tracks what's already been applied, so the same script never runs twice.

Flyway uses versioned SQL files: V1__create_film_table.sql, V2__add_last_updated_column.sql. On startup, Flyway checks which scripts have already run and applies any new ones. Simple and transparent.

Liquibase takes the same idea further. Changesets can be written in XML, YAML, JSON, or SQL. It supports rollback definitions, more complex diff operations, and is more database-agnostic. More power, more configuration.

	Code-First	Flyway	Liquibase
Source of truth	Entity class	SQL migration files	Changeset files
Migration history	None	`flyway_schema_history` table	`databasechangelog` table
Rollback	Not supported	Manual	Built-in (optional)
Format	Java/Kotlin annotations	SQL	SQL, XML, YAML, JSON
Best for	Local dev / prototyping	Most production teams	Complex / multi-DB environments

Never Use ddl-auto=update in Production

It's tempting to leave spring.jpa.hibernate.ddl-auto=update running everywhere. Don't. It can drop columns, silently ignore type mismatches, and gives you zero audit trail of what changed and when. Use validate in production, it checks that entities match the schema but changes nothing, and let Flyway or Liquibase own the actual migrations.

The full stack

Here's what sits between your business logic and the actual database:

Scroll to zoom • Drag corner to resize

Each layer adds a level of abstraction. You almost always work at the top. When something breaks (a query performs badly, a mapping is wrong, a transaction doesn't behave as expected) knowing this stack tells you where to look. Next, we'll reverse-engineer JPA entities from an existing database schema.

JDBC: the foundation​

Hibernate: ORM on top of JDBC​

JPA: the standard specification​

Spring Data JPA: repositories without boilerplate​

The repository pattern​

Declaring a repository​

Beyond the basics​

Schema migrations​

Code-first​

Database-first​

The full stack​