Java Concurrent Programming

2024-01-30

cs-base

java

doc

meeting

multi-prog

Java Concurrent Programming#

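What are threads and processes?#

What is a process?#

A process is one execution of a program and the basic unit in which the system runs programs, so a process is dynamic. Running a program means a process is created, runs, and eventually terminates.

In Java, starting the main function starts a JVM process, and the thread that runs main is one thread in that process, also called the main thread.

What is a thread?#

A thread is similar to a process but is a smaller unit of execution. A process can create multiple threads while it runs. Unlike processes, threads of the same process share the process's heap and method area, while each thread has its own program counter, VM stack, and native method stack. Creating a thread or switching between threads is therefore much cheaper than doing the same with processes, which is why threads are also called lightweight processes.

A Java program is inherently multithreaded. We can use JMX to see which threads an ordinary Java program has: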

import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;

public class MultiThread {
    public static void main(String[] args) {
        // Get the Java thread-management MXBean
        ThreadMXBean threadMXBean = ManagementFactory.getThreadMXBean();
        // Skip locked-monitor and locked-synchronizer info; fetch only thread and stack info
        ThreadInfo[] threadInfos = threadMXBean.dumpAllThreads(false, false);
        // Print only the ID and name of each thread
        for (ThreadInfo threadInfo : threadInfos) {
            System.out.println("[" + threadInfo.getThreadId() + "] " + threadInfo.getThreadName());
        }
    }
}

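The program prints something like the following (your output may differ; don't worry about what each thread does, just note that the main thread runs the main method):

[5] Attach Listener // accepts attach requests from external tools
[4] Signal Dispatcher // dispatches signals sent to the JVM
[3] Finalizer // calls objects' finalize methods
[2] Reference Handler // clears references
[1] main // main thread, the program entry point

As the output shows, a running Java program consists of the main thread plus several other threads running at the same time.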

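How do Java threads differ from operating system threads?#

Before JDK 1.2, Java threads were implemented as green threads, a kind of user-level thread: the JVM simulated multithreading itself without relying on the operating system. Green threads have limitations compared with native threads (for example, they cannot directly use OS facilities such as asynchronous I/O, and they run on a single kernel thread, so they cannot use multiple cores). From JDK 1.2 onward, Java threads are implemented on native threads: the JVM uses the operating system's kernel-level threads directly, and the OS kernel schedules and manages them.

Since user threads and kernel threads were just mentioned, here is a brief introduction for readers who are unfamiliar with the difference:

User thread: a thread managed and scheduled by a user-space program; it runs in user space (the space reserved for applications).
Kernel thread: a thread managed and scheduled by the operating system kernel; it runs in kernel space (accessible only to kernel code).

In short: user threads are cheap to create and switch but cannot use multiple cores; kernel threads are more expensive to create and switch but can use multiple cores.

In one sentence: today's Java threads are essentially operating system threads.

A thread model describes how user threads are mapped to kernel threads. There are three common models:

One-to-one (one user thread maps to one kernel thread)
Many-to-one (many user threads map to one kernel thread)
Many-to-many (many user threads map to many kernel threads)

On mainstream operating systems such as Windows and Linux, Java threads use the one-to-one model: one Java thread corresponds to one kernel thread. Solaris is a special case (Solaris itself supports a many-to-many model), and HotSpot VM on Solaris supports both many-to-many and one-to-one.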

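Briefly describe the relationship, differences, and trade-offs between threads and processes#

Let's look at the relationship between processes and threads from the JVM's point of view.

Processes and threads illustrated#

The figure below shows the Java memory areas; we use it to explain the relationship between threads and processes from the JVM's point of view.

As the figure shows, a process can contain multiple threads. The threads share the process's heap and method area (the metaspace since JDK 1.8), but each thread has its own program counter, VM stack, and native method stack.

Summary: threads are smaller units of execution that a process is divided into. The biggest difference is that processes are basically independent of one another, whereas threads are not necessarily so, because threads in the same process are very likely to affect each other. Threads have low execution overhead but make resources harder to manage and protect; processes are the opposite.

What follows extends this topic.

Consider this question: why are the program counter, the VM stack, and the native method stack private to each thread, while the heap and the method area are shared?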

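Why is the program counter private?#

The program counter serves two main purposes:

The bytecode interpreter changes the program counter to read instructions one by one, which implements control flow such as sequential execution, selection, loops, and exception handling.
With multiple threads, the program counter records where the current thread is executing, so that when the thread is switched back in, it knows where it left off.

Note that while a native method is executing, the program counter's value is undefined; only while Java code is executing does it hold the address of the next instruction.

So the program counter is private mainly so that a thread can resume at the correct position after a thread switch.

Why are the VM stack and the native method stack private?#

VM stack: before a Java method executes, a stack frame is created to hold the local variable table, operand stack, constant pool reference, and similar information. The span from a method's invocation to its completion corresponds to one stack frame being pushed onto and popped off the Java VM stack.
Native method stack: it plays a role very similar to the VM stack. The difference is that the VM stack serves Java methods (bytecode), while the native method stack serves the native methods the VM uses. In HotSpot the two are combined into one.

So the VM stack and the native method stack are thread-private to ensure that a thread's local variables cannot be accessed by other threads.

The heap and the method area in one sentence#

The heap and the method area are shared by all threads. The heap is the largest block of memory in the process and mainly holds newly created objects (almost all objects are allocated here). The method area mainly holds loaded class information, constants, static variables, JIT-compiled code, and similar data.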

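Concurrency vs. parallelism#

Concurrency: two or more jobs make progress within the same time period.
Parallelism: two or more jobs execute at the same instant.

The key difference is whether the jobs truly execute simultaneously.

Synchronous vs. asynchronous#

Synchronous: after a call is issued, it does not return until the result is available; the caller keeps waiting.
Asynchronous: after a call is issued, it returns immediately without waiting for the result.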

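Why use multithreading?#

First, at a high level:

From the hardware and OS perspective: a thread can be seen as a lightweight process and is the smallest unit of program execution; switching and scheduling threads costs far less than doing so for processes. In addition, multi-core CPUs let multiple threads run at the same time, which reduces the overhead of thread context switching.
From the perspective of today's internet systems: systems are routinely expected to handle millions or even tens of millions of concurrent requests, and multithreaded concurrent programming is the foundation of high-concurrency systems. Using threads well can greatly improve a system's overall concurrency and performance.

Now in more depth:

Single-core era: multithreading mainly improved how efficiently a single process used the CPU and the IO system. Suppose only one Java process is running. If it has a single thread and that thread blocks on IO, the whole process is blocked. Only one of the CPU and the IO device is busy at a time, so roughly speaking the system runs at 50% efficiency. With multiple threads, when one thread blocks on IO the others can keep using the CPU, which improves the process's overall use of system resources.
Multi-core era: multithreading mainly improves how well a process uses multiple CPU cores. For example, if we compute a complex task with a single thread, only one core is used no matter how many cores the system has. With multiple threads, the threads can be mapped onto multiple cores. If the threads do not contend for resources, the task runs significantly faster, in roughly (single-core execution time / number of CPU cores).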

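What problems can multithreading cause?#

The goal of concurrent programming is to make programs run faster, but it does not always do so, and it can introduce many problems, such as memory leaks, deadlocks, and thread-safety bugs.

What do thread-safe and thread-unsafe mean?#

Thread safety describes whether accesses to the same data in a multithreaded environment keep that data correct and consistent.

Thread-safe means that, in a multithreaded environment, the same data stays correct and consistent no matter how many threads access it at the same time.
Thread-unsafe means that, in a multithreaded environment, simultaneous access by multiple threads may corrupt the data, produce wrong results, or lose updates.

Is running multiple threads on a single-core CPU always more efficient?#

Whether multiple threads are more efficient on a single-core CPU depends on the kind of work they do. Broadly there are two kinds of threads: CPU-bound and IO-bound. CPU-bound threads mainly compute and process logic and need a lot of CPU. IO-bound threads mainly perform input and output, such as reading and writing files or network communication; they spend their time waiting for IO devices and use little CPU.

On a single-core CPU only one thread runs at any instant, and the others wait for a time slice. If the threads are CPU-bound, running several of them causes frequent thread switches, which adds overhead and lowers efficiency. If the threads are IO-bound, running several of them uses the CPU time that would otherwise be idle while waiting for IO, which raises efficiency.

So on a single-core CPU, many threads hurt efficiency for CPU-bound tasks and help for IO-bound tasks. Of course, "many" must stay within what the system can handle.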

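Describe the thread life cycle and states#

At any given moment in its life cycle, a Java thread is in exactly one of the following 6 states:

NEW: initial state; the thread has been created but start() has not been called.
RUNNABLE: running state; start() has been called and the thread is running or waiting to be scheduled.
BLOCKED: blocked state; the thread is waiting for a lock to be released.
WAITING: waiting state; the thread is waiting for another thread to take a specific action (notify or interrupt).
TIMED_WAITING: timed waiting state; unlike WAITING, the thread returns on its own after the specified time.
TERMINATED: terminated state; the thread has finished running.

A thread does not stay in one state; it moves between states as the code executes.

After creation a thread is in the NEW state. Calling start() makes it ready to run (READY). When a ready thread gets a CPU time slice it is RUNNING. The JVM reports both READY and RUNNING as RUNNABLE.

When a thread calls wait(), it enters the WAITING state. A waiting thread needs a notification from another thread before it can return to the running state.
TIMED_WAITING is the waiting state with a timeout added; for example, sleep(long millis) or wait(long millis) puts a thread into TIMED_WAITING. When the timeout expires, the thread returns to RUNNABLE.
When a thread enters a synchronized method or block, or re-enters one after wait() (once notified), and the lock is held by another thread, it enters the BLOCKED state.
After run() finishes, the thread enters the TERMINATED state.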
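The state transitions described above can be observed with Thread.getState(). The following is a minimal sketch, not a definitive demonstration: the class name ThreadStateDemo and the helper observeStates() are made up for illustration, and the middle observation depends on scheduling (it is normally TIMED_WAITING because the worker sleeps).

```java
final class ThreadStateDemo {
    /** Returns the states seen before start(), while the worker sleeps, and after it ends. */
    static Thread.State[] observeStates() {
        Thread worker = new Thread(() -> {
            try {
                Thread.sleep(300); // timed sleep: the worker reports TIMED_WAITING
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }, "state-demo");

        Thread.State beforeStart = worker.getState(); // NEW: start() not called yet
        worker.start();

        // Poll until the worker is asleep (or, in the worst case, already finished).
        Thread.State whileSleeping = worker.getState();
        while (whileSleeping != Thread.State.TIMED_WAITING
                && whileSleeping != Thread.State.TERMINATED) {
            Thread.yield();
            whileSleeping = worker.getState();
        }

        try {
            worker.join(); // wait for run() to return
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        Thread.State afterJoin = worker.getState(); // TERMINATED
        return new Thread.State[] {beforeStart, whileSleeping, afterJoin};
    }

    public static void main(String[] args) {
        for (Thread.State state : observeStates()) {
            System.out.println(state);
        }
    }
}
```

Note that getState() never returns READY or RUNNING; those are sub-states of RUNNABLE from the JVM's point of view.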

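What is a thread context switch?#

While executing, a thread has its own running conditions and state (its context), such as the program counter and stack information mentioned above. A thread gives up the CPU in the following situations:

It voluntarily yields the CPU, for example by calling sleep() or wait().
Its time slice is used up; the operating system prevents one thread or process from holding the CPU so long that others starve.
It makes a blocking system call, such as an IO request, and is blocked.
It is terminated or finishes running.

The first three cause a thread switch. A switch means saving the current thread's context so it can be restored the next time the thread gets the CPU, and loading the context of the next thread to run. This is called a context switch.

Context switching is a basic feature of modern operating systems. Because each switch saves and restores information, it consumes CPU, memory, and other system resources, so it costs some efficiency; frequent switching lowers overall efficiency.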

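What is a thread deadlock? How can it be avoided?#

Understanding thread deadlock#

A thread deadlock is a situation in which multiple threads are blocked at the same time, and one or all of them are waiting for some resource to be released. Because the threads are blocked indefinitely, the program cannot terminate normally.

For example, thread A holds resource 2 and thread B holds resource 1, and each wants the resource the other holds, so the two threads wait for each other and deadlock.

The four necessary conditions for deadlock:

Mutual exclusion: a resource is held by only one thread at any moment.
Hold and wait: a thread that blocks while requesting a resource keeps the resources it already holds.
No preemption: resources a thread has acquired cannot be forcibly taken by other threads before it is done with them; only the thread itself releases them.
Circular wait: several threads form a head-to-tail cycle of waiting for resources.

How to prevent and avoid thread deadlock#

How do we prevent deadlock? Break one of the necessary conditions:

Break hold and wait: request all resources at once.
Break no preemption: when a thread that holds some resources cannot obtain the others it requests, it voluntarily releases the ones it holds.
Break circular wait: request resources in a fixed order and release them in the reverse order.

How do we avoid deadlock?

Avoiding deadlock means using an algorithm (such as the banker's algorithm) to evaluate each resource allocation so that the system stays in a safe state.

A safe state is one in which the system can allocate resources to every thread in some order (P1, P2, P3, ..., Pn) until each thread's maximum demand is met, so that every thread can finish. The sequence <P1, P2, P3, ..., Pn> is called a safe sequence.

If thread 2 is changed to acquire the locks in the same order as thread 1, as below, the deadlock no longer occurs.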

// resource1 and resource2 are the two shared lock objects used by both threads
new Thread(() -> {
    synchronized (resource1) {
        System.out.println(Thread.currentThread() + " get resource1");
        try {
            Thread.sleep(1000);
        } catch (InterruptedException e) {
            e.printStackTrace();
        }
        System.out.println(Thread.currentThread() + " waiting get resource2");
        synchronized (resource2) {
            System.out.println(Thread.currentThread() + " get resource2");
        }
    }
}, "Thread 1").start();

// Thread 2 now takes resource1 first, the same order as thread 1
new Thread(() -> {
    synchronized (resource1) {
        System.out.println(Thread.currentThread() + " get resource1");
        try {
            Thread.sleep(1000);
        } catch (InterruptedException e) {
            e.printStackTrace();
        }
        System.out.println(Thread.currentThread() + " waiting get resource2");
        synchronized (resource2) {
            System.out.println(Thread.currentThread() + " get resource2");
        }
    }
}, "Thread 2").start();

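Why does the code above avoid deadlock?

Thread 1 first acquires the monitor lock of resource1, so thread 2 cannot acquire it. Thread 1 then acquires the monitor lock of resource2, which succeeds. When thread 1 releases resource1 and resource2, thread 2 acquires them and runs. This breaks the circular wait condition, so deadlock is avoided.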

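sleep() vs. wait()#

In common: both pause the execution of a thread.

Differences:

sleep() does not release the lock; wait() does.
wait() is usually used for interaction and communication between threads; sleep() is usually used to pause execution.
After wait() is called, the thread does not wake up on its own; another thread must call notify() or notifyAll() on the same object. After sleep() completes, the thread wakes up automatically. wait(long timeout) also wakes the thread automatically when the timeout expires.
sleep() is a static native method of the Thread class; wait() is a native method of the Object class.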
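The first and third differences can be illustrated with a small wait/notify handshake. This is a minimal sketch under stated assumptions: the class WaitNotifyDemo, its field ready, and the helper runDemo() are illustrative names, not part of any library.

```java
final class WaitNotifyDemo {
    private final Object lock = new Object();
    private boolean ready = false; // guarded by lock

    /** Blocks until markReady() has been called. */
    void awaitReady() throws InterruptedException {
        synchronized (lock) {
            while (!ready) {   // re-check in a loop: wakeups can be spurious
                lock.wait();   // releases the monitor of lock while waiting
            }
        }
    }

    void markReady() {
        synchronized (lock) {  // can be entered because wait() released the monitor
            ready = true;
            lock.notifyAll();  // wake up threads waiting on the same object
        }
    }

    /** Returns true if the waiting thread finished after being notified. */
    static boolean runDemo() {
        WaitNotifyDemo demo = new WaitNotifyDemo();
        Thread waiter = new Thread(() -> {
            try {
                demo.awaitReady();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        waiter.start();
        demo.markReady();
        try {
            waiter.join(2000);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return !waiter.isAlive();
    }

    public static void main(String[] args) {
        System.out.println("waiter finished: " + runDemo());
    }
}
```

If awaitReady() used Thread.sleep() inside the synchronized block instead of lock.wait(), markReady() could not enter its synchronized block until the sleep ended, because sleep() keeps the lock.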

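Why isn't wait() defined in Thread?#

wait() makes the thread that holds an object's lock wait, and it automatically releases that lock. Every object (Object) has a lock. Since the operation releases the lock the current thread holds on an object and puts the thread into the WAITING state, it naturally operates on the object (Object) rather than on the current thread (Thread).

A similar question: why is sleep() defined in Thread?

Because sleep() pauses the current thread; it does not involve any object and does not need an object lock.

Can we call the run method of Thread directly?#

Creating a Thread with new puts it in the NEW state. Calling start() starts the thread and puts it in the ready state; it begins running once it gets a time slice. start() does the necessary preparation and then executes run() automatically in the new thread; this is real multithreading. Calling run() directly, however, executes run() as an ordinary method on the calling thread (for example, the main thread); no new thread is started, so this is not multithreading.

Summary: call start() to start a thread and put it in the ready state; calling run() directly does not execute it in a new thread.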
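A quick way to see the difference is to record which thread actually executes run(). The sketch below is illustrative only; the class name StartVsRunDemo and the thread name "worker" are arbitrary choices.

```java
final class StartVsRunDemo {
    /** Returns {thread that executed run() when called directly, thread that executed it via start()}. */
    static String[] executingThreadNames() {
        String[] names = new String[2];

        Thread direct = new Thread(() -> names[0] = Thread.currentThread().getName(), "worker");
        direct.run(); // ordinary method call: runs on the caller's thread, no new thread is started

        Thread started = new Thread(() -> names[1] = Thread.currentThread().getName(), "worker");
        started.start(); // a new thread named "worker" executes run()
        try {
            started.join(); // join() also makes the write to names[1] visible here
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return names;
    }

    public static void main(String[] args) {
        String[] names = executingThreadNames();
        System.out.println("run():   " + names[0]);
        System.out.println("start(): " + names[1]);
    }
}
```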

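The volatile keyword#

How is variable visibility guaranteed?#

In Java, the volatile keyword guarantees the visibility of a variable. Declaring a variable volatile tells the JVM that the variable is shared and unstable, and that every use must read it from main memory.

JMM (Java Memory Model)

JMM (Java Memory Model): forcing reads from main memory

The volatile keyword is not unique to Java; C has it too, where its original purpose is to keep the value from being cached (for example, in a register). Marking a variable volatile tells the compiler that the variable is shared and unstable and must be read from main memory on every use.

volatile guarantees visibility but not atomicity. synchronized guarantees both.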
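A common use of the visibility guarantee is a stop flag that one thread sets and another polls. The following is a minimal sketch; the class name VolatileFlagDemo and the field running are illustrative. Without volatile, the JIT compiler would be allowed to hoist the read of the flag out of the loop, and the worker might never stop.

```java
final class VolatileFlagDemo {
    // volatile: a write by one thread becomes visible to later reads by other threads
    private static volatile boolean running = true;

    /** Returns true if the worker observed the flag change and stopped. */
    static boolean stopsAfterFlagCleared() {
        running = true;
        Thread worker = new Thread(() -> {
            while (running) {
                // busy work; each iteration re-reads the volatile flag
            }
        });
        worker.start();
        running = false; // visible to the worker because the field is volatile
        try {
            worker.join(2000);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return !worker.isAlive();
    }

    public static void main(String[] args) {
        System.out.println("worker stopped: " + stopsAfterFlagCleared());
    }
}
```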

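How is instruction reordering prevented?#

In Java, besides guaranteeing visibility, volatile has another important role: preventing the JVM from reordering instructions. When a variable is declared volatile, reads and writes of it insert specific memory barriers that prohibit reordering.

In Java, the Unsafe class provides three ready-to-use memory barrier methods that hide the differences between underlying platforms: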

public native void loadFence();
public native void storeFence();
public native void fullFence();

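In theory you can use these three methods to get the same no-reordering effect as volatile, but it takes more work.

Below, a common interview question illustrates how volatile prevents instruction reordering.

Interviewers often say: "Do you know the singleton pattern? Write one by hand, and explain how the double-checked locking implementation works."

Singleton with double-checked locking (thread-safe):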

public class Singleton {

    private volatile static Singleton uniqueInstance;

    private Singleton() {
    }

    public static Singleton getUniqueInstance() {
        // Check first whether the instance exists; enter the locked section only if it does not
        if (uniqueInstance == null) {
            // Lock on the class object
            synchronized (Singleton.class) {
                if (uniqueInstance == null) {
                    uniqueInstance = new Singleton();
                }
            }
        }
        return uniqueInstance;
    }
}

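Declaring uniqueInstance as volatile is necessary. The statement uniqueInstance = new Singleton(); actually executes in three steps:

1. Allocate memory for uniqueInstance
2. Initialize uniqueInstance
3. Point uniqueInstance at the allocated memory

Because the JVM may reorder instructions, the order can become 1->3->2. Reordering causes no problem in a single thread, but with multiple threads a thread can obtain an instance that has not been initialized yet. For example, thread T1 executes steps 1 and 3; T2 then calls getUniqueInstance(), finds that uniqueInstance is not null, and returns it, although uniqueInstance has not been initialized.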

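Does volatile guarantee atomicity?#

volatile guarantees the visibility of a variable, but it does not make operations on the variable atomic.

The following code demonstrates this: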

import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

public class VolatoleAtomicityDemo {
    public volatile static int inc = 0;

    public void increase() {
        inc++; // read-modify-write: not atomic, even on a volatile field
    }

    public static void main(String[] args) throws InterruptedException {
        ExecutorService threadPool = Executors.newFixedThreadPool(5);
        VolatoleAtomicityDemo volatoleAtomicityDemo = new VolatoleAtomicityDemo();
        for (int i = 0; i < 5; i++) {
            threadPool.execute(() -> {
                for (int j = 0; j < 500; j++) {
                    volatoleAtomicityDemo.increase();
                }
            });
        }
        // Wait for all tasks to finish rather than sleeping for a fixed time
        threadPool.shutdown();
        threadPool.awaitTermination(10, TimeUnit.SECONDS);
        System.out.println(inc);
    }
}

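One would expect the code above to print 2500. When you actually run it, however, the result is often less than 2500.

Why does this happen? Doesn't volatile guarantee visibility?

If volatile made inc++ atomic, then after each thread incremented inc, the other threads would immediately see the new value; with 5 threads each performing 500 increments, the final value of inc would be 5*500=2500.

Many people assume that inc++ is atomic. In fact, inc++ is a compound operation with three steps:

Read the value of inc.
Add 1 to it.
Write the new value of inc back to memory.

volatile cannot make these three steps atomic, so the following can happen:

Thread 1 reads inc but has not yet modified it. Thread 2 then reads inc, modifies it (+1), and writes it back to memory.
After thread 2 is done, thread 1 modifies the value it read earlier (+1) and writes it back to memory.

As a result, two threads each incremented inc once, but inc increased by only 1.

Making the code correct is easy: use synchronized, Lock, or AtomicInteger.

Using synchronized: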

public synchronized void increase() {
    inc++;
}

Using AtomicInteger:

public AtomicInteger inc = new AtomicInteger();

public void increase() {
    inc.getAndIncrement();
}

Using ReentrantLock:

Lock lock = new ReentrantLock();

public void increase() {
    lock.lock();
    try {
        inc++;
    } finally {
        lock.unlock();
    }
}

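Optimistic and pessimistic locking#

What is a pessimistic lock?#

A pessimistic lock always assumes the worst case: that every access to a shared resource will run into trouble (for example, the shared data being modified). It therefore locks on every access, so other threads that want the resource block until the previous holder releases the lock. In other words, the shared resource is used by one thread at a time while the others block, and it is handed to another thread when the current one is done.

In Java, exclusive locks such as synchronized and ReentrantLock implement the pessimistic locking idea.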

public void performSynchronisedTask() {
    synchronized (this) {
        // operations that need synchronization
    }
}

private Lock lock = new ReentrantLock();

public void performLockedTask() {
    lock.lock();
    try {
        // operations that need synchronization
    } finally {
        lock.unlock();
    }
}

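Under high concurrency, fierce lock contention blocks threads, and large numbers of blocked threads cause context switches that add to system overhead. Pessimistic locking can also lead to deadlock, which stops the code from running normally.

What is an optimistic lock?#

An optimistic lock always assumes the best case: that accesses to a shared resource will not conflict. Threads keep running without locking or waiting, and only when committing a change do they verify whether the resource (that is, the data) has been modified by another thread (using a version number mechanism or the CAS algorithm).

In Java, the atomic variable classes in the java.util.concurrent.atomic package (such as AtomicInteger and LongAdder) are implemented with CAS, one way of implementing optimistic locking.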

// Under high contention LongAdder performs better than AtomicInteger and AtomicLong,
// at the cost of more memory (trading space for time)
LongAdder sum = new LongAdder();
sum.increment();

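Under high concurrency, optimistic locking has no lock contention that blocks threads and no deadlock, so it often performs better than pessimistic locking. However, if conflicts are frequent (a write-heavy workload), operations fail and retry often, which also hurts performance badly and can drive CPU usage up.

The problem of heavy failure and retry can be mitigated; LongAdder, mentioned above, does so by trading space for time.

In theory:

Pessimistic locking is usually used in write-heavy scenarios (many writes, fierce contention), where it avoids the cost of frequent failure and retry; its overhead is fixed. If the optimistic approach solves the failure-and-retry problem (as LongAdder does), optimistic locking can still be considered, depending on the actual situation.
Optimistic locking is usually used in read-heavy scenarios (few writes, little contention), where it avoids the cost of frequent locking. Note that optimistic locking mainly targets a single shared variable (see the atomic variable classes in java.util.concurrent.atomic).

How is optimistic locking implemented?#

Optimistic locking is generally implemented with a version number mechanism or the CAS algorithm. CAS is the more common of the two and deserves special attention.

Version number mechanism#

Typically a version column is added to the table to record how many times the row has been modified; each modification increments version by one. When thread A wants to update the data, it reads the version along with the data. On commit, it performs the update only if the version it read equals the current version in the database; otherwise it retries until the update succeeds.

A simple example: suppose the account table has a version column whose current value is 1, and the account balance (balance) is $100.

Operator A reads the row (version=1) and deducts $50 from the balance ($100-$50).
While operator A is working, operator B also reads the row (version=1) and deducts $20 from the balance ($100-$20).
Operator A finishes and submits the version number (version=1) together with the new balance (balance=$50). Because the submitted version equals the row's current version, the row is updated and its version becomes 2.
Operator B finishes and also tries to submit with version=1 (balance=$80). The comparison finds that operator B's version is 1 while the row's current version is 2, which violates the optimistic locking rule "the submitted version must equal the current version for the update to run", so operator B's submission is rejected.

This prevents operator B from overwriting operator A's result with a change based on the stale version=1 data.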
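The operator A / operator B scenario can be replayed with an in-memory stand-in for the database row. This is only a sketch under stated assumptions: the class VersionedAccount and its methods are hypothetical, and the synchronized update method plays the role that a conditional SQL update (WHERE version = ?) would play in a real database.

```java
final class VersionedAccount {
    private long balance;
    private int version;

    VersionedAccount(long balance, int version) {
        this.balance = balance;
        this.version = version;
    }

    synchronized long balance() {
        return balance;
    }

    synchronized int version() {
        return version;
    }

    /**
     * Applies the update only if the caller's version is still current.
     * Stands in for: UPDATE account SET balance = ?, version = version + 1 WHERE version = ?
     */
    synchronized boolean update(long newBalance, int expectedVersion) {
        if (version != expectedVersion) {
            return false; // someone else committed first; the caller must re-read and retry
        }
        balance = newBalance;
        version++;
        return true;
    }

    public static void main(String[] args) {
        VersionedAccount account = new VersionedAccount(100, 1);
        int versionReadByA = account.version(); // 1
        int versionReadByB = account.version(); // 1
        System.out.println("A commits: " + account.update(100 - 50, versionReadByA));
        System.out.println("B commits: " + account.update(100 - 20, versionReadByB));
        System.out.println("balance=" + account.balance() + ", version=" + account.version());
    }
}
```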

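The CAS algorithm#

CAS stands for Compare And Swap. It is used to implement optimistic locking and is widely used in major frameworks. The idea is simple: compare an expected value with the current value of the variable to be updated, and update only if the two are equal.

CAS is an atomic operation that relies on a single atomic CPU instruction.

An atomic operation is the smallest indivisible operation: once it starts, it cannot be interrupted until it completes.

CAS involves three operands:

V: the variable to update (Var)
E: the expected value (Expected)
N: the new value to write (New)

CAS atomically updates V to the new value N if and only if V equals E. If they differ, another thread has already updated V, and the current thread gives up the update.

A simple example: thread A wants to change variable i to 6, and i is currently 1 (V = 1, E = 1, N = 6, assuming there is no ABA problem).

Compare i with 1; if they are equal, no other thread has modified i, and it can be set to 6.
Compare i with 1; if they are not equal, another thread has modified i, the current thread gives up the update, and the CAS operation fails.

When multiple threads use CAS on the same variable at the same time, only one wins and updates it; the rest fail. A failed thread is not suspended; it is simply told that it failed and may try again, or it may give up.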
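The retry behaviour can be sketched with AtomicInteger.compareAndSet, which exposes CAS at the Java API level. The class CasRetryDemo and its helper methods are illustrative names; AtomicInteger already offers addAndGet, so the explicit loop below exists only to make the E/N comparison visible.

```java
import java.util.concurrent.atomic.AtomicInteger;

final class CasRetryDemo {
    /** Adds delta with an explicit CAS retry loop and returns the new value. */
    static int addWithCas(AtomicInteger value, int delta) {
        while (true) {
            int expected = value.get();        // E: the value we believe V holds
            int updated = expected + delta;    // N: the value we want to write
            if (value.compareAndSet(expected, updated)) {
                return updated;                // V still equalled E, so the swap happened
            }
            // Another thread changed V in between: not suspended, just retry.
        }
    }

    /** Increments a shared counter from several threads and returns the final value. */
    static int countConcurrently(int threads, int incrementsPerThread) {
        AtomicInteger counter = new AtomicInteger();
        Thread[] workers = new Thread[threads];
        for (int i = 0; i < threads; i++) {
            workers[i] = new Thread(() -> {
                for (int j = 0; j < incrementsPerThread; j++) {
                    addWithCas(counter, 1);
                }
            });
            workers[i].start();
        }
        for (Thread worker : workers) {
            try {
                worker.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
        return counter.get();
    }

    public static void main(String[] args) {
        AtomicInteger i = new AtomicInteger(1);
        System.out.println(i.compareAndSet(1, 6)); // i was 1, so it becomes 6
        System.out.println(i.compareAndSet(1, 7)); // i is now 6, so this CAS fails
        System.out.println(countConcurrently(4, 1000));
    }
}
```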

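The Java language does not implement CAS directly; the implementation is native code (C++ with inline assembly, reached through native methods). The concrete implementation of CAS therefore depends on both the operating system and the CPU.

The Unsafe class in the sun.misc package provides compareAndSwapObject, compareAndSwapInt, and compareAndSwapLong to perform CAS on Object, int, and long values.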

/**
  *  CAS
  * @param o         the object containing the field to modify
  * @param offset    the offset of the field within the object
  * @param expected  the expected value
  * @param update    the new value
  * @return          true | false
  */
public final native boolean compareAndSwapObject(Object o, long offset,  Object expected, Object update);

public final native boolean compareAndSwapInt(Object o, long offset, int expected,int update);

public final native boolean compareAndSwapLong(Object o, long offset, long expected, long update);

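What problems does optimistic locking have?#

The ABA problem is the most common problem with optimistic locking.

The ABA problem#

If a variable V has value A when first read, and still has value A when we check it just before assigning, can we conclude that no other thread modified it? Clearly not: in the meantime its value may have been changed to something else and then changed back to A, and the CAS operation would wrongly conclude that it was never modified. This is known as the "ABA" problem of CAS.

The solution to the ABA problem is to attach a version number or timestamp to the variable. The AtomicStampedReference class, available since JDK 1.5, addresses the ABA problem. Its compareAndSet() method first checks whether the current reference equals the expected reference and whether the current stamp equals the expected stamp; if both match, it atomically sets the reference and the stamp to the given new values.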

public boolean compareAndSet(V   expectedReference,
                             V   newReference,
                             int expectedStamp,
                             int newStamp) {
    Pair<V> current = pair;
    return
        expectedReference == current.reference &&
        expectedStamp == current.stamp &&
        ((newReference == current.reference &&
          newStamp == current.stamp) ||
         casPair(current, Pair.of(newReference, newStamp)));
}
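To see the stamp at work, the sketch below replays an A -> B -> A sequence on a single thread (the "other thread" is simulated inline to keep the example deterministic). The class name AbaDemo is illustrative. String literals are used because AtomicStampedReference compares references with ==, and literals are interned.

```java
import java.util.concurrent.atomic.AtomicStampedReference;

final class AbaDemo {
    /** Returns {result of a CAS using a stale stamp, result of a CAS using the current stamp}. */
    static boolean[] run() {
        AtomicStampedReference<String> ref = new AtomicStampedReference<>("A", 0);

        // "Thread 1" reads the value and remembers the stamp it saw.
        int staleStamp = ref.getStamp(); // 0

        // Meanwhile "thread 2" changes A -> B -> A, bumping the stamp each time.
        ref.compareAndSet("A", "B", 0, 1);
        ref.compareAndSet("B", "A", 1, 2);

        // The reference is "A" again, but the stamp moved on, so the stale CAS is rejected.
        boolean staleCas = ref.compareAndSet("A", "C", staleStamp, staleStamp + 1);

        // A CAS that uses the current stamp succeeds.
        int currentStamp = ref.getStamp(); // 2
        boolean freshCas = ref.compareAndSet("A", "C", currentStamp, currentStamp + 1);

        return new boolean[] {staleCas, freshCas};
    }

    public static void main(String[] args) {
        boolean[] results = run();
        System.out.println("stale CAS: " + results[0]);
        System.out.println("fresh CAS: " + results[1]);
    }
}
```

A plain AtomicReference would accept the first "A" -> "C" swap, because it only compares the reference.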

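Long spin times are expensive#

CAS is often combined with spinning to retry: it loops until it succeeds. If it keeps failing for a long time, it puts a heavy load on the CPU.

If the JVM can use the pause instruction provided by the processor, efficiency improves somewhat. The pause instruction has two effects:

It delays the pipelined execution of instructions so that the CPU does not consume too many execution resources; the delay depends on the implementation, and on some processors it is zero.
It avoids the CPU pipeline flush caused by a memory order violation when the loop exits, which improves CPU efficiency.

Atomicity is guaranteed for only one shared variable#

CAS works on a single shared variable; it does not work when an operation spans multiple shared variables. Since JDK 1.5, however, the AtomicReference class provides atomicity for object references, so you can put several variables into one object and run CAS on that object. We can therefore either use a lock or use AtomicReference to merge several shared variables into one.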
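One way to merge several variables, sketched below, is to keep them in an immutable holder object and swap the whole holder with CAS. The classes AtomicRange and Bounds are made up for illustration; the invariant they protect (lower must not exceed upper) involves two values that must change together.

```java
import java.util.concurrent.atomic.AtomicReference;

final class AtomicRange {
    /** Immutable pair: both fields are replaced together, never one at a time. */
    private static final class Bounds {
        final int lower;
        final int upper;

        Bounds(int lower, int upper) {
            this.lower = lower;
            this.upper = upper;
        }
    }

    private final AtomicReference<Bounds> bounds;

    AtomicRange(int lower, int upper) {
        bounds = new AtomicReference<>(new Bounds(lower, upper));
    }

    /** Sets the lower bound unless that would break lower <= upper. */
    boolean setLower(int newLower) {
        while (true) {
            Bounds current = bounds.get();
            if (newLower > current.upper) {
                return false; // rejected: the invariant would be violated
            }
            Bounds updated = new Bounds(newLower, current.upper);
            if (bounds.compareAndSet(current, updated)) {
                return true;  // nobody replaced the holder in between
            }
            // Lost the race: re-read the current holder and try again.
        }
    }

    int lower() {
        return bounds.get().lower;
    }

    int upper() {
        return bounds.get().upper;
    }

    public static void main(String[] args) {
        AtomicRange range = new AtomicRange(0, 10);
        System.out.println(range.setLower(5));
        System.out.println(range.setLower(11));
        System.out.println(range.lower() + ".." + range.upper());
    }
}
```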

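The synchronized keyword#

What is synchronized and what is it for?#

synchronized is a Java keyword. It mainly solves the problem of synchronizing access to resources among multiple threads, and it guarantees that the method or block it modifies is executed by only one thread at any moment.

In early Java versions, synchronized was a heavyweight lock and was inefficient. The monitor lock relies on the underlying operating system's mutex lock, and Java threads are mapped onto native OS threads. Suspending or waking a thread requires help from the operating system, and switching between threads requires a transition from user mode to kernel mode, which takes a relatively long time.

Since Java 6, however, synchronized has gained many optimizations, such as spin locks, adaptive spinning, lock elimination, lock coarsening, biased locking, and lightweight locks, all of which reduce the cost of lock operations and make synchronized much more efficient. synchronized is therefore fine to use in real projects; the JDK source and many open-source frameworks use it extensively.

One more note on biased locking: it added complexity to the JVM without improving performance for every application. In JDK 15 biased locking was disabled by default (it could still be enabled with -XX:+UseBiasedLocking), and in JDK 18 it was retired completely (it can no longer be enabled from the command line).

How is synchronized used?#

There are 3 main ways to use synchronized:

On an instance method

Locks the current object instance; the instance's lock must be acquired before entering the synchronized code.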
```
synchronized void method() {
    // business logic
}
```
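On a static method

Locks the current class and applies to all instances of the class; the lock of the current class must be acquired before entering the synchronized code.

This is because static members do not belong to any instance; they belong to the class as a whole, do not depend on a particular instance, and are shared by all instances.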
```
synchronized static void method() {
    // business logic
}
```
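Do calls to a static synchronized method and a non-static synchronized method exclude each other? No. If thread A calls a non-static synchronized method on an instance while thread B calls a static synchronized method of that instance's class, both are allowed and no mutual exclusion occurs, because the static synchronized method uses the lock of the class while the non-static synchronized method uses the lock of the instance.

On a code block

Locks the object or class given in the parentheses:
- synchronized(object) means the lock of the given object must be acquired before entering the synchronized code.
- synchronized(ClassName.class) means the lock of the given Class must be acquired before entering the synchronized code.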
```
synchronized(this) {
    // business logic
}
```

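Summary:

synchronized on a static method and synchronized(class) on a block both lock the Class;
synchronized on an instance method locks the object instance;
Avoid synchronized(String a): the JVM caches strings in the string constant pool, so unrelated code may end up locking on the same String object.

Can a constructor be marked synchronized?#

The conclusion first: a constructor cannot be declared with the synchronized keyword (a synchronized block may still be used inside the constructor body).

A constructor is inherently thread-safe; there is no such thing as a synchronized constructor.

Do you understand how synchronized works under the hood?#

The underlying mechanism of synchronized belongs to the JVM level.

The synchronized block case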
```
public class SynchronizedDemo {
    public void method() {
        synchronized (this) {
            System.out.println("synchronized block");
        }
    }
}
```
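Use the javap command that ships with the JDK to view the bytecode of the SynchronizedDemo class: change to the class's directory, run javac SynchronizedDemo.java to generate the compiled .class file, then run javap -c -s -v -l SynchronizedDemo.class.

The bytecode shows that a synchronized block is implemented with the monitorenter and monitorexit instructions: monitorenter marks the start of the synchronized block and monitorexit marks its end.

The bytecode contains one monitorenter instruction and two monitorexit instructions. This ensures that the lock is released correctly both when the synchronized block completes normally and when it throws an exception.

When monitorenter executes, the thread tries to acquire the lock, that is, ownership of the object's monitor.

In the Java virtual machine (HotSpot), the monitor is implemented in C++ by ObjectMonitor. Every object has an ObjectMonitor associated with it.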

In addition, methods such as `wait/notify` also depend on the `monitor` object, which is why they can only be called inside a synchronized block or method; otherwise `java.lang.IllegalMonitorStateException` is thrown.

When `monitorenter` executes, the thread tries to acquire the object's lock. If the lock counter is 0, the lock is available; after acquiring it, the thread sets the counter to 1 (that is, increments it by 1).

![20240131203520.png](https://dreaife-1306766477.cos.ap-nanjing.myqcloud.com/20240131203520.png)

Only the thread that owns the object lock can execute `monitorexit` to release it. Executing `monitorexit` decrements the lock counter; when it reaches 0 the lock is released and other threads may try to acquire it.

[https://camo.githubusercontent.com/ff0fb002626c445b1adc69507f430bc0ffd1202c9e0decfc58749f71c8183587/68747470733a2f2f6f73732e6a61766167756964652e636e2f6769746875622f6a61766167756964652f6a6176612f636f6e63757272656e742f73796e6368726f6e697a65642d72656c656173652d6c6f636b2d626c6f636b2e706e67](https://camo.githubusercontent.com/ff0fb002626c445b1adc69507f430bc0ffd1202c9e0decfc58749f71c8183587/68747470733a2f2f6f73732e6a61766167756964652e636e2f6769746875622f6a61766167756964652f6a6176612f636f6e63757272656e742f73796e6368726f6e697a65642d72656c656173652d6c6f636b2d626c6f636b2e706e67)

If acquiring the object lock fails, the current thread blocks and waits until another thread releases the lock.

The synchronized method case
```
public class SynchronizedDemo2 {
    public synchronized void method() {
        System.out.println("synchronized method");
    }
}
```
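A synchronized method has no monitorenter or monitorexit instructions; instead it carries the ACC_SYNCHRONIZED flag, which marks the method as synchronized. The JVM uses the ACC_SYNCHRONIZED access flag to tell whether a method is declared synchronized and performs the corresponding synchronized invocation.

For an instance method, the JVM tries to acquire the lock of the instance. For a static method, the JVM tries to acquire the lock of the current class.

Summary#

A synchronized block is implemented with the monitorenter and monitorexit instructions: monitorenter marks the start of the synchronized block and monitorexit marks its end.

A synchronized method has no monitorenter or monitorexit instructions; instead it carries the ACC_SYNCHRONIZED flag, which marks the method as synchronized.

In both cases, the essence is acquiring the object's monitor.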

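What optimizations did synchronized receive after JDK 1.6? Do you understand lock escalation?#

Since Java 6, synchronized has gained many optimizations, such as spin locks, adaptive spinning, lock elimination, lock coarsening, biased locking, and lightweight locks, all of which reduce the cost of lock operations and make synchronized much more efficient (as mentioned earlier, biased locking was retired completely in JDK 18).

A lock has four states: unlocked, biased, lightweight, and heavyweight. The lock escalates through them as contention increases. Note that a lock can be upgraded but not downgraded; this policy improves the efficiency of acquiring and releasing locks.

What is the difference between synchronized and volatile?#

synchronized and volatile complement each other; they are not opposites.

volatile is a lightweight form of thread synchronization, so it generally performs better than synchronized. However, volatile can only be applied to variables, while synchronized can be applied to methods and code blocks.
volatile guarantees visibility but not atomicity. synchronized guarantees both.
volatile mainly solves the visibility of a variable across threads, while synchronized solves the synchronization of access to resources across threads.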

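ReentrantLock#

What is ReentrantLock?#

ReentrantLock implements the Lock interface. It is a reentrant, exclusive lock, similar to the synchronized keyword, but more flexible and more powerful: it adds polling, timeouts, interruption, and a choice between fair and non-fair locking.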

public class ReentrantLock implements Lock, java.io.Serializable {}

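ReentrantLock has an inner class Sync, which extends AQS (AbstractQueuedSynchronizer). Most of the work of acquiring and releasing the lock is implemented in Sync. Sync has two subclasses: FairSync for the fair lock and NonfairSync for the non-fair lock.

ReentrantLock uses the non-fair lock by default; the fair lock can be chosen explicitly through the constructor.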

// Takes a boolean: true for a fair lock, false for a non-fair lock
public ReentrantLock(boolean fair) {
    sync = fair ? new FairSync() : new NonfairSync();
}

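As the above shows, ReentrantLock is built on AQS.

What is the difference between a fair lock and a non-fair lock?#

Fair lock: after the lock is released, the thread that requested it first gets it first. Performance is somewhat worse, because guaranteeing strict ordering in time causes more frequent context switches.
Non-fair lock: after the lock is released, a thread that requested it later may get it first, either randomly or according to some other priority. Performance is better, but some threads may never get the lock.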

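What is the difference between `synchronized` and `ReentrantLock`?#

Both are reentrant locks

A reentrant lock, also called a recursive lock, is one that a thread can acquire again while already holding it. For example, a thread holds the lock of some object and has not released it yet; when it wants that object's lock again, it can still acquire it. With a non-reentrant lock, this would cause a deadlock.

All ready-made Lock implementation classes provided by the JDK, as well as the synchronized keyword, are reentrant.

synchronized depends on the JVM, while ReentrantLock depends on the API

synchronized is implemented by the JVM. As mentioned earlier, the VM team made many optimizations to synchronized in JDK 1.6, but they were implemented inside the virtual machine and are not directly exposed to us.

ReentrantLock is implemented at the JDK level (that is, the API level; it needs lock() and unlock() together with a try/finally block), so we can read its source code to see how it works.

ReentrantLock adds some advanced features over synchronized

Compared with synchronized, ReentrantLock adds some advanced features, mainly three:
- Interruptible waiting: ReentrantLock provides a way to interrupt a thread that is waiting for the lock, through lock.lockInterruptibly(). A waiting thread can thus give up waiting and do something else.
- Fair locking: ReentrantLock can be fair or non-fair, while synchronized can only be non-fair. With a fair lock, the thread that waited first gets the lock first. ReentrantLock is non-fair by default; use the ReentrantLock(boolean fair) constructor to choose.
- Selective notification (a lock can be bound to multiple conditions): synchronized combined with wait() and notify()/notifyAll() implements the wait/notify mechanism. ReentrantLock can do so too, with the help of the Condition interface and the newCondition() method.

If you need any of these features, ReentrantLock is a good choice.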
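The first feature, interruptible waiting, can be sketched as follows. The class InterruptibleLockDemo and the helper waiterGaveUp() are illustrative names. The caller holds the lock so that the second thread has to wait, then interrupts it; lockInterruptibly() responds by throwing InterruptedException instead of waiting forever.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.locks.ReentrantLock;

final class InterruptibleLockDemo {
    /** Returns true if the waiting thread abandoned the lock request when interrupted. */
    static boolean waiterGaveUp() {
        ReentrantLock lock = new ReentrantLock();
        AtomicBoolean gaveUp = new AtomicBoolean(false);

        lock.lock(); // the caller holds the lock, so the waiter cannot acquire it
        try {
            Thread waiter = new Thread(() -> {
                try {
                    lock.lockInterruptibly(); // waits, but reacts to interruption
                    lock.unlock();            // reached only if the lock was acquired
                } catch (InterruptedException e) {
                    gaveUp.set(true);         // stop waiting and do something else
                }
            });
            waiter.start();
            waiter.interrupt();
            try {
                waiter.join(2000);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        } finally {
            lock.unlock();
        }
        return gaveUp.get();
    }

    public static void main(String[] args) {
        System.out.println("waiter gave up: " + waiterGaveUp());
    }
}
```

A thread blocked on entering a synchronized block cannot be released this way; interrupting it only sets its interrupt flag.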

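What is the difference between interruptible and non-interruptible locks?#

Interruptible lock: acquisition can be interrupted, so the thread does not have to wait until it holds the lock before doing other work. ReentrantLock is an interruptible lock.
Non-interruptible lock: once a thread requests the lock, it can do other work only after obtaining it. synchronized is a non-interruptible lock.

ReentrantReadWriteLock#

ReentrantReadWriteLock is not used much in real projects and rarely comes up in interviews, so a basic understanding is enough. JDK 1.8 introduced StampedLock, a read-write lock with better performance.

What is ReentrantReadWriteLock?#

ReentrantReadWriteLock implements ReadWriteLock. It is a reentrant read-write lock that keeps reads by multiple threads efficient while keeping writes thread-safe.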

public class ReentrantReadWriteLock
        implements ReadWriteLock, java.io.Serializable{
}
public interface ReadWriteLock {
    Lock readLock();
    Lock writeLock();
}

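Ordinary locks: read-read, read-write, and write-write are all mutually exclusive.
Read-write locks: read-read is not mutually exclusive; read-write and write-write are (only read-read is allowed concurrently).

ReentrantReadWriteLock is really two locks: a WriteLock and a ReadLock. The read lock is shared and the write lock is exclusive. The read lock can be held by multiple threads at the same time, while the write lock can be held by at most one thread.

Like ReentrantLock, ReentrantReadWriteLock is built on AQS.

ReentrantReadWriteLock also supports fair and non-fair modes. It is non-fair by default, and the mode can be chosen explicitly through the constructor.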

// Takes a boolean: true for a fair lock, false for a non-fair lock
public ReentrantReadWriteLock(boolean fair) {
    sync = fair ? new FairSync() : new NonfairSync();
    readerLock = new ReadLock(this);
    writerLock = new WriteLock(this);
}

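When is ReentrantReadWriteLock a good fit?#

Because ReentrantReadWriteLock keeps concurrent reads efficient while keeping writes thread-safe, it can noticeably improve performance in read-heavy, write-light workloads.

What is the difference between a shared lock and an exclusive lock?#

Shared lock: the lock can be held by multiple threads at the same time.
Exclusive lock: the lock can be held by only one thread.

Can a thread that holds the read lock acquire the write lock?#

A thread that holds the read lock cannot acquire the write lock (an attempt to acquire the write lock cannot succeed while the read lock is held, no matter whether it is held by the current thread).
A thread that holds the write lock can go on to acquire the read lock (acquiring the read lock fails only when the write lock is held by a different thread).

Why can't the read lock be upgraded to the write lock?#

The write lock can be downgraded to the read lock, but the read lock cannot be upgraded to the write lock. Upgrading would make threads compete for the lock, since the write lock is exclusive, and that would hurt performance.

Deadlock is also possible. For example, if two threads holding the read lock both want to upgrade to the write lock, each needs the other to release its read lock; neither does, so they deadlock.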
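Both rules can be checked on a single thread with tryLock(), which returns immediately instead of blocking. This is a minimal sketch; the class name LockDowngradeDemo is illustrative.

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

final class LockDowngradeDemo {
    /** Returns {read lock obtained while holding the write lock, write lock obtained while holding the read lock}. */
    static boolean[] run() {
        ReentrantReadWriteLock rw = new ReentrantReadWriteLock();

        // Downgrade: take the write lock, then the read lock, then release the write lock.
        rw.writeLock().lock();
        boolean readWhileWriting = rw.readLock().tryLock(); // allowed for the write-lock owner
        rw.writeLock().unlock();                             // now only the read lock is held

        // Upgrade attempt: still holding the read lock, try to take the write lock.
        boolean writeWhileReading = rw.writeLock().tryLock(); // refused while any read lock is held
        if (writeWhileReading) {
            rw.writeLock().unlock(); // not expected to happen; kept so the locks stay balanced
        }
        if (readWhileWriting) {
            rw.readLock().unlock();
        }
        return new boolean[] {readWhileWriting, writeWhileReading};
    }

    public static void main(String[] args) {
        boolean[] results = run();
        System.out.println("downgrade (write -> read): " + results[0]);
        System.out.println("upgrade   (read -> write): " + results[1]);
    }
}
```

Calling writeLock().lock() instead of tryLock() in the upgrade attempt would block the thread forever, because it would be waiting for its own read lock to be released.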

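ThreadLocal#

What is ThreadLocal for?#

Normally, a variable we create can be accessed and modified by any thread. What if we want each thread to have its own private local variable?

The ThreadLocal class in the JDK solves exactly this problem. ThreadLocal lets each thread bind its own value. You can picture ThreadLocal as a box for storing data, in which each thread keeps its private data.

If you create a ThreadLocal variable, every thread that accesses it has its own local copy, which is where the name ThreadLocal comes from. Threads use get() and set() to read the default value or to change the value of their own copy, which avoids thread-safety problems.

How is ThreadLocal used?#

With the explanation above, it should be clear what ThreadLocal is. Here is a brief demonstration of how to use it in practice.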

import java.text.SimpleDateFormat;
import java.util.Random;

public class ThreadLocalExample implements Runnable{

    // SimpleDateFormat is not thread-safe, so each thread needs its own copy
    private static final ThreadLocal<SimpleDateFormat> formatter = ThreadLocal.withInitial(() -> new SimpleDateFormat("yyyyMMdd HHmm"));

    public static void main(String[] args) throws InterruptedException {
        ThreadLocalExample obj = new ThreadLocalExample();
        for(int i=0 ; i<10; i++){
            Thread t = new Thread(obj, ""+i);
            Thread.sleep(new Random().nextInt(1000));
            t.start();
        }
    }

    @Override
    public void run() {
        System.out.println("Thread Name= "+Thread.currentThread().getName()+" default Formatter = "+formatter.get().toPattern());
        try {
            Thread.sleep(new Random().nextInt(1000));
        } catch (InterruptedException e) {
            e.printStackTrace();
        }
        //formatter pattern is changed here by thread, but it won't reflect to other threads
        formatter.set(new SimpleDateFormat());

        System.out.println("Thread Name= "+Thread.currentThread().getName()+" formatter = "+formatter.get().toPattern());
    }

}

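The output shows that although thread 0 has changed the value of formatter, the default formatter seen by thread 1 is still the initial value, and the same holds for the other threads.

The code that creates the ThreadLocal variable uses a Java 8 feature. It is equivalent to the code below; if you write the code below, IDEA suggests converting it to the Java 8 form. ThreadLocal was extended in Java 8 with a new method, withInitial(), which takes a Supplier functional interface as its argument.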

private static final ThreadLocal<SimpleDateFormat> formatter = new ThreadLocal<SimpleDateFormat>(){
    @Override
    protected SimpleDateFormat initialValue(){
        return new SimpleDateFormat("yyyyMMdd HHmm");
    }
};

Do you understand how ThreadLocal works?#

Start with the source code of the Thread class.

public class Thread implements Runnable {
    //......
    // ThreadLocal values pertaining to this thread; maintained by the ThreadLocal class
    ThreadLocal.ThreadLocalMap threadLocals = null;

    // InheritableThreadLocal values pertaining to this thread; maintained by the InheritableThreadLocal class
    ThreadLocal.ThreadLocalMap inheritableThreadLocals = null;
    //......
}

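The Thread source shows that the class has two fields, threadLocals and inheritableThreadLocals, both of type ThreadLocalMap. ThreadLocalMap can be thought of as a customized HashMap implemented by ThreadLocal. Both fields are null by default and are created only when the current thread calls set or get on a ThreadLocal; those calls in turn invoke the corresponding get() and set() methods of ThreadLocalMap.

The set() method of ThreadLocal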

public void set(T value) {
    // Get the current thread
    Thread t = Thread.currentThread();
    // Fetch the thread's threadLocals field (a hash table)
    ThreadLocalMap map = getMap(t);
    if (map != null)
        // Store the value in that hash table
        map.set(this, value);
    else
        createMap(t, value);
}
ThreadLocalMap getMap(Thread t) {
    return t.threadLocals;
}

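From the above we can conclude that the value ends up in the current thread's ThreadLocalMap, not in the ThreadLocal itself. ThreadLocal can be seen as a wrapper around ThreadLocalMap that passes the value along. Inside ThreadLocal, Thread.currentThread() obtains the current thread, and getMap(Thread t) then gives direct access to that thread's ThreadLocalMap.

Every Thread has a ThreadLocalMap, and a ThreadLocalMap stores key-value pairs whose key is a ThreadLocal and whose value is an Object.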

ThreadLocalMap(ThreadLocal<?> firstKey, Object firstValue) {
    //......
}

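For example, if two ThreadLocal objects are used in the same thread, the Thread stores their data in its single ThreadLocalMap: the key is the ThreadLocal object and the value is whatever was set through that ThreadLocal's set method.

In terms of data structure, ThreadLocalMap is a static nested class of ThreadLocal.

What causes the ThreadLocal memory leak?#

The key in ThreadLocalMap is a weak reference to the ThreadLocal, while the value is a strong reference. So if nothing outside holds a strong reference to the ThreadLocal, the key is cleared during garbage collection but the value is not.

The ThreadLocalMap then contains an Entry whose key is null. If nothing is done, the value can never be collected by the GC, which may cause a memory leak. The ThreadLocalMap implementation takes this into account: calls to set(), get(), and remove() clean up entries whose key is null. Still, it is best to call remove() manually once you are done with a ThreadLocal.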

static class Entry extends WeakReference<ThreadLocal<?>> {
    /** The value associated with this ThreadLocal. */
    Object value;

    Entry(ThreadLocal<?> k, Object v) {
        super(k);
        value = v;
    }
}

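About weak references:

An object that has only weak references is like a household item you could take or leave. Compared with soft references, an object with only weak references has a shorter life cycle. When the garbage collector thread scans the memory it manages and finds an object that has only weak references, it reclaims the object's memory whether or not memory is running low. However, because the garbage collector is a low-priority thread, it may not find such objects quickly.

A weak reference can be used together with a reference queue (ReferenceQueue). When the object a weak reference points to is garbage collected, the JVM adds the weak reference to its associated reference queue.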
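The advice to call remove() matters most with pooled threads, which outlive individual tasks. The sketch below is illustrative only: the class ThreadLocalCleanupDemo, the field CURRENT_USER, and the value "alice" are made up. It runs two tasks on the same pooled thread and reports what the second task sees.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

final class ThreadLocalCleanupDemo {
    private static final ThreadLocal<String> CURRENT_USER = new ThreadLocal<>();

    /** Runs two tasks on one pooled thread and returns the value the second task sees. */
    static String valueSeenByNextTask(boolean cleanUp) {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        try {
            Runnable first = () -> {
                CURRENT_USER.set("alice");
                try {
                    // ... work that reads CURRENT_USER ...
                } finally {
                    if (cleanUp) {
                        CURRENT_USER.remove(); // drop the value before the thread is reused
                    }
                }
            };
            Callable<String> second = CURRENT_USER::get;

            pool.submit(first).get();         // wait for the first task to finish
            return pool.submit(second).get(); // same thread, next task
        } catch (Exception e) {
            throw new IllegalStateException(e);
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        System.out.println("with remove():    " + valueSeenByNextTask(true));
        System.out.println("without remove(): " + valueSeenByNextTask(false));
    }
}
```

Without remove(), the value stays strongly reachable from the pooled thread's ThreadLocalMap and is also visible to whatever task runs next on that thread.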

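Thread pools#

What is a thread pool?#

As the name suggests, a thread pool is a pool of resources that manages a set of threads. When there is a task to handle, a thread is taken from the pool to handle it; afterwards the thread is not destroyed immediately but waits for the next task.

Why use a thread pool?#

Pooling is a familiar technique; thread pools, database connection pools, HTTP connection pools, and so on all apply it. The idea is to reduce the cost of acquiring a resource each time and to improve resource utilization.

A thread pool provides a way to limit and manage resources (including those used to execute a task). Each thread pool also maintains some basic statistics, such as the number of completed tasks.

The benefits of thread pools, as described in "The Art of Java Concurrency Programming":

Lower resource consumption: reusing existing threads reduces the cost of creating and destroying threads.
Faster response: when a task arrives, it can run immediately without waiting for a thread to be created.
Better manageability: threads are a scarce resource. Creating them without limit consumes system resources and reduces stability; a thread pool allows unified allocation, tuning, and monitoring.

How is a thread pool created?#

Through the ThreadPoolExecutor constructor (recommended).
Through Executors, the utility class of the Executor framework.

Several kinds of ThreadPoolExecutor can be created:
- FixedThreadPool: returns a pool with a fixed number of threads; the number never changes. When a new task is submitted, it runs immediately if a thread is idle. Otherwise it is held in a task queue and handled when a thread becomes free.
- SingleThreadExecutor: returns a pool with a single thread. If more than one task is submitted, the extra tasks are held in a task queue and executed in first-in, first-out order when the thread becomes free.
- CachedThreadPool: returns a pool whose number of threads adjusts to the workload. Its initial size is 0. When a new task is submitted and no thread is available, the pool creates a new thread to handle it. A thread that stays idle for a period (60 seconds by default) times out and is destroyed, which shrinks the pool.
- ScheduledThreadPool: returns a pool that runs tasks after a given delay or periodically.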

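Why are the built-in thread pools not recommended?#

The "Concurrency" chapter of the Alibaba Java Development Manual states that thread resources must be provided through a thread pool and that applications must not create threads explicitly on their own.

Why?

A thread pool reduces the time and system resources spent on creating and destroying threads and addresses resource shortages. Without a thread pool, the system may create a large number of similar threads and run out of memory or suffer from excessive switching.

In addition, the Alibaba Java Development Manual mandates that thread pools must not be created with Executors but through the ThreadPoolExecutor constructor. This makes the pool's behavior explicit to whoever writes the code and avoids the risk of resource exhaustion.

The drawbacks of the pools returned by Executors are as follows (covered in detail later):

FixedThreadPool and SingleThreadExecutor: use an unbounded LinkedBlockingQueue whose maximum length is Integer.MAX_VALUE, so a large number of requests may pile up and cause an OOM.
CachedThreadPool: uses the synchronous queue SynchronousQueue and allows up to Integer.MAX_VALUE threads. If there are too many tasks and they run slowly, a large number of threads may be created and cause an OOM.
ScheduledThreadPool and SingleThreadScheduledExecutor: use the unbounded delay queue DelayedWorkQueue whose maximum length is Integer.MAX_VALUE, so a large number of requests may pile up and cause an OOM.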

// Unbounded queue LinkedBlockingQueue
public static ExecutorService newFixedThreadPool(int nThreads) {
    return new ThreadPoolExecutor(nThreads, nThreads,0L, TimeUnit.MILLISECONDS,new LinkedBlockingQueue<Runnable>());
}

// Unbounded queue LinkedBlockingQueue
public static ExecutorService newSingleThreadExecutor() {
    return new FinalizableDelegatedExecutorService (new ThreadPoolExecutor(1, 1,0L, TimeUnit.MILLISECONDS,new LinkedBlockingQueue<Runnable>()));
}

// Synchronous queue SynchronousQueue: no capacity; maximum threads is Integer.MAX_VALUE
public static ExecutorService newCachedThreadPool() {
    return new ThreadPoolExecutor(0, Integer.MAX_VALUE,60L, TimeUnit.SECONDS,new SynchronousQueue<Runnable>());
}

// DelayedWorkQueue (delay queue)
public static ScheduledExecutorService newScheduledThreadPool(int corePoolSize) {
    return new ScheduledThreadPoolExecutor(corePoolSize);
}
public ScheduledThreadPoolExecutor(int corePoolSize) {
    super(corePoolSize, Integer.MAX_VALUE, 0, NANOSECONDS,
          new DelayedWorkQueue());
}

What are the common thread pool parameters, and what do they mean?#

/**
 * Creates a new ThreadPoolExecutor with the given initial parameters.
 */
public ThreadPoolExecutor(int corePoolSize,                   // number of core threads in the pool
                          int maximumPoolSize,                // maximum number of threads in the pool
                          long keepAliveTime,                 // how long surplus idle threads may live once the thread count exceeds the core size
                          TimeUnit unit,                      // time unit of keepAliveTime
                          BlockingQueue<Runnable> workQueue,  // task queue holding tasks that are waiting to run
                          ThreadFactory threadFactory,        // thread factory used to create threads; the default is usually fine
                          RejectedExecutionHandler handler    // rejection policy, applied when tasks arrive faster than they can be handled
                          ) {
    if (corePoolSize < 0 ||
        maximumPoolSize <= 0 ||
        maximumPoolSize < corePoolSize ||
        keepAliveTime < 0)
        throw new IllegalArgumentException();
    if (workQueue == null || threadFactory == null || handler == null)
        throw new NullPointerException();
    this.corePoolSize = corePoolSize;
    this.maximumPoolSize = maximumPoolSize;
    this.workQueue = workQueue;
    this.keepAliveTime = unit.toNanos(keepAliveTime);
    this.threadFactory = threadFactory;
    this.handler = handler;
}

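The 3 most important ThreadPoolExecutor parameters:

corePoolSize: the maximum number of threads that can run at the same time while the task queue has not reached its capacity.
maximumPoolSize: once the number of queued tasks reaches the queue capacity, the number of threads that can run at the same time rises to this maximum.
workQueue: when a new task arrives, the pool first checks whether the number of running threads has reached the core size; if it has, the task is placed in this queue.

Other common ThreadPoolExecutor parameters:

keepAliveTime: when the pool holds more than corePoolSize threads and no new tasks arrive, the surplus idle threads are not destroyed immediately. They are reclaimed only after they have been idle for longer than keepAliveTime. When reclaiming, the pool treats core and non-core threads alike; it simply stops once the thread count is back down to corePoolSize.
unit: the time unit of the keepAliveTime parameter.
threadFactory: used whenever the pool creates a new thread.
handler: the saturation (rejection) policy, described separately below.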
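To make the parameter list concrete, here is a minimal sketch of a pool built through the constructor instead of Executors. The class name BoundedPoolExample and all of the sizes (2 core threads, 4 maximum, a queue of 100) are assumptions chosen for illustration, not recommended values.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.Executors;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

final class BoundedPoolExample {

    // Hypothetical sizes for illustration only; tune them for the real workload.
    static ThreadPoolExecutor newBoundedPool() {
        return new ThreadPoolExecutor(
                2,                                    // corePoolSize
                4,                                    // maximumPoolSize
                30L, TimeUnit.SECONDS,                // keepAliveTime and its unit
                new ArrayBlockingQueue<>(100),        // bounded workQueue: at most 100 waiting tasks
                Executors.defaultThreadFactory(),     // threadFactory
                new ThreadPoolExecutor.AbortPolicy()  // handler: throw when the pool is saturated
        );
    }

    public static void main(String[] args) {
        ThreadPoolExecutor pool = newBoundedPool();
        pool.execute(() -> System.out.println("running in " + Thread.currentThread().getName()));
        pool.shutdown();
    }
}
```

Because the queue is bounded, a burst of tasks ends in a rejection rather than in unbounded memory growth, which is the point of avoiding the Executors factory methods.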

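The figure below helps to clarify how the thread pool parameters relate to one another.

What saturation policies does a thread pool have?#

When the number of running threads has reached the maximum and the queue is also full, ThreadPoolExecutor applies one of these predefined policies:

ThreadPoolExecutor.AbortPolicy: throws RejectedExecutionException to reject the new task.
ThreadPoolExecutor.CallerRunsPolicy: runs the rejected task in the caller's own thread, that is, directly in the thread that called execute; if the executor has been shut down, the task is discarded. This slows down the rate at which new tasks are submitted and can affect overall performance. Choose it if your application can tolerate the delay and you need every task to be executed.
ThreadPoolExecutor.DiscardPolicy: does not process the new task and silently drops it.
ThreadPoolExecutor.DiscardOldestPolicy: drops the oldest unhandled task in the queue and then retries the submission.

For example: when we create a pool through Spring's ThreadPoolTaskExecutor or directly through the ThreadPoolExecutor constructor without specifying a RejectedExecutionHandler, the default policy is AbortPolicy. Under this policy, once the pool is saturated ThreadPoolExecutor throws RejectedExecutionException to reject the incoming task, which means that task is lost unless the caller handles the exception. If you do not want to lose tasks, use CallerRunsPolicy. Unlike the other policies, it neither drops the task nor throws an exception; it hands the task back to the caller and runs it on the caller's thread.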

public static class CallerRunsPolicy implements RejectedExecutionHandler {

    public CallerRunsPolicy() { }

    public void rejectedExecution(Runnable r, ThreadPoolExecutor e) {
        if (!e.isShutdown()) {
            // Run the task directly in the thread that called execute,
            // not in one of the pool's threads
            r.run();
        }
    }
}
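The effect of CallerRunsPolicy can be observed with a deliberately tiny pool. The sketch below assumes a pool with one thread and a queue of one slot (the class name CallerRunsDemo and these sizes are made up for the demonstration): the first task occupies the worker, the second fills the queue, and the third is rejected and therefore runs on the submitting thread.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicReference;

final class CallerRunsDemo {

    /** Returns the name of the thread that ended up running the rejected task. */
    static String threadThatRunsRejectedTask() {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                1, 1, 0L, TimeUnit.MILLISECONDS,
                new ArrayBlockingQueue<>(1),
                new ThreadPoolExecutor.CallerRunsPolicy());
        CountDownLatch release = new CountDownLatch(1);
        AtomicReference<String> runner = new AtomicReference<>();
        try {
            pool.execute(() -> awaitQuietly(release)); // occupies the only worker thread
            pool.execute(() -> { });                   // fills the single queue slot
            // Pool is saturated now: this task is rejected and run by the caller.
            pool.execute(() -> runner.set(Thread.currentThread().getName()));
        } finally {
            release.countDown();
            pool.shutdown();
        }
        return runner.get();
    }

    private static void awaitQuietly(CountDownLatch latch) {
        try {
            latch.await();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    public static void main(String[] args) {
        System.out.println("rejected task ran on: " + threadThatRunsRejectedTask());
    }
}
```

Since the rejected task runs synchronously inside execute, the submitting thread is busy for as long as the task takes, which is exactly the back-pressure described above.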

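Which blocking queues do thread pools commonly use?#

When a new task arrives, the pool first checks whether the number of running threads has reached the core size; if it has, the task is placed in the queue.

Different thread pools use different blocking queues, and the built-in pools make good examples.

LinkedBlockingQueue with a capacity of Integer.MAX_VALUE (unbounded queue): FixedThreadPool and SingleThreadExecutor. FixedThreadPool can create at most corePoolSize threads (its core and maximum sizes are equal), and SingleThreadExecutor can create only one thread (both sizes are 1); in both cases the task queue never fills up.
SynchronousQueue: CachedThreadPool. SynchronousQueue has no capacity and stores no elements. Its purpose is to ensure that a submitted task is handed to an idle thread if there is one, and that a new thread is created for it otherwise. In other words, the maximum thread count of CachedThreadPool is Integer.MAX_VALUE, so the pool can effectively grow without limit and may create so many threads that it causes an OOM.
DelayedWorkQueue (delayed blocking queue): ScheduledThreadPool and SingleThreadScheduledExecutor. Its elements are ordered by delay rather than by insertion time. Internally it is a heap, which guarantees that the task dequeued is always the one due soonest. When it is full it grows by half of its current capacity, so adding an element never blocks, and it can grow up to Integer.MAX_VALUE; as a result the pool never creates more than corePoolSize threads.

How does a thread pool process a task?#

If the number of running threads is less than the core size, a new thread is created to run the task.
If the number of running threads is equal to or greater than the core size, the pool tries to put the task into the task queue, where it waits to be executed.
If the task cannot be queued (the queue is full) but the number of running threads is less than the maximum, a new thread is created to run the task.
If the number of running threads has already reached the maximum, so that a new thread would exceed it, the task is rejected and the saturation policy is applied by calling RejectedExecutionHandler.rejectedExecution().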
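The four steps above can be watched on a small pool. This is a sketch under assumed sizes (core 1, maximum 2, queue capacity 1; the class name PoolFlowDemo is hypothetical): four blocking tasks are submitted, and after each submission the pool size and queue length are recorded.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.RejectedExecutionException;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

final class PoolFlowDemo {

    /** Submits four blocking tasks and records what the pool did with each one. */
    static List<String> observe() {
        // No handler given, so the default AbortPolicy is used.
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                1, 2, 60L, TimeUnit.SECONDS, new ArrayBlockingQueue<>(1));
        CountDownLatch release = new CountDownLatch(1);
        Runnable blockingTask = () -> {
            try {
                release.await(); // keep the worker busy until the demo is over
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        };
        List<String> steps = new ArrayList<>();
        try {
            for (int i = 0; i < 4; i++) {
                try {
                    pool.execute(blockingTask);
                    steps.add("threads=" + pool.getPoolSize()
                            + " queued=" + pool.getQueue().size());
                } catch (RejectedExecutionException e) {
                    steps.add("rejected");
                }
            }
        } finally {
            release.countDown();
            pool.shutdown();
        }
        return steps;
    }

    public static void main(String[] args) {
        observe().forEach(System.out::println);
    }
}
```

The first task starts a core thread, the second waits in the queue, the third finds the queue full and starts a second thread, and the fourth is rejected because both the queue and the pool are at their limits.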

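How do you name a thread pool?#

Name the pool explicitly when you initialize it (set a thread name prefix); it makes troubleshooting easier.

By default, the threads get names like pool-1-thread-n, which carry no business meaning and make problems harder to locate.

There are two common ways to name the threads in a pool:

1. Use guava's ThreadFactoryBuilder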

ThreadFactory threadFactory = new ThreadFactoryBuilder()
                        .setNameFormat(threadNamePrefix + "-%d")
                        .setDaemon(true).build();
ExecutorService threadPool = new ThreadPoolExecutor(corePoolSize, maximumPoolSize, keepAliveTime, TimeUnit.MINUTES, workQueue, threadFactory);

2. Implement ThreadFactory yourself.

import java.util.concurrent.ThreadFactory;
import java.util.concurrent.atomic.AtomicInteger;

/**
 * A thread factory that sets thread names, which helps when locating problems.
 */
public final class NamingThreadFactory implements ThreadFactory {

    private final AtomicInteger threadNum = new AtomicInteger();
    private final String name;

    /**
     * Creates a thread factory whose threads carry the given name.
     */
    public NamingThreadFactory(String name) {
        this.name = name;
    }

    @Override
    public Thread newThread(Runnable r) {
        Thread t = new Thread(r);
        t.setName(name + " [#" + threadNum.incrementAndGet() + "]");
        return t;
    }
}

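How should you size a thread pool?#

Many people assume that a bigger pool is better, but in a multithreaded setting the main effect of too many threads is a higher context-switching cost.

If the pool is too small and many tasks/requests arrive at the same time, large numbers of them end up waiting in the task queue. The queue may even fill up so that tasks/requests cannot be handled, or so many tasks may pile up that they cause an OOM. This is clearly a problem, and the CPU is not being fully used either.
If the pool is too large, many threads may compete for the CPU at the same time. This causes a lot of context switching, which lengthens each thread's execution time and hurts overall efficiency.

There is a simple rule of thumb that applies fairly widely:

CPU-bound tasks (N+1): these tasks mainly consume CPU, so the thread count can be set to N (the number of CPU cores) + 1. The one extra thread is there to absorb the occasional page fault or any other pause in a task. When a task pauses, a core would otherwise sit idle, and the extra thread can make use of that idle time.
I/O-bound tasks (2N): with these tasks the system spends most of its time on I/O, and a thread does not occupy the CPU while it is waiting for I/O, so the CPU can be handed to other threads. For I/O-bound work we can therefore configure more threads; the usual figure is 2N.

How do you tell whether a task is CPU-bound or I/O-bound?

Put simply, a CPU-bound task is one that exercises the CPU's computing power, such as sorting a large amount of data in memory. Anything that involves network or file reads is I/O-bound. For such tasks the time spent computing is small compared with the time spent waiting for I/O to complete.

A more rigorous way to calculate the thread count is: optimal thread count = N (number of CPU cores) * (1 + WT (thread wait time) / ST (thread compute time)), where WT (thread wait time) = total thread run time - ST (thread compute time).

The larger the share of wait time, the more threads are needed. The larger the share of compute time, the fewer threads are needed.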
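The formula is easy to turn into a small helper. The sketch below is only an illustration: the class and method names (PoolSizing, optimalThreads) are made up, the result is rounded down, and the wait and compute times are assumed to have been measured elsewhere in the same unit.

```java
final class PoolSizing {

    /**
     * threads = cores * (1 + waitTime / computeTime), rounded down, never below 1.
     * waitTime and computeTime must use the same unit (for example milliseconds).
     */
    static int optimalThreads(int cores, double waitTime, double computeTime) {
        if (cores <= 0 || waitTime < 0 || computeTime <= 0) {
            throw new IllegalArgumentException("cores and computeTime must be positive, waitTime non-negative");
        }
        return Math.max(1, (int) (cores * (1 + waitTime / computeTime)));
    }

    public static void main(String[] args) {
        int cores = Runtime.getRuntime().availableProcessors();
        // Pure computation: no waiting, so the result equals the core count.
        System.out.println("CPU-bound: " + optimalThreads(cores, 0, 10));
        // Hypothetical I/O-heavy task: 90 ms waiting for every 10 ms of computing.
        System.out.println("I/O-bound: " + optimalThreads(cores, 90, 10));
    }
}
```

With 8 cores and equal wait and compute times the helper returns 16, which is where the 2N rule of thumb comes from.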

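We can use VisualVM to look at the WT/ST ratio (it ships with the JDK up to JDK 8 and is a standalone download for later releases).

For CPU-bound tasks WT/ST is close or equal to 0, so the thread count can be set to N (number of CPU cores) * (1 + 0) = N, which is about the same as the N (number of CPU cores) + 1 mentioned above.

For I/O-bound tasks almost all of the time is wait time, so WT/ST is large and the formula would suggest far more than 2N threads. The 2N figure corresponds to WT being roughly equal to ST; it is best read as a conservative starting point that avoids creating too many threads.

The formula is only a reference. The actual value still has to be adjusted according to how the project behaves in production.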

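How do you change thread pool parameters dynamically?#

In the article "Java Thread Pool Implementation Principles and Their Practice at Meituan" (Java 线程池实现原理及其在美团业务中的实践), the Meituan tech team describes an approach for making thread pool parameters configurable.

Their idea is mainly to make the core parameters of the pool configurable at run time. The three core parameters are:

corePoolSize: the core size defines the minimum number of threads that can run at the same time.
maximumPoolSize: once the number of queued tasks reaches the queue capacity, the number of threads that can run at the same time rises to this maximum.
workQueue: when a new task arrives, the pool first checks whether the number of running threads has reached the core size; if it has, the task is placed in the queue.

Why these three parameters?

They are the most important parameters of ThreadPoolExecutor and largely determine how the pool handles tasks.

Pay special attention to corePoolSize. If we call setCorePoolSize() while the program is running, the pool first checks whether the current number of worker threads is greater than the new corePoolSize, and if so it reclaims the surplus worker threads.

You may also have noticed that there is no method for changing the queue length dynamically. Meituan's solution is a custom queue called ResizableCapacityLinkedBlockIngQueue (essentially LinkedBlockingQueue with the final modifier removed from the capacity field so that it becomes mutable).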
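The setters that make this possible are part of the standard ThreadPoolExecutor API. Below is a minimal sketch with made-up sizes (the class name DynamicPoolDemo is hypothetical). It raises the maximum before the core size so that the core size never exceeds the maximum at any point, which newer JDK versions reject with an IllegalArgumentException.

```java
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

final class DynamicPoolDemo {

    /** Resizes a running pool and returns {corePoolSize, maximumPoolSize} afterwards. */
    static int[] resize() {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                2, 4, 60L, TimeUnit.SECONDS, new LinkedBlockingQueue<>(100));
        try {
            // Growing: raise the maximum first, then the core size.
            pool.setMaximumPoolSize(8);
            pool.setCorePoolSize(6);
            return new int[] {pool.getCorePoolSize(), pool.getMaximumPoolSize()};
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        int[] sizes = resize();
        System.out.println("core=" + sizes[0] + " max=" + sizes[1]);
    }
}
```

When shrinking, the order is reversed: lower the core size first and the maximum second. In a real system the new values would typically come from a configuration center rather than being hard-coded.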

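If we want the same capability in our own project, we can use an existing open-source project:

Hippo4j: an asynchronous thread pool framework that supports dynamic changes, monitoring, and alerting for thread pools, and can be introduced without modifying code. It supports several usage modes and aims to improve the operational reliability of the system.
Dynamic TP: a lightweight dynamic thread pool with built-in monitoring and alerting. It integrates thread pool management for third-party middleware and is based on mainstream configuration centers (Nacos, Apollo, Zookeeper, Consul, and Etcd are supported, and custom implementations can be added through SPI).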

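How would you design a thread pool that executes tasks by priority?#

This is a common interview question. At heart it tests how well the candidate understands thread pools and blocking queues.

As mentioned above, different thread pools use different blocking queues as their task queue. FixedThreadPool, for example, uses LinkedBlockingQueue (an unbounded queue); because the queue never fills up, FixedThreadPool can create at most corePoolSize threads.

To build a thread pool that honors task priority, consider using PriorityBlockingQueue (a priority blocking queue) as the task queue (the ThreadPoolExecutor constructor has a workQueue parameter that accepts the task queue).

PriorityBlockingQueue is an unbounded blocking queue that supports priorities. It can be seen as a thread-safe PriorityQueue; both are backed by a binary min-heap, so the element with the smallest value is dequeued first. PriorityQueue, however, does not support blocking operations.

For PriorityBlockingQueue to order tasks, the tasks passed to it must be comparable. There are two ways to achieve this:

The tasks submitted to the pool implement the Comparable interface and override compareTo to define the priority comparison between tasks.
A Comparator is passed in when the PriorityBlockingQueue is created to define the ordering of tasks (recommended).

This approach has some risks and problems, though:

PriorityBlockingQueue is unbounded, so requests can pile up and cause an OOM.
It can cause starvation, where low-priority tasks are not executed for a long time.
Because the queue has to order its elements and stay thread-safe (concurrency control uses the reentrant lock ReentrantLock), performance is reduced.

The fix for the OOM problem is simple and blunt: extend PriorityBlockingQueue and override the offer method (enqueue) so that it returns false once the number of elements exceeds a given limit.

Starvation can be addressed through design (which is more work), for example by removing tasks that have waited too long and re-adding them to the queue with a higher priority.

The performance impact cannot be avoided, since the tasks have to be ordered. For most business scenarios this cost is acceptable.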
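Here is a minimal sketch of the Comparator-based variant. PriorityTask and PriorityPoolDemo are hypothetical names introduced for the example, and smaller numbers are assumed to mean higher priority. One worker thread is held busy while three tasks queue up, so the order in which they finally run shows the queue's ordering.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Comparator;
import java.util.List;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.PriorityBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

final class PriorityPoolDemo {

    /** Hypothetical task wrapper: smaller priority values run first. */
    static final class PriorityTask implements Runnable {
        final int priority;
        private final Runnable body;

        PriorityTask(int priority, Runnable body) {
            this.priority = priority;
            this.body = body;
        }

        @Override
        public void run() {
            body.run();
        }
    }

    /** Queues tasks with priorities 3, 1, 2 and returns the order in which they ran. */
    static List<Integer> runInPriorityOrder() {
        Comparator<Runnable> byPriority =
                Comparator.comparingInt(r -> ((PriorityTask) r).priority);
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                1, 1, 0L, TimeUnit.MILLISECONDS,
                new PriorityBlockingQueue<>(11, byPriority));
        List<Integer> order = Collections.synchronizedList(new ArrayList<>());
        CountDownLatch gate = new CountDownLatch(1);

        // Keep the single worker busy so that the next tasks have to queue.
        pool.execute(new PriorityTask(0, () -> awaitQuietly(gate)));
        for (int priority : new int[] {3, 1, 2}) {
            // Use execute(): submit() would wrap the task in a FutureTask,
            // which the comparator above cannot cast to PriorityTask.
            pool.execute(new PriorityTask(priority, () -> order.add(priority)));
        }
        gate.countDown();
        pool.shutdown();
        try {
            pool.awaitTermination(5, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return new ArrayList<>(order);
    }

    private static void awaitQuietly(CountDownLatch latch) {
        try {
            latch.await();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    public static void main(String[] args) {
        System.out.println("execution order: " + runInPriorityOrder());
    }
}
```

The sketch keeps the queue unbounded, so the OOM and starvation concerns listed above still apply to it.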

Future#

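What is the Future class for?#

Future is a typical application of asynchronous thinking. It is mainly used where a time-consuming task has to be executed and we do not want the program to sit and wait for it, which would be inefficient. Concretely: when we have a time-consuming task, we can hand it to a child thread to run asynchronously and get on with other work in the meantime. When we are done, we fetch the result of the task through Future. This noticeably improves the program's efficiency.

This is the classic Future pattern in multithreading. You can think of it as a design pattern whose core idea is asynchronous invocation. It is used mainly in multithreaded code and is not specific to Java.

In Java, Future is just a generic interface in the java.util.concurrent package. It defines 5 methods that cover these 4 capabilities:

Cancel the task;
Check whether the task was cancelled;
Check whether the task has completed;
Get the result of the task.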

// V is the type of the value returned by the task that the Future represents
public interface Future<V> {
    // Cancels the task
    // Returns true if the task was cancelled, false otherwise
    boolean cancel(boolean mayInterruptIfRunning);
    // Checks whether the task was cancelled
    boolean isCancelled();
    // Checks whether the task has completed
    boolean isDone();
    // Gets the result of the task
    V get() throws InterruptedException, ExecutionException;
    // Throws TimeoutException if no result is available within the given time
    V get(long timeout, TimeUnit unit)
        throws InterruptedException, ExecutionException, TimeoutException;
}

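Put simply: I have a task and hand it over to a Future. While the task is running I am free to do whatever else I want. During that time I can also cancel the task or check its status. Some time later, I can pick up the result of the task directly from the Future.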
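As a small illustration of that workflow, the sketch below submits a Callable, leaves room for other work, and then collects the result with a timeout. The class name FutureDemo and the summing task are made up for the example, and Executors.newSingleThreadExecutor() is used only to keep the demo short.

```java
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

final class FutureDemo {

    /** Computes 1 + 2 + ... + 100 on another thread and returns the result. */
    static int sumAsync() {
        ExecutorService executor = Executors.newSingleThreadExecutor();
        try {
            Future<Integer> future = executor.submit(() -> {
                int sum = 0;
                for (int i = 1; i <= 100; i++) {
                    sum += i;
                }
                return sum;
            });
            // The calling thread is free to do other work here.
            // get() blocks until the result is ready or the timeout expires.
            return future.get(5, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        } catch (ExecutionException | TimeoutException e) {
            throw new IllegalStateException(e);
        } finally {
            executor.shutdown();
        }
    }

    public static void main(String[] args) {
        System.out.println("sum = " + sumAsync());
    }
}
```

Using the timed get() rather than the plain one keeps the caller from blocking forever if the task hangs.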

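How are Callable and Future related?#

FutureTask is a good way to understand the relationship between Callable and Future.

FutureTask provides a basic implementation of the Future interface and is commonly used to wrap a Callable or a Runnable. It has methods for cancelling the task, checking whether it has completed, and getting its result. What ExecutorService.submit() returns is in fact FutureTask, an implementation of Future.

FutureTask implements not only the Future interface but also the Runnable interface, so it can be executed directly by a thread as a task.

FutureTask has two constructors, which accept either a Callable or a Runnable. A Runnable passed in is converted into a Callable internally.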

public FutureTask(Callable<V> callable) {
    if (callable == null)
        throw new NullPointerException();
    this.callable = callable;
    this.state = NEW;
}
public FutureTask(Runnable runnable, V result) {
    // The RunnableAdapter adapter converts the Runnable into a Callable
    this.callable = Executors.callable(runnable, result);
    this.state = NEW;
}

FutureTask is effectively a wrapper around a Callable: it manages the execution of the task and stores the result of the Callable's call method.
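Because FutureTask is both a Runnable and a Future, it can be handed to a plain Thread and queried afterwards. A minimal sketch (the class name FutureTaskDemo and the returned string are made up):

```java
import java.util.concurrent.ExecutionException;
import java.util.concurrent.FutureTask;

final class FutureTaskDemo {

    /** Runs a Callable on a plain thread through FutureTask and returns its result. */
    static String runOnPlainThread() {
        // FutureTask wraps the Callable ...
        FutureTask<String> task = new FutureTask<>(() -> "hello from call()");
        // ... is accepted by Thread because it is a Runnable ...
        new Thread(task).start();
        try {
            // ... and exposes the stored result because it is a Future.
            return task.get();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        } catch (ExecutionException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(runOnPlainThread());
    }
}
```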

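What is the `CompletableFuture` class for?#

In practice Future has some limitations: it does not support composing asynchronous tasks, and get(), the method for obtaining the result, is a blocking call.

CompletableFuture, introduced in Java 8, addresses these shortcomings. Besides offering a more convenient and more powerful Future, it adds functional programming support and the ability to compose asynchronous tasks (several asynchronous tasks can be chained together into one complete call chain).

Let's take a quick look at the definition of the CompletableFuture class.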

public class CompletableFuture<T> implements Future<T>, CompletionStage<T> {
}

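As you can see, CompletableFuture implements both the Future and the CompletionStage interfaces.

The CompletionStage interface describes one stage of an asynchronous computation. Many computations can be split into several stages or steps, and CompletionStage lets us combine all the steps into an asynchronous pipeline.

CompletionStage has quite a few methods, and it is this interface that gives CompletableFuture its functional capabilities. The method parameters show how heavily it relies on the functional programming features introduced in Java 8.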
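A short sketch of what such a pipeline looks like. The values and names (CompletableFutureDemo, price, tax) are invented for the example; supplyAsync, thenApply, thenCombine, and join are standard CompletableFuture methods.

```java
import java.util.concurrent.CompletableFuture;

final class CompletableFutureDemo {

    /** Chains and combines two asynchronous stages, then waits for the final value. */
    static int compose() {
        // Two independent asynchronous computations.
        CompletableFuture<Integer> price = CompletableFuture.supplyAsync(() -> 40);
        CompletableFuture<Integer> tax = CompletableFuture.supplyAsync(() -> 2);

        return price
                .thenApply(p -> p * 2)          // transform the first result: 40 -> 80
                .thenCombine(tax, Integer::sum) // merge with the second result: 80 + 2
                .join();                        // wait for the whole pipeline
    }

    public static void main(String[] args) {
        System.out.println("result = " + compose());
    }
}
```

Each step returns a new stage, so the chain only describes the computation; nothing blocks until join() is called at the end.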

AQS#

What is AQS?#

AQS stands for AbstractQueuedSynchronizer. The class lives in the java.util.concurrent.locks package.

AQS is an abstract class that is mainly used to build locks and synchronizers.

public abstract class AbstractQueuedSynchronizer extends AbstractOwnableSynchronizer implements java.io.Serializable {
}

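AQS provides implementations of the functionality that locks and synchronizers have in common, so it makes it simple and efficient to build a wide range of synchronizers. The ReentrantLock and Semaphore mentioned earlier, as well as others such as ReentrantReadWriteLock and CountDownLatch, are all based on AQS.

How does AQS work?#

The core idea of AQS is this: if the requested shared resource is free, the requesting thread becomes the active worker thread and the resource is marked as locked. If the requested resource is occupied, there has to be a mechanism for blocking and waiting threads and for handing out the lock when they are woken up. AQS implements this mechanism with a CLH queue lock, that is, threads that cannot get the lock for now are added to a queue.

The CLH (Craig, Landin, and Hagersten) queue is a virtual doubly linked queue (virtual meaning that there is no queue instance, only the links between nodes). AQS wraps every thread that requests the shared resource in a node (Node) of the CLH lock queue in order to allocate the lock. In the CLH sync queue one node represents one thread and holds a reference to the thread (thread), the node's status in the queue (waitStatus), its predecessor (prev), and its successor (next).

The structure of the CLH queue is shown in the figure below:

Diagram of the core principle of AQS (AbstractQueuedSynchronizer):

AQS uses an int member variable, state, to represent the synchronization state, and a built-in thread wait queue to queue the threads that are trying to acquire the resource.

The state variable is declared volatile and reflects whether the critical resource is currently locked.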

// Shared variable; volatile guarantees visibility across threads
private volatile int state;

The state can be manipulated through the protected methods getState(), setState(), and compareAndSetState(). All of them are final, so subclasses cannot override them.

// Returns the current value of the synchronization state
protected final int getState() {
     return state;
}
// Sets the value of the synchronization state
protected final void setState(int newState) {
     state = newState;
}
// Atomically (CAS) sets the synchronization state to update if the current value equals expect
protected final boolean compareAndSetState(int expect, int update) {
      return unsafe.compareAndSwapInt(this, stateOffset, expect, update);
}

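Take ReentrantLock as an example. state starts at 0, which means unlocked. When thread A calls lock(), it calls tryAcquire() to take the lock exclusively and increments state by 1. From then on, tryAcquire() fails for other threads until thread A has called unlock() often enough to bring state back to 0 (that is, released the lock); only then do other threads get a chance to acquire it. Before releasing the lock, thread A itself may acquire it again (state keeps increasing), which is what reentrancy means. Note that the lock must be released as many times as it was acquired, otherwise state never returns to zero.

Now take CountDownLatch as an example. The work is split across N child threads, and state is initialized to N as well (N must match the number of threads). The N child threads run in parallel, and each calls countDown() once when it finishes, which decrements state by 1 using CAS (Compare and Swap). Once all child threads have finished (state = 0), the main calling thread is unparked; it then returns from await() and carries on with the rest of its work.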
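To show how little code a synchronizer needs on top of AQS, here is a sketch of a non-reentrant mutex, loosely modeled on the example in the AbstractQueuedSynchronizer Javadoc. SimpleMutex is a made-up name; state 0 means unlocked and 1 means locked. Unlike ReentrantLock, the owning thread cannot acquire it a second time.

```java
import java.util.concurrent.locks.AbstractQueuedSynchronizer;

final class SimpleMutex {

    private static final class Sync extends AbstractQueuedSynchronizer {
        @Override
        protected boolean tryAcquire(int ignored) {
            // CAS state from 0 (free) to 1 (held); only one thread can win.
            if (compareAndSetState(0, 1)) {
                setExclusiveOwnerThread(Thread.currentThread());
                return true;
            }
            return false;
        }

        @Override
        protected boolean tryRelease(int ignored) {
            if (getState() == 0) {
                throw new IllegalMonitorStateException();
            }
            setExclusiveOwnerThread(null);
            setState(0); // back to "free"; AQS then wakes a queued thread
            return true;
        }

        @Override
        protected boolean isHeldExclusively() {
            return getState() == 1;
        }
    }

    private final Sync sync = new Sync();

    void lock() { sync.acquire(1); }          // queues and parks the thread if tryAcquire fails
    boolean tryLock() { return sync.tryAcquire(1); }
    void unlock() { sync.release(1); }
    boolean isLocked() { return sync.isHeldExclusively(); }

    public static void main(String[] args) throws InterruptedException {
        SimpleMutex mutex = new SimpleMutex();
        int[] counter = {0};
        Runnable work = () -> {
            for (int i = 0; i < 10_000; i++) {
                mutex.lock();
                try {
                    counter[0]++;
                } finally {
                    mutex.unlock();
                }
            }
        };
        Thread a = new Thread(work);
        Thread b = new Thread(work);
        a.start();
        b.start();
        a.join();
        b.join();
        System.out.println("counter = " + counter[0]);
    }
}
```

The subclass only decides whether an acquire or release succeeds by manipulating state; queuing, parking, and waking threads are all inherited from AQS.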

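What is Semaphore for?#

synchronized and ReentrantLock both allow only one thread at a time to access a resource, whereas Semaphore can be used to control how many threads may access a particular resource at the same time.

Semaphore is simple to use. Suppose N (N > 5) threads want to acquire the shared resource guarded by a Semaphore. The code below means that at any moment only 5 of the N threads can hold the resource; the others block. Only the threads that hold the resource can proceed, and the blocked threads can acquire it only once another thread has released it.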

// Initial number of shared resources (permits)
final Semaphore semaphore = new Semaphore(5);
// Acquire 1 permit
semaphore.acquire();
// Release 1 permit
semaphore.release();

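When the initial number of resources is 1, Semaphore degenerates into an exclusive lock.

Semaphore has two modes:

Fair mode: permits are granted in the order in which acquire() was called, following FIFO;
Non-fair mode: preemptive.

The two corresponding constructors are: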

public Semaphore(int permits) {
    sync = new NonfairSync(permits);
}

public Semaphore(int permits, boolean fair) {
    sync = fair ? new FairSync(permits) : new NonfairSync(permits);
}

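Both constructors require the number of permits. The second one also lets you choose between fair and non-fair mode; the default is non-fair.

Semaphore is typically used where access to a resource has a clear numerical limit, such as rate limiting (single-machine only; in real projects Redis + Lua is recommended for rate limiting).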
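The permit accounting is easiest to see with the non-blocking tryAcquire(). The sketch below assumes 2 permits and a made-up class name, SemaphoreDemo: two acquisitions succeed, the third fails because no permit is left, and it succeeds again after one permit has been released.

```java
import java.util.concurrent.Semaphore;

final class SemaphoreDemo {

    /** Returns the outcome of four tryAcquire() calls on a semaphore with 2 permits. */
    static boolean[] tryFourTimes() {
        Semaphore semaphore = new Semaphore(2);
        boolean first = semaphore.tryAcquire();   // takes permit 1 of 2
        boolean second = semaphore.tryAcquire();  // takes permit 2 of 2
        boolean third = semaphore.tryAcquire();   // no permit left, returns immediately
        semaphore.release();                      // hand one permit back
        boolean fourth = semaphore.tryAcquire();  // succeeds again
        return new boolean[] {first, second, third, fourth};
    }

    public static void main(String[] args) {
        boolean[] r = tryFourTimes();
        System.out.println(r[0] + " " + r[1] + " " + r[2] + " " + r[3]);
    }
}
```

With acquire() instead of tryAcquire(), the third call would block until another thread released a permit.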

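How does Semaphore work?#

Semaphore is an implementation of a shared lock. By default it initializes the AQS state to permits. You can think of permits as the number of licenses available; only a thread that holds a license may proceed.

When semaphore.acquire() is called, the thread tries to obtain a permit. If the number of permits that would remain afterwards (state - 1) is >= 0, the acquisition can succeed, and a CAS operation sets state = state - 1. If the remaining number would be < 0, there are not enough permits. In that case a Node is created and added to the blocking queue, and the current thread is suspended.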

/**
 * Acquires 1 permit
 */
public void acquire() throws InterruptedException {
    sync.acquireSharedInterruptibly(1);
}
/**
 * Acquires permits in shared mode: returns on success; on failure the thread
 * is added to the blocking queue and suspended
 */
public final void acquireSharedInterruptibly(int arg)
    throws InterruptedException {
    if (Thread.interrupted())
      throw new InterruptedException();
    // Try to acquire arg permits. If the available permits minus the requested
    // permits is less than 0, create a node, add it to the blocking queue,
    // and suspend the current thread.
    if (tryAcquireShared(arg) < 0)
      doAcquireSharedInterruptibly(arg);
}

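When semaphore.release() is called, the thread returns a permit and uses a CAS operation to set state = state + 1. After the permit has been released, a thread in the sync queue is woken up. The woken thread tries again to set state = state - 1; if the result is >= 0 it obtains the permit, otherwise it goes back into the blocking queue and is suspended.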

// Releases one permit
public void release() {
    sync.releaseShared(1);
}

// Releases the shared lock and wakes up a thread in the sync queue
public final boolean releaseShared(int arg) {
    // Release the shared lock
    if (tryReleaseShared(arg)) {
      // Wake up a thread in the sync queue
      doReleaseShared();
      return true;
    }
    return false;
}

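What is CountDownLatch for?#

CountDownLatch lets one or more threads block at a given point until count operations running in other threads have all completed.

CountDownLatch is single-use. The counter can only be initialized once, in the constructor, and there is no mechanism for setting it again. Once a CountDownLatch has been used up, it cannot be reused.

How does CountDownLatch work?#

CountDownLatch is an implementation of a shared lock. By default it initializes the AQS state to count. When a thread calls countDown(), it actually uses tryReleaseShared to decrement state with CAS until state reaches 0. When await() is called and state is not 0, the work is not finished yet, so await() keeps blocking and the statements after it are not executed. Only when countDown() has been called count times and state has dropped to 0, or when the thread that called await() is interrupted, is that thread woken up so that the statements after await() can run.

Have you used CountDownLatch? In what scenario?#

The purpose of CountDownLatch is to let a thread wait at a given point until count tasks have all completed. For example, we used CountDownLatch in a scenario where several files were read and processed by multiple threads.

We had to read and process 6 files. The 6 tasks had no ordering dependencies between them, but the results for all files had to be collected and summarized before being returned to the user.

For this we defined a thread pool and a CountDownLatch with a count of 6. The pool handles the read tasks, and each thread decrements the count by 1 when it finishes. The main thread calls await() on the CountDownLatch and continues with the subsequent logic only after all files have been read.

The pseudocode looks like this: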

import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class CountDownLatchExample1 {
    // Number of files to process
    private static final int threadCount = 6;

    public static void main(String[] args) throws InterruptedException {
        // Create a pool with a fixed number of threads (building it through the constructor is recommended)
        ExecutorService threadPool = Executors.newFixedThreadPool(10);
        final CountDownLatch countDownLatch = new CountDownLatch(threadCount);
        for (int i = 0; i < threadCount; i++) {
            final int threadnum = i;
            threadPool.execute(() -> {
                try {
                    // Business logic for processing file number threadnum
                    //......
                } catch (Exception e) {
                    // Exception rather than InterruptedException, so the placeholder body compiles
                    e.printStackTrace();
                } finally {
                    // One file has been processed
                    countDownLatch.countDown();
                }

            });
        }
        countDownLatch.await();
        threadPool.shutdown();
        System.out.println("finish");
    }
}

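Is there room for improvement?

Yes, with CompletableFuture. Java 8's CompletableFuture offers many methods that are friendly to multithreaded code and makes it easy to write: asynchronous, sequential, or parallel execution, or waiting for all threads to finish their tasks, are all very convenient.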

CompletableFuture<Void> task1 =
    CompletableFuture.runAsync(()->{
        // custom business logic
    });
......
CompletableFuture<Void> task6 =
    CompletableFuture.runAsync(()->{
        // custom business logic
    });
......
CompletableFuture<Void> headerFuture=CompletableFuture.allOf(task1,.....,task6);

try {
    headerFuture.join();
} catch (Exception ex) {
    //......
}
System.out.println("all done. ");

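The code above can be improved further. When there are many tasks it is not practical to list every task individually, so consider adding the tasks in a loop.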

// File locations
List<String> filePaths = Arrays.asList(...);
// Process all files asynchronously (doSomeThing is expected to return a CompletableFuture<String>)
List<CompletableFuture<String>> fileFutures = filePaths.stream()
    .map(filePath -> doSomeThing(filePath))
    .collect(Collectors.toList());
// Combine them into one future
CompletableFuture<Void> allFutures = CompletableFuture.allOf(
    fileFutures.toArray(new CompletableFuture[fileFutures.size()])
);

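What is CyclicBarrier for?#

CyclicBarrier is very similar to CountDownLatch. It can also make threads wait on a count, but it is more complex and more powerful than CountDownLatch. Its main use cases are similar to those of CountDownLatch.

CountDownLatch is implemented on top of AQS, whereas CyclicBarrier is based on ReentrantLock (which is itself an AQS synchronizer) and Condition.

CyclicBarrier literally means a barrier (Barrier) that can be used cyclically (Cyclic). What it does is block a group of threads when they reach a barrier (also called a synchronization point) until the last thread arrives; only then does the barrier open, and all the blocked threads continue their work.

How does CyclicBarrier work?#

Internally CyclicBarrier uses a count variable as its counter. count starts at the initial value of the parties field, and every time a thread reaches the barrier the counter is decremented by 1. When count reaches 0, the last thread of this generation has arrived at the barrier, and the barrier tries to run the task that was passed to the constructor.

The default constructor of CyclicBarrier is CyclicBarrier(int parties), whose argument is the number of threads the barrier intercepts. Each thread calls await() to tell the CyclicBarrier that it has reached the barrier, and is then blocked.

Here parties is the number of threads to intercept. When the number of intercepted threads reaches this value, the barrier opens and lets all of them through.
When await() is called on a CyclicBarrier, what actually runs is dowait(false, 0L). Calling await() is like putting up a fence that holds the threads back; only when the number of threads held back reaches parties does the fence open and let the threads continue.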
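The "cyclic" part is what sets it apart from CountDownLatch: the same barrier can be crossed again and again. The sketch below (CyclicBarrierDemo and runRounds are made-up names) lets a group of threads meet at the barrier for several rounds and counts how often the barrier action ran.

```java
import java.util.concurrent.BrokenBarrierException;
import java.util.concurrent.CyclicBarrier;
import java.util.concurrent.atomic.AtomicInteger;

final class CyclicBarrierDemo {

    /** Lets `parties` threads meet at one barrier `rounds` times; returns how often it opened. */
    static int runRounds(int parties, int rounds) {
        AtomicInteger barrierTrips = new AtomicInteger();
        // The barrier action runs once per generation, when the last thread arrives.
        CyclicBarrier barrier = new CyclicBarrier(parties, barrierTrips::incrementAndGet);

        Thread[] workers = new Thread[parties];
        for (int i = 0; i < parties; i++) {
            workers[i] = new Thread(() -> {
                try {
                    for (int round = 0; round < rounds; round++) {
                        barrier.await(); // block until all parties have arrived
                    }
                } catch (InterruptedException | BrokenBarrierException e) {
                    Thread.currentThread().interrupt();
                }
            });
            workers[i].start();
        }
        for (Thread worker : workers) {
            try {
                worker.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
        return barrierTrips.get();
    }

    public static void main(String[] args) {
        System.out.println("barrier opened " + runRounds(3, 2) + " times");
    }
}
```

After each round the barrier resets itself automatically, so no new object is needed for the next round.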

Java Concurrency#

What are threads and processes?#

What is a process?#

A process is one execution of a program and the basic unit in which the system runs programs, so a process is dynamic. Running a program means that a process goes through creation, execution, and termination.

In Java, when we start the main function, we actually start a JVM process, and the thread containing the main function is a thread within this process, also called the main thread.

What is a thread?#

Threads are similar to processes, but a thread is a smaller execution unit than a process. A process can spawn multiple threads during its execution. Unlike processes, multiple threads of the same kind share the process’s heap and method area resources, but each thread has its own program counter, JVM stack, and native method stack. Therefore, the burden of creating a thread or switching between threads is much lighter than for processes, which is why threads are also called lightweight processes.

Java programs are inherently multi-threaded, and we can use JMX to see what threads exist in a normal Java program. The code is as follows.

import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;

public class MultiThread {
 public static void main(String[] args) {
  // Get the Java thread management MXBean
  ThreadMXBean threadMXBean = ManagementFactory.getThreadMXBean();
  // Monitor and synchronizer information is not needed; fetch only thread and thread stack information
  ThreadInfo[] threadInfos = threadMXBean.dumpAllThreads(false, false);
  // Iterate over the thread information and print only the thread ID and thread name
  for (ThreadInfo threadInfo : threadInfos) {
   System.out.println("[" + threadInfo.getThreadId() + "] " + threadInfo.getThreadName());
  }
 }
}

The above program output is as follows (the output contents may differ; don’t worry too much about what each thread does below—just know that the main thread runs the main method):

[5] Attach Listener // Add event
[4] Signal Dispatcher // Thread that dispatches JVM signals
[3] Finalizer // Thread that calls object finalize methods
[2] Reference Handler // Thread that clears references
[1] main // main thread, program entry

From the above output you can see: a Java program runs with the main thread and multiple other threads concurrently.

What is the difference between Java threads and operating system threads?#

Before JDK 1.2, Java threads were implemented as Green Threads (user-level threads), i.e., the JVM simulated multi-threading itself without relying on the OS. Because green threads have limitations (e.g., they cannot directly use OS-provided features like asynchronous I/O, and they can only run on a single kernel thread, preventing multi-core utilization), starting with JDK 1.2, Java threads were implemented on native threads. That means the JVM directly uses the OS native kernel threads (kernel threads) to implement Java threads, with the OS kernel performing thread scheduling and management.

We mentioned user threads and kernel threads above. For readers who aren’t familiar with their differences, a brief introduction:

User threads: managed and scheduled by user-space programs, running in user space (exclusively for applications).
Kernel threads: managed and scheduled by the operating system kernel, running in kernel space (accessible only to kernel programs).

In short, user threads have lower creation and switching costs, but cannot utilize multi-core; kernel threads have higher creation and switching costs, but can utilize multi-core.

The relationship in one sentence: today's Java threads are, in essence, operating system threads.

Thread models describe how user threads map to kernel threads. The common thread models are:

One-to-one (one user thread per kernel thread)
Many-to-one (multiple user threads map to one kernel thread)
Many-to-many (multiple user threads map to multiple kernel threads)

In Windows and Linux, Java threads adopt a one-to-one thread model, i.e., one Java thread corresponds to a system kernel thread. Solaris is a special case (Solaris itself supports a many-to-many thread model); HotSpot VM on Solaris supports both many-to-many and one-to-one.

Please briefly describe the relationship between threads and processes, their differences, and advantages and disadvantages?#

Let's look at the relationship between processes and threads from the JVM's perspective.

Diagram of the relationship between processes and threads#

The following image shows the Java memory areas; from the JVM perspective, here is the relationship between threads and processes.

From the figure above you can see: a process can have multiple threads; multiple threads share the process’s heap and method area (metaspace after JDK 1.8), but each thread has its own program counter, virtual machine stack, and native method stack.

Summary: Threads are smaller running units divided from a process. The biggest difference between threads and processes is that processes are largely independent, while threads in the same process can affect each other. Threads have low execution overhead but are harder to manage and protect resources; processes are the opposite.

Here is some additional extended content about this knowledge point!

Think about this question: why is the program counter, the VM stack, and the native method stack thread-private? Why are the heap and the method area shared among threads?

Why is the program counter private?#

The program counter has two main purposes:

The bytecode interpreter uses the program counter to read instructions in sequence, enabling code flow control such as sequential execution, branching, looping, and exception handling.
In a multi-threaded scenario, the program counter records the current thread’s execution position, so when the thread is switched back in, it knows where it left off.

Note that if a native method is being executed, the program counter holds an undefined address; only when executing Java code does it hold the address of the next instruction.

Thus, making the program counter private primarily ensures that after a thread switch, execution resumes at the correct position.

Why are the VM stack and the native method stack private?#

VM stack: Each Java method, before execution, creates a stack frame to store local variables, operand stacks, constant pool references, etc. As the method is invoked and returns, stack frames are pushed onto and popped off the Java VM stack.
Native method stack: Similar in function to the VM stack, but the VM stack serves Java methods (bytecode), whereas the native method stack serves Native methods used by the VM. In HotSpot, the native method stack is merged with the Java VM stack.

Therefore, to guarantee that local variables in a thread are not accessible by other threads, the VM stack and native method stack are thread-private.

A quick one-sentence explanation of the heap and the method area#

The heap and the method area are shared resources among all threads. The heap is the largest memory region in the process, mainly used for storing newly created objects (almost all objects are allocated here). The method area is mainly used to store loaded class information, constants, static variables, and code generated by the just-in-time compiler, among other data.

Concurrency vs Parallelism#

Concurrency: two or more tasks execute during the same time period.
Parallelism: two or more tasks execute at the same exact moment.

The key point is whether they execute simultaneously.

Synchronous vs Asynchronous#

Synchronous: after issuing a call, you cannot return until the result is obtained; you wait.
Asynchronous: after issuing a call, you don’t wait for the result; the call returns immediately.

Why use multithreading?#

First, overall:

From the computer’s bottom layer: threads can be seen as lightweight processes, the smallest unit of program execution; thread context switching and scheduling costs are far less than processes. Also, in the era of multi-core CPUs, multiple threads can run at the same time, reducing thread context switching overhead.
From the development trend of the Internet: modern systems often have to handle millions or even tens of millions of concurrent requests, and multithreaded concurrent programming is the foundation for building high-concurrency systems; making good use of multithreading can greatly improve a system's overall concurrency and performance.

Delving to the computer’s bottom layer:

Single-core era: multithreading mainly aimed to improve how well a single process uses the CPU and the I/O devices. Suppose only one Java process is running. If that process has a single thread and the thread blocks on I/O, the whole process is blocked; only one of the CPU and the I/O device is busy at any time, so roughly speaking overall efficiency is only about 50%. With multiple threads, while one thread blocks on I/O the others can keep using the CPU, which raises the process's overall use of system resources.
Multi-core era: multithreading mainly improves a process's ability to use multiple CPU cores. For example, a complex computation running on one thread uses only one core no matter how many cores the machine has. With multiple threads, the work can be mapped onto several cores, and when the threads do not contend for resources the execution time drops significantly, to roughly (single-core execution time) / (number of CPU cores).

What problems can using multithreading bring?#

The goal of concurrent programming is to improve the program’s execution efficiency and speed, but concurrency does not always increase speed, and it can introduce problems such as memory leaks, deadlocks, and thread safety issues.

How to understand thread safety and unsafety?#

Thread safety and unsafety describe whether access to the same data in a multi-threaded environment can guarantee correctness and consistency.

Thread-safe means that in a multi-threaded environment, regardless of how many threads access the same data concurrently, the data remains correct and consistent.
Thread-unsafe means that in a multi-threaded environment, concurrent access to the same data may lead to data corruption, errors, or loss.

Will running multiple threads on a single-core CPU necessarily be faster?#

Whether running multiple threads on a single-core CPU increases efficiency depends on the thread type and the task nature. There are two types of threads: CPU-intensive and IO-intensive. CPU-intensive threads perform computations and logic and require substantial CPU resources. IO-intensive threads perform input/output operations like reading/writing files or network communication, waiting for IO devices.

On a single-core CPU, only one thread can run at a time; other threads wait for CPU time slices. If the task is CPU-intensive, many threads will cause frequent context switches and reduce efficiency. If the task is IO-intensive, multiple threads can utilize the CPU’s idle time while waiting for IO, improving efficiency.

Therefore, on a single-core CPU, if the task is CPU-intensive, too many threads will harm efficiency; if IO-intensive, more threads can improve efficiency. Of course, “many” should be moderate and not exceed what the system can handle.

Talk about the thread lifecycle and states?#

A Java thread can be in one of six states at specified moments in its lifecycle:

NEW: initial state; the thread is created but start() hasn’t been called.
RUNNABLE: running state; the thread is started and waiting to run.
BLOCKED: blocked state; waiting for a lock.
WAITING: waiting state; the thread needs other threads to perform certain actions (notification or interruption).
TIMED_WAITING: timed waiting state; can return after a specified time instead of waiting indefinitely.
TERMINATED: terminated state; the thread has finished running.

Threads can switch between these states as the code executes.

From the above figure: after creation the thread is in NEW state; after calling start() it begins to run, and the thread is in READY (runnable) state. A runnable thread that obtains a CPU time slice then enters RUNNING.

When a thread executes wait(), it enters WAITING. A thread in waiting state must rely on other threads’ notifications to return to running.
TIMED_WAITING is like WAITING with a timeout; for example, sleep(long millis) or wait(long millis) can place a thread into TIMED_WAITING. When the timeout expires, the thread returns to RUNNABLE.
If a thread enters a synchronized method/block or re-enters a synchronized method/block after wait (notify), but the lock is held by another thread, the thread will enter BLOCKED.
After a thread completes the run() method, it will enter TERMINATED.
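Some of these transitions can be observed with Thread.getState(). The sketch below is an illustration with made-up names (ThreadStateDemo, observe). It parks the worker on a CountDownLatch rather than on wait(), which puts it into the same WAITING state, and polls until that state is visible.

```java
import java.util.concurrent.CountDownLatch;

final class ThreadStateDemo {

    /** Returns the worker's state before start(), while it is parked, and after join(). */
    static Thread.State[] observe() {
        CountDownLatch gate = new CountDownLatch(1);
        Thread worker = new Thread(() -> {
            try {
                gate.await(); // parks the thread: WAITING
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });

        Thread.State beforeStart = worker.getState(); // NEW: start() not called yet
        worker.start();

        // Poll until the worker has actually parked on the latch.
        Thread.State whileParked;
        do {
            whileParked = worker.getState();
            Thread.yield();
        } while (whileParked != Thread.State.WAITING);

        gate.countDown(); // let the worker finish run()
        try {
            worker.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        Thread.State afterJoin = worker.getState(); // TERMINATED once run() has returned
        return new Thread.State[] {beforeStart, whileParked, afterJoin};
    }

    public static void main(String[] args) {
        for (Thread.State state : observe()) {
            System.out.println(state);
        }
    }
}
```

Note that the Thread.State enum has no separate READY and RUNNING values; both are reported as RUNNABLE.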

What is thread context switching?#

During execution, a thread has its own running conditions and state (also called context), such as the program counter and stack information mentioned above. A thread leaves the CPU due to:

Actively yielding the CPU, e.g., sleep(), wait(), etc.
Time slice exhaustion, to prevent a thread or process from hogging the CPU and starving others.
Making a blocking system call, e.g., an IO request; the thread is blocked.
Termination or end of execution.

The first three cause thread switches; a thread switch requires saving the current thread’s context and restoring the context for the next thread that will use the CPU. This is called a context switch.

Context switching is a fundamental feature of modern operating systems. Because it requires saving and restoring state, it consumes CPU and memory resources, leading to some overhead; frequent switching reduces overall efficiency.

What is a thread deadlock? How to avoid deadlock?#

Understanding thread deadlock#

Thread deadlock describes a situation where several threads are blocked, one or more of which are waiting for resources to be released. Since threads are blocked indefinitely, the program cannot terminate normally.

As shown below, Thread A holds resource 2 and Thread B holds resource 1; they both attempt to acquire each other’s resource, so they wait for one another and enter a deadlock state.

Four necessary conditions for deadlock:

Mutual exclusion: A resource is held by only one thread at any moment.
Hold and wait: A thread is blocked while requesting a resource and does not release resources it already holds.
No preemption: Resources held by a thread are not forcibly taken away until the thread releases them.
Circular wait: A set of threads forms a cycle in which each thread holds a resource the next thread needs.

How to prevent and avoid thread deadlocks?#

How to prevent deadlock? Break the necessary conditions for deadlock:

Break the hold-and-wait condition: Acquire all required resources at once.
Break the no-preemption condition: If a thread holds some resources and cannot obtain more, it can release its current resources.
Break the circular-wait condition: Acquire resources in a fixed order and release them in the reverse order.
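
As one possible illustration of breaking the no-preemption condition, the sketch below uses ReentrantLock.tryLock with a timeout: a thread that cannot get its second lock releases the first and retries. The class TryLockDemo, the helper withBoth, and the 50 ms timeout are assumptions chosen for this example, not something the article prescribes.

```java
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.Lock;
import java.util.concurrent.locks.ReentrantLock;

class TryLockDemo {

    // Runs the action while holding both locks. If the second lock cannot be
    // obtained in time, the first one is released and the attempt is retried.
    static void withBoth(Lock first, Lock second, Runnable action) {
        try {
            while (true) {
                if (first.tryLock(50, TimeUnit.MILLISECONDS)) {
                    try {
                        if (second.tryLock(50, TimeUnit.MILLISECONDS)) {
                            try {
                                action.run();
                                return;
                            } finally {
                                second.unlock();
                            }
                        }
                    } finally {
                        first.unlock();   // give up what we hold before retrying
                    }
                }
                // A short random back-off lowers the chance of two threads retrying in lockstep.
                Thread.sleep((long) (Math.random() * 10));
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while acquiring locks", e);
        }
    }

    // Two threads take the locks in opposite order, the classic deadlock setup.
    static int runOppositeOrder() {
        Lock a = new ReentrantLock();
        Lock b = new ReentrantLock();
        int[] done = {0};                 // only modified while both locks are held
        Thread t1 = new Thread(() -> withBoth(a, b, () -> done[0]++), "Thread 1");
        Thread t2 = new Thread(() -> withBoth(b, a, () -> done[0]++), "Thread 2");
        t1.start();
        t2.start();
        try {
            t1.join();
            t2.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return done[0];
    }

    public static void main(String[] args) {
        System.out.println("completed tasks: " + runOppositeOrder());
    }
}
```

With synchronized this pattern is not available, because a thread blocked on a monitor cannot time out and back off.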

How to avoid deadlocks?

Avoiding deadlock means using an algorithm (e.g., Banker’s algorithm) to compute and evaluate resource distribution so that the system enters a safe state.

A safe state means the system can allocate resources to each thread in some sequence (P1, P2, P3, … Pn) so that every thread can complete. The sequence <P1, P2, P3, … Pn> is called a safe sequence.

We can modify the code for Thread 2 as follows to avoid deadlock.

    new Thread(() -> {
        synchronized (resource1) {
            System.out.println(Thread.currentThread() + "get resource1");
            try {
                Thread.sleep(1000);
            } catch (InterruptedException e) {
                e.printStackTrace();
            }
            System.out.println(Thread.currentThread() + "waiting get resource2");
            synchronized (resource2) {
                System.out.println(Thread.currentThread() + "get resource2");
            }
        }
    }, "Thread 1").start();

    new Thread(() -> {
        synchronized (resource1) {
            System.out.println(Thread.currentThread() + "get resource1");
            try {
                Thread.sleep(1000);
            } catch (InterruptedException e) {
                e.printStackTrace();
            }
            System.out.println(Thread.currentThread() + "waiting get resource2");
            synchronized (resource2) {
                System.out.println(Thread.currentThread() + "get resource2");
            }
        }
    }, "Thread 2").start();

We can analyze why the above code avoids deadlock:

Thread 1 first obtains the monitor lock on resource1; Thread 2 cannot obtain it. Then Thread 1 proceeds to obtain the monitor lock on resource2. After completing, Thread 1 releases the locks on resource1 and resource2, allowing Thread 2 to proceed. This breaks the circular wait condition and thereby avoids deadlock.

sleep() vs wait()#

Commonality: Both can pause a thread’s execution.

Differences:

sleep() does not release the lock, whereas wait() releases the lock.
wait() is typically used for inter-thread communication; sleep() is typically used to pause execution.
After wait() is called, the thread does not wake up on its own; another thread must call notify() or notifyAll() on the same object. A thread that calls sleep() wakes up automatically once the sleep time has elapsed; wait(long timeout) likewise wakes up automatically when the timeout expires.
sleep() is a static native method of the Thread class; wait() is a native method of the Object class.
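
The first and third differences can be seen in a small sketch. It is only an illustration; the class WaitReleasesLockDemo and its field names are hypothetical. The main thread can enter the synchronized block only because the waiting thread released the monitor inside wait(), and the waiter resumes only after notifyAll().

```java
class WaitReleasesLockDemo {
    private final Object lock = new Object();
    private boolean ready = false;                          // guarded by lock
    private final StringBuilder log = new StringBuilder();  // guarded by lock

    String run() {
        Thread waiter = new Thread(() -> {
            synchronized (lock) {
                while (!ready) {                 // loop guards against spurious wakeups
                    try {
                        lock.wait();             // releases the monitor while waiting
                    } catch (InterruptedException e) {
                        Thread.currentThread().interrupt();
                        return;
                    }
                }
                log.append("waiter resumed");
            }
        }, "waiter");
        waiter.start();
        try {
            // Poll until the waiter is really inside wait().
            while (waiter.getState() != Thread.State.WAITING) {
                Thread.sleep(1);                 // sleep() never releases a lock; none is held here
            }
            synchronized (lock) {                // succeeds because wait() gave the monitor up
                log.append("main got lock; ");
                ready = true;
                lock.notifyAll();
            }
            waiter.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return log.toString();
    }

    public static void main(String[] args) {
        System.out.println(new WaitReleasesLockDemo().run());
    }
}
```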

Why isn’t wait() defined in Thread?#

wait() makes the thread that owns the object’s lock wait and automatically releases the object’s lock. Each object (Object) has its own lock; since you need to release the current thread’s lock on the object and put it into WAITING state, you must operate on the corresponding object (Object) rather than the current thread (Thread).

Similar question: Why is the sleep() method defined in Thread?

Because sleep() pauses the execution of the current thread; it does not involve any particular object and does not require acquiring an object lock.

Can you directly call the Thread class’s run method?#

Creating a new Thread puts it into the NEW state. Calling start() starts a thread and puts the thread into the READY state; when allocated a time slice, it can begin to run. start() performs the thread’s necessary preparation and then automatically executes the contents of the run() method, which is the actual multi-threaded work. However, directly executing the run() method will treat run() as a normal method on the main thread and will not run in a new thread, so this is not multi-threaded work.

Summary: Only by calling start() can you start the thread and have it enter the ready state; directly invoking run() will not run in a multi-threaded way.
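
A quick way to check this is to record which thread actually executes the Runnable. The sketch below is illustrative only (StartVsRunDemo and the thread names are made up): the direct run() call executes on the caller's thread, while start() executes on the new thread.

```java
class StartVsRunDemo {

    // Returns { thread name seen via run(), thread name seen via start() }.
    static String[] namesSeen() {
        String[] seen = new String[2];

        Thread direct = new Thread(() -> seen[0] = Thread.currentThread().getName(), "worker-run");
        direct.run();                      // plain method call on the calling thread

        Thread started = new Thread(() -> seen[1] = Thread.currentThread().getName(), "worker-start");
        started.start();                   // run() is invoked on the new thread
        try {
            started.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return seen;
    }

    public static void main(String[] args) {
        String[] names = namesSeen();
        System.out.println("run()   executed on: " + names[0]); // the caller, e.g. main
        System.out.println("start() executed on: " + names[1]); // worker-start
    }
}
```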

The volatile keyword#

How to guarantee visibility of variables?#

In Java, the volatile keyword guarantees visibility. If a variable is declared volatile, it indicates to the JVM that this variable is shared and may change; every use of it reads from the main memory.

[Image: JMM (Java Memory Model)]

[Image: JMM (Java Memory Model) forcing reads from main memory]

The volatile keyword is not unique to Java; in C it exists as well. Its original meaning is to disable CPU caching. If a variable is marked volatile, it tells the compiler that the variable is shared and may change, so every use reads from main memory.

Volatile guarantees visibility but does not guarantee atomicity. The synchronized keyword guarantees both.
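
A typical use of the visibility guarantee is a stop flag. The following sketch is an assumption-laden illustration (the class VolatileFlagDemo and the 50 ms / 2 s timings are arbitrary): because running is volatile, the worker is guaranteed to observe the write made by the other thread and leave its loop. Without volatile, the JIT compiler is allowed to hoist the read out of the loop, and the worker might spin forever.

```java
class VolatileFlagDemo {
    private volatile boolean running = true;

    // Starts a spinning worker, clears the flag, and reports whether the worker stopped.
    boolean stopsAfterFlagCleared() {
        Thread worker = new Thread(() -> {
            long spins = 0;
            while (running) {       // volatile read on every iteration
                spins++;
            }
        }, "worker");
        worker.start();
        try {
            Thread.sleep(50);       // let the worker spin for a moment
            running = false;        // volatile write, visible to the worker
            worker.join(2000);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return !worker.isAlive();
    }

    public static void main(String[] args) {
        System.out.println("worker stopped: " + new VolatileFlagDemo().stopsAfterFlagCleared());
    }
}
```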

How to prevent instruction reordering?#

In Java, the volatile keyword not only guarantees visibility but also prevents the JVM from reordering instructions. If a variable is declared volatile, reads and writes to this variable insert specific memory barriers to prevent instruction reordering.

In Java, the Unsafe class provides three out-of-the-box memory barrier methods, abstracting OS-level differences:

    public native void loadFence();
    public native void storeFence();
    public native void fullFence();

Theoretically, you could achieve the same effect as volatile’s reordering prevention with these three methods, but it’s more cumbersome.

Now I’ll use a common interview question to illustrate how volatile prevents instruction reordering.

Interviewers often say: “Do you know the singleton pattern? Please hand-write it for me! Explain the principle of the double-checked locking approach to implement a singleton.”

Double-checked locking to implement a thread-safe singleton:

    public class Singleton {

        private volatile static Singleton uniqueInstance;

        private Singleton() {
        }

        public static Singleton getUniqueInstance() {
            // First check if the object is already instantiated; if not, enter synchronized code
            if (uniqueInstance == null) {
                // Lock the class object
                synchronized (Singleton.class) {
                    if (uniqueInstance == null) {
                        uniqueInstance = new Singleton();
                    }
                }
            }
            return uniqueInstance;
        }
    }

Using the volatile keyword for uniqueInstance is very important. The statement uniqueInstance = new Singleton(); is actually executed in three steps:

1. Allocate memory for uniqueInstance
2. Initialize uniqueInstance
3. Set the uniqueInstance reference to the allocated memory address

But due to the JVM’s instruction reordering, the execution order may become 1-3-2. In a single-threaded environment this is not a problem, but in a multi-threaded context it can cause a thread to obtain an instance that has not yet been initialized. For example, Thread T1 executes 1 and 3; then Thread T2 calls getUniqueInstance() and finds uniqueInstance is not null, returns it, but it hasn’t been initialized yet.

Can volatile guarantee atomicity?#

Volatile guarantees visibility but cannot guarantee atomicity of operations on the variable.

We can prove it with the following example:

    import java.util.concurrent.ExecutorService;
    import java.util.concurrent.Executors;

    public class VolatileAtomicityDemo {
        public volatile static int inc = 0;

        public void increase() {
            inc++;
        }

        public static void main(String[] args) throws InterruptedException {
            ExecutorService threadPool = Executors.newFixedThreadPool(5);
            VolatileAtomicityDemo volatileAtomicityDemo = new VolatileAtomicityDemo();
            for (int i = 0; i < 5; i++) {
                threadPool.execute(() -> {
                    for (int j = 0; j < 500; j++) {
                        volatileAtomicityDemo.increase();
                    }
                });
            }
            // Wait 1.5 seconds to ensure above tasks finish
            Thread.sleep(1500);
            System.out.println(inc);
            threadPool.shutdown();
        }
    }

Ideally, this should print 2500. In practice, the output is often less than 2500.

Why does this happen? Didn’t volatile guarantee visibility?

Put differently: if volatile could guarantee the atomicity of inc++, then after each thread increments inc, other threads would see the updated value immediately. With five threads each performing 500 increments, inc would end up as 5 * 500 = 2500.

Many people mistakenly think the increment operation inc++ is atomic. In fact, inc++ is a composite operation with three steps:

Read the value of inc
Add 1 to it
Write the value back to memory

Volatile cannot guarantee that these three steps are atomic; this can lead to the following situation:

Thread 1 reads inc and is not yet modifying it. Thread 2 reads inc, increments it (+1), and writes back.
Thread 1 then updates inc (+1) and writes back.

This results in both threads performing one increment, but inc only increases by 1 overall.

In fact, if you want to ensure correctness of the above code, you can easily do so with synchronized, Lock, or AtomicInteger.

Using synchronized:

    public synchronized void increase() {
        inc++;
    }

Using AtomicInteger:

    public AtomicInteger inc = new AtomicInteger();

    public void increase() {
        inc.getAndIncrement();
    }

Using ReentrantLock:

    Lock lock = new ReentrantLock();

    public void increase() {
        lock.lock();
        try {
            inc++;
        } finally {
            lock.unlock();
        }
    }

Optimistic locking and pessimistic locking#

What is a pessimistic lock?#

A pessimistic lock always assumes the worst case: that a shared resource will be modified when accessed. So each time a resource is acquired, it is locked; other threads attempting to access the resource will be blocked until the lock is released by the current holder. In other words, a shared resource is used by only one thread at a time, and others wait.

In Java, exclusive locks like synchronized and ReentrantLock embody the pessimistic locking mindset.

    public void performSynchronisedTask() {
        synchronized (this) {
            // synchronized operations
        }
    }

    private final Lock lock = new ReentrantLock();

    // Hypothetical method name; it wraps the lock usage so the snippet is valid inside a class
    public void performLockedTask() {
        lock.lock();
        try {
            // synchronized operations
        } finally {
            lock.unlock();
        }
    }

In high-concurrency scenarios, heavy lock contention leaves many threads blocked, and the large number of blocked threads causes frequent context switches, which increases system overhead. Pessimistic locks may also lead to deadlocks, which stop the code from making progress.

What is optimistic locking?#

Optimistic locking assumes the best case: shared resources are not expected to be modified during access; threads proceed without locking or waiting, and only verify, when committing updates, whether the resource has been modified by another thread (the typical methods use versioning or CAS).

In Java, atomic variables under java.util.concurrent.atomic (for example AtomicInteger, LongAdder) implement optimistic locking using CAS.

    // LongAdder can outperform AtomicInteger and AtomicLong under high concurrency
    // The cost is higher memory usage (space-for-time trade-off)
    LongAdder sum = new LongAdder();
    sum.increment();

Under high concurrency, optimistic locking avoids lock contention and thread blocking and often outperforms pessimistic locking. However, if conflicts occur frequently (e.g., write-heavy scenarios), there will be frequent failures and retries, which can also degrade performance and raise CPU usage.

Nevertheless, many failures and retries can be mitigated; as mentioned above, LongAdder uses a space-for-time approach to solve this problem.

In theory:

Optimistic locking is typically used where writes are relatively rare (read-heavy scenarios, low contention); this avoids frequent locking and improves performance. But optimistic locking mainly targets a single shared variable (see atomic variable classes under java.util.concurrent.atomic).
Pessimistic locking is typically used where writes are frequent (high contention), to avoid frequent failures and retries; its overhead is fixed. If optimistic locking solves frequent failures and retries (as with LongAdder), it can be considered, depending on the situation.

How to implement optimistic locking?#

Optimistic locking is usually implemented with a versioning mechanism or CAS; CAS is more common and requires attention.

Versioning mechanism#

Typically add a version column named version in the data table to indicate how many times the data has been modified. When data is modified, the version value is incremented. When thread A wants to update data, it reads the data and the version value; when submitting the update, if the version read equals the current version in the database, update; otherwise retry the update until success.

A simple example: suppose an accounts table has a version field with current value 1, and the account balance (balance) is $100.

Operator A reads the account (version = 1) and deducts $50 from the balance ($100 - $50).
Operator B reads the same account (version = 1) and deducts $20 from the balance ($100 - $20).
Operator A completes the update, submitting version = 1 along with the updated balance ($50). The database sees the submitted version equals the current version and updates the balance; the database version becomes 2.
Operator B attempts to submit with version = 1 and balance = $80, but the database current version is 2, so the optimistic lock policy fails and the submission is rejected.

This prevents Operator B’s update, which was computed from stale data, from overwriting Operator A’s result.
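
The scenario above can be mimicked in memory. This is only a sketch under stated assumptions: the class VersionedAccount is hypothetical, and its synchronized update method stands in for what a database would do atomically with a statement along the lines of UPDATE ... SET balance = ?, version = version + 1 WHERE version = ? (table and column names are illustrative).

```java
class VersionedAccount {
    private long balance;
    private int version;

    VersionedAccount(long balance, int version) {
        this.balance = balance;
        this.version = version;
    }

    synchronized long balance() {
        return balance;
    }

    synchronized int version() {
        return version;
    }

    // Applies the update only if the caller read the current version;
    // synchronized here plays the role of the database's atomic conditional UPDATE.
    synchronized boolean update(long newBalance, int expectedVersion) {
        if (version != expectedVersion) {
            return false;            // someone else committed first: caller must re-read and retry
        }
        balance = newBalance;
        version++;
        return true;
    }

    public static void main(String[] args) {
        VersionedAccount account = new VersionedAccount(100, 1);
        int versionSeenByA = account.version();   // 1
        int versionSeenByB = account.version();   // 1

        System.out.println(account.update(100 - 50, versionSeenByA)); // true, version becomes 2
        System.out.println(account.update(100 - 20, versionSeenByB)); // false, stale version
        System.out.println(account.balance());                        // 50
    }
}
```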

CAS#

CAS stands for Compare And Swap; used to implement optimistic locking and widely applied in frameworks. The idea is to compare the current value with an expected value and, if equal, swap to a new value atomically.

CAS is atomic and relies on a CPU instruction.

An atomic operation is the smallest indivisible operation, which, once started, cannot be interrupted until it completes.

CAS involves three operands:

V: value to be updated (Var)
E: expected value (Expected)
N: new value intended to be written (New)

Only when V equals E will CAS atomically update V to N. If not equal, another thread updated V, so the current thread gives up.

An example: Thread A wants to change i to 6, with i initially 1 (V = 1, E = 1, N = 6, ABA not considered).

Compare i with 1; if equal, set to 6.
If not equal, another thread changed i; CAS fails.

When multiple threads use CAS on a single variable, only one will win; the others fail, but failed threads are not blocked; they are informed of the failure and may retry, or give up.
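
In everyday code, CAS is usually reached through the atomic classes rather than Unsafe. The sketch below replays the i = 1 to 6 example with AtomicInteger.compareAndSet and adds a typical retry loop; the class CasDemo and the helper addWithCas are illustrative names, not an existing API.

```java
import java.util.concurrent.atomic.AtomicInteger;

class CasDemo {

    // Classic CAS retry loop: recompute and retry until no other thread interfered.
    static int addWithCas(AtomicInteger value, int delta) {
        while (true) {
            int expected = value.get();          // E: the value we believe is current
            int updated = expected + delta;      // N: the value we want to write
            if (value.compareAndSet(expected, updated)) {
                return updated;                  // V equalled E, so V is now N
            }
            // CAS failed: another thread changed the value; the thread is not blocked, it just loops
        }
    }

    public static void main(String[] args) {
        AtomicInteger i = new AtomicInteger(1);
        System.out.println(i.compareAndSet(1, 6));  // true:  i was 1, now 6
        System.out.println(i.compareAndSet(1, 7));  // false: i is 6, not the expected 1
        System.out.println(addWithCas(i, 4));       // 10
    }
}
```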

Java does not provide a direct CAS implementation; CAS related implementations are achieved via C++ inline assembly (JNI). Therefore, CAS implementation depends on the OS and CPU.

Unsafe in sun.misc provides compareAndSwapObject, compareAndSwapInt, compareAndSwapLong to perform CAS on Object, int, long.

    /**
     * CAS
     * @param o         the object containing the field to modify
     * @param offset    the offset of the field within the object
     * @param expected  the expected value
     * @param update    the update value
     * @return          true | false
     */
    public final native boolean compareAndSwapObject(Object o, long offset, Object expected, Object update);

    public final native boolean compareAndSwapInt(Object o, long offset, int expected, int update);

    public final native boolean compareAndSwapLong(Object o, long offset, long expected, long update);

What problems exist with optimistic locking?#

The ABA problem is the most common issue with optimistic locking.

ABA problem#

If a variable V is first read as A, and while we prepare to update it we see it’s still A, can we guarantee it hasn’t been changed by other threads? Obviously not, because in the meantime it might have been changed to some other value and then changed back to A, causing a CAS operation to falsely believe it has never been modified. This is called the CAS “ABA” problem.

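The solution is to attach a version number or timestamp to the variable. Since JDK 1.5, the AtomicStampedReference class addresses the ABA problem: its compareAndSet() method first checks whether the current reference equals the expected reference and whether the current stamp equals the expected stamp, and only if both match does it atomically set the reference and the stamp to the given new values.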

    public boolean compareAndSet(V   expectedReference,
                                 V   newReference,
                                 int expectedStamp,
                                 int newStamp) {
        Pair<V> current = pair;
        return
            expectedReference == current.reference &&
            expectedStamp == current.stamp &&
            ((newReference == current.reference &&
              newStamp == current.stamp) ||
             casPair(current, Pair.of(newReference, newStamp)));
    }
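
To make the difference concrete, here is a small single-threaded simulation (the interleaving of a "slow" thread and an "other" thread is replayed sequentially to keep it deterministic). AbaDemo and the variable names are made up for this illustration; string literals are used so that reference comparison behaves predictably.

```java
import java.util.concurrent.atomic.AtomicReference;
import java.util.concurrent.atomic.AtomicStampedReference;

class AbaDemo {

    // Returns { result of the plain CAS, result of the stamped CAS }.
    static boolean[] run() {
        String a = "A";
        String b = "B";
        AtomicReference<String> plain = new AtomicReference<>(a);
        AtomicStampedReference<String> stamped = new AtomicStampedReference<>(a, 0);

        // The "slow" thread reads the current state and is then delayed.
        String seenPlain = plain.get();
        int[] stampHolder = new int[1];
        String seenStamped = stamped.get(stampHolder);
        int seenStamp = stampHolder[0];

        // Meanwhile the "other" thread changes A -> B -> A.
        plain.set(b);
        plain.set(a);
        stamped.compareAndSet(a, b, 0, 1);
        stamped.compareAndSet(b, a, 1, 2);

        // The slow thread resumes and tries to write "C".
        boolean plainCas = plain.compareAndSet(seenPlain, "C");               // succeeds: ABA goes unnoticed
        boolean stampedCas =
                stamped.compareAndSet(seenStamped, "C", seenStamp, seenStamp + 1); // fails: stamp is 2, not 0
        return new boolean[] {plainCas, stampedCas};
    }

    public static void main(String[] args) {
        boolean[] result = run();
        System.out.println("plain CAS succeeded:   " + result[0]); // true
        System.out.println("stamped CAS succeeded: " + result[1]); // false
    }
}
```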

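Long spin times incur high overhead#

CAS usually retries by spinning: when an attempt fails, the thread loops and tries again until it succeeds. If it keeps failing for a long time, the spinning wastes a significant amount of CPU time.

If the JVM can use the pause instruction provided by the processor, efficiency improves somewhat. The pause instruction has two effects:

It delays pipelined execution so that the spinning CPU does not consume excessive execution resources.
It avoids the CPU pipeline being flushed because of a memory-order violation when the loop exits.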

CAS can also be aided by library support (e.g., with Unsafe or Atomic classes).

CAS can only guarantee atomicity for a single shared variable#

CAS is only effective for a single shared variable. If you need to operate on multiple shared variables, CAS alone is not enough. However, from Java 1.5 onward, AtomicReference allows atomic operations on references, enabling you to combine multiple variables into one shared variable for CAS, or use locks to achieve the same.
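
One way to read "combine multiple variables into one shared variable" is to bundle them into an immutable object and CAS the reference to it. The sketch below assumes a hypothetical RangeHolder that must keep lower <= upper; the class and method names are illustrative only.

```java
import java.util.concurrent.atomic.AtomicReference;

class RangeHolder {

    // Immutable snapshot that bundles two related variables.
    static final class Range {
        final int lower;
        final int upper;

        Range(int lower, int upper) {
            this.lower = lower;
            this.upper = upper;
        }
    }

    private final AtomicReference<Range> range = new AtomicReference<>(new Range(0, 10));

    // Atomically replaces the lower bound; refuses values above the current upper bound.
    boolean setLower(int newLower) {
        while (true) {
            Range current = range.get();
            if (newLower > current.upper) {
                return false;                       // would break the invariant lower <= upper
            }
            Range next = new Range(newLower, current.upper);
            if (range.compareAndSet(current, next)) {
                return true;                        // both fields were swapped in one CAS
            }
            // Another thread replaced the Range in the meantime: re-read and retry.
        }
    }

    Range get() {
        return range.get();
    }

    public static void main(String[] args) {
        RangeHolder holder = new RangeHolder();
        System.out.println(holder.setLower(5));   // true
        System.out.println(holder.setLower(11));  // false
        System.out.println(holder.get().lower);   // 5
    }
}
```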

The synchronized keyword#

What is synchronized? What is it for?#

synchronized is a Java keyword that synchronizes access to resources across threads. It ensures that a method or block marked with synchronized can be executed by only one thread at a time.

In early Java versions, synchronized was a heavyweight lock and less efficient because the monitor lock relied on the OS’s Mutex Lock, and thread context-switching involved user-to-kernel mode transitions, which took time.

Since Java 6, synchronized has been optimized with spin locks, adaptive spin, lock elimination, lock coarsening, biased locking, lightweight locks, etc., improving its performance. Therefore, synchronized is still commonly used in real projects; the JDK source and many frameworks use it extensively.

About biased locking: it adds complexity to the JVM and does not benefit every application. In JDK 15 biased locking was deprecated and disabled by default (it could still be enabled with -XX:+UseBiasedLocking). In JDK 18 the option was made obsolete, so biased locking can no longer be turned on from the command line.

How to use synchronized?#

There are three main usage patterns:

Modifying instance methods

    synchronized void method() {
        // business code
    }

Modifying static methods

This locks the current class and affects all instances of the class; entering synchronized code requires the lock for the class.
```
synchronized static void method() {
    // business code
}
```

Static synchronized methods and non-static synchronized methods do not mutually exclude each other. If thread A calls a non-static synchronized method of an instance while thread B calls a static synchronized method of the same class, they won’t block each other, because the static method locks the class and the non-static method locks the instance.

Modifying a code block

- synchronized(object): acquire the lock of the given object before entering the synchronized code.
- synchronized(Class.class): acquire the lock of the given Class before entering the synchronized code.

```
synchronized(this) {
    // business code
}
```

Summary:

Putting synchronized on static methods and synchronized(class) blocks locks the Class object.
Putting synchronized on instance methods locks the object instance.
Avoid using synchronized(String a): string literals are cached in the JVM’s string constant pool, so unrelated code may end up locking on the very same String object, which can cause unexpected contention.
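
Which monitor each form acquires can be checked with Thread.holdsLock(Object). The sketch below is illustrative (LockTargetDemo is a made-up class) and assumes the caller does not already hold either monitor.

```java
class LockTargetDemo {

    // Instance method: locks this.
    synchronized boolean[] instanceMethod() {
        return new boolean[] {Thread.holdsLock(this), Thread.holdsLock(LockTargetDemo.class)};
    }

    // Static method: locks the Class object.
    static synchronized boolean[] staticMethod(LockTargetDemo instance) {
        return new boolean[] {Thread.holdsLock(instance), Thread.holdsLock(LockTargetDemo.class)};
    }

    // synchronized(Class) block: also locks the Class object.
    boolean[] classBlock() {
        synchronized (LockTargetDemo.class) {
            return new boolean[] {Thread.holdsLock(this), Thread.holdsLock(LockTargetDemo.class)};
        }
    }

    public static void main(String[] args) {
        LockTargetDemo demo = new LockTargetDemo();
        // Each line prints: holds instance lock? / holds class lock?
        System.out.println(java.util.Arrays.toString(demo.instanceMethod()));  // [true, false]
        System.out.println(java.util.Arrays.toString(staticMethod(demo)));     // [false, true]
        System.out.println(java.util.Arrays.toString(demo.classBlock()));      // [false, true]
    }
}
```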

Can constructors be declared synchronized?#

Conclusion: a constructor cannot be declared synchronized; the compiler rejects the modifier. A synchronized block inside the constructor body is still allowed.

Constructors are inherently thread-safe; there is no such thing as a synchronized constructor.

Do you understand the underlying mechanism of synchronized?#

The underlying mechanism of synchronized is at the JVM level.

Synchronized block:

    public class SynchronizedDemo {
        public void method() {
            synchronized (this) {
                System.out.println("synchronized code block");
            }
        }
    }

Using javap (JDK’s tool) to inspect the bytecode for SynchronizedDemo: compile with javac SynchronizedDemo.java and then run javap -c -s -v -l SynchronizedDemo.class.

[Images showing monitorenter and monitorexit and their usage]

When monitorenter executes, the thread attempts to acquire the object’s lock (monitor). In HotSpot, the monitor is realized in C++ as ObjectMonitor. If the lock acquisition fails, the thread blocks until the lock is released by another thread.

For synchronized method:

    public class SynchronizedDemo2 {
        public synchronized void method() {
            System.out.println("synchronized method");
        }
    }

[Image showing ACC_SYNCHRONIZED flag]

A synchronized method does not use monitorenter/monitorexit for entry/exit; instead, the JVM uses ACC_SYNCHRONIZED to indicate that the method is synchronized, and it handles the synchronization accordingly.

In summary:

The implementation of synchronized blocks uses monitorenter and monitorexit.
Synchronized methods use the ACC_SYNCHRONIZED flag to indicate a synchronized method.
Either way, both approaches ultimately obtain the object’s monitor.

What optimizations were done to synchronized after Java 1.6? Do you understand lock upgrading?#

Since Java 6, synchronized has seen many optimizations such as spin locks, adaptive spin locks, lock elimination, lock coarsening, biased locking, and lightweight locking to reduce synchronization overhead. Locks can be upgraded to higher forms as contention increases; however, biasing has been deprecated in recent versions (as noted earlier).

What’s the difference between synchronized and volatile?#

synchronized and volatile are complementary rather than opposed.
volatile provides a lightweight form of synchronization, so it generally has better performance than synchronized. However, volatile can be used only on variables, not on methods or blocks.
volatile ensures visibility but not atomicity; synchronized ensures both.

ReentrantLock#

What is ReentrantLock?#

ReentrantLock implements the Lock interface, is reentrant and exclusive, similar to the synchronized keyword. However, ReentrantLock is more flexible and powerful, adding features such as polling, timeouts, interruption, fair and non-fair locking, etc.

    public class ReentrantLock implements Lock, java.io.Serializable {}

ReentrantLock contains an inner class Sync; Sync extends AQS (AbstractQueuedSynchronizer), and most lock/unlock operations are implemented in Sync. Sync has two subclasses: FairSync and NonfairSync.

ReentrantLock uses a non-fair lock by default, but you can explicitly request a fair lock via the constructor.

    // Pass a boolean, true for fair lock, false for non-fair
    public ReentrantLock(boolean fair) {
        sync = fair ? new FairSync() : new NonfairSync();
    }

From the above, you can see that ReentrantLock’s implementation is based on AQS.

What’s the difference between fair and non-fair locks?#

Fair lock: once the lock is released, the thread that requested it first acquires it first. Performance is somewhat worse, because enforcing strict request order causes more context switches.
Non-fair lock: once the lock is released, a later-arriving thread may acquire it ahead of threads that have waited longer, in no guaranteed order. Performance is usually better, but some threads may starve.

What’s the difference between synchronized and ReentrantLock?#

Both are reentrant locks.
Synchronized is implemented by the JVM; ReentrantLock is implemented at the API level.
ReentrantLock provides higher-level features like interruptible lock acquisition, fairness, and multiple conditions, which synchronized cannot directly provide.
If you want to use the features of ReentrantLock (lock interrupts, fairness, allow multiple conditions), pick ReentrantLock; otherwise, synchronized is simpler and typically sufficient.

Interruptible vs non-interruptible locks?#

Interruptible locks: a thread waiting to acquire a lock can be interrupted; ReentrantLock supports lockInterruptibly().
Non-interruptible locks: once a thread starts waiting for the lock, it cannot respond to interruption until it has acquired the lock. synchronized is a non-interruptible lock.
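
The following sketch shows an interruptible acquisition in action. It is illustrative only: the class InterruptibleLockDemo is made up, and the main thread deliberately keeps the lock so that the worker has to wait. ReentrantLock.hasQueuedThread is used to make sure the worker is really queued before it is interrupted.

```java
import java.util.concurrent.locks.ReentrantLock;

class InterruptibleLockDemo {

    // Returns true if the waiting worker was woken up by the interrupt.
    static boolean waiterWasInterrupted() {
        ReentrantLock lock = new ReentrantLock();
        boolean[] interrupted = {false};

        lock.lock();                              // the caller holds the lock, so the worker must wait
        try {
            Thread worker = new Thread(() -> {
                try {
                    lock.lockInterruptibly();     // waits, but responds to interrupt()
                    lock.unlock();                // not reached in this demo
                } catch (InterruptedException e) {
                    interrupted[0] = true;        // gave up waiting without ever owning the lock
                }
            }, "worker");
            worker.start();

            while (!lock.hasQueuedThread(worker)) {
                Thread.sleep(1);                  // wait until the worker is queued on the lock
            }
            worker.interrupt();
            worker.join();
            return interrupted[0];
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        } finally {
            lock.unlock();
        }
    }

    public static void main(String[] args) {
        System.out.println("waiter interrupted: " + waiterWasInterrupted()); // true
    }
}
```

A thread blocked on entering a synchronized block in the same situation would keep waiting after interrupt(); only its interrupt flag would be set.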

ReentrantReadWriteLock#

What is ReentrantReadWriteLock?#

ReentrantReadWriteLock implements ReadWriteLock, a reentrant read-write lock that allows multiple readers to hold the lock simultaneously, but only one writer to hold the lock, and writers have exclusive access when present.

    public class ReentrantReadWriteLock
            implements ReadWriteLock, java.io.Serializable {
    }

    public interface ReadWriteLock {
        Lock readLock();
        Lock writeLock();
    }

Read locks are shared; write locks are exclusive. Read locks can be held by multiple threads simultaneously, while a write lock can be held by only one thread at a time.

Like ReentrantLock, ReentrantReadWriteLock is also based on AQS.

ReentrantReadWriteLock also supports fair and non-fair locking, with non-fair as default; you can specify fairness in the constructor.

    // Pass a boolean, true for fair lock, false for non-fair lock
    public ReentrantReadWriteLock(boolean fair) {
        sync = fair ? new FairSync() : new NonfairSync();
        readerLock = new ReadLock(this);
        writerLock = new WriteLock(this);
    }

When is ReentrantReadWriteLock suitable?#

Because it provides both read and write locks, and read locks are shareable while writes are exclusive, it can significantly improve performance in read-mostly workloads.
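
A read-mostly cache is a common fit. The sketch below is a minimal illustration (ReadMostlyCache is a hypothetical class, not a JDK type): lookups share the read lock, while updates take the exclusive write lock.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.locks.ReentrantReadWriteLock;

class ReadMostlyCache<K, V> {
    private final Map<K, V> map = new HashMap<>();   // guarded by rw
    private final ReentrantReadWriteLock rw = new ReentrantReadWriteLock();

    // Many threads may read at the same time.
    V get(K key) {
        rw.readLock().lock();
        try {
            return map.get(key);
        } finally {
            rw.readLock().unlock();
        }
    }

    // Writers are exclusive: no readers or other writers while this runs.
    void put(K key, V value) {
        rw.writeLock().lock();
        try {
            map.put(key, value);
        } finally {
            rw.writeLock().unlock();
        }
    }

    public static void main(String[] args) {
        ReadMostlyCache<String, Integer> cache = new ReadMostlyCache<>();
        cache.put("answer", 42);
        System.out.println(cache.get("answer"));   // 42
        System.out.println(cache.get("missing"));  // null
    }
}
```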

What’s the difference between shared locks and exclusive locks?#

Shared lock: one lock can be held by multiple threads simultaneously (reads).
Exclusive lock: the lock can be held by only one thread at a time (writes).

Can a thread holding a read lock obtain a write lock?#

If a thread holds a read lock, it cannot obtain the write lock: acquiring the write lock does not succeed while the read lock is held by any thread, including the current thread itself, so the read lock must be released first.
If a thread holds the write lock, it can still acquire the read lock: acquiring the read lock is refused only when the write lock is held by a different thread.

Why can’t a read lock be upgraded to a write lock?#

Write locks can be downgraded to read locks, but read locks cannot be upgraded to write locks. Upgrading would cause threads to compete for the write lock, which is exclusive, potentially harming performance. There can also be deadlocks if two threads with read locks try to upgrade to write locks.
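
Both directions can be tried with tryLock(), which returns immediately instead of blocking. This is a sketch for illustration; LockDowngradeDemo and its method names are assumptions.

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

class LockDowngradeDemo {

    // Attempted upgrade: holding the read lock, try to take the write lock.
    static boolean upgradeAttempt() {
        ReentrantReadWriteLock rw = new ReentrantReadWriteLock();
        rw.readLock().lock();
        try {
            boolean gotWrite = rw.writeLock().tryLock();   // refused while any read lock is held
            if (gotWrite) {
                rw.writeLock().unlock();
            }
            return gotWrite;
        } finally {
            rw.readLock().unlock();
        }
    }

    // Downgrade: holding the write lock, take the read lock, then release the write lock.
    static boolean downgrade() {
        ReentrantReadWriteLock rw = new ReentrantReadWriteLock();
        rw.writeLock().lock();
        boolean gotRead;
        try {
            gotRead = rw.readLock().tryLock();             // allowed for the write-lock owner
        } finally {
            rw.writeLock().unlock();                       // from here on only the read lock is held
        }
        if (gotRead) {
            rw.readLock().unlock();
        }
        return gotRead;
    }

    public static void main(String[] args) {
        System.out.println("upgrade succeeded:   " + upgradeAttempt()); // false
        System.out.println("downgrade succeeded: " + downgrade());      // true
    }
}
```

Using writeLock().lock() instead of tryLock() in the upgrade attempt would block the thread forever, since it would be waiting for its own read lock to be released.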

ThreadLocal#

What is ThreadLocal good for?#

Typically, a variable is accessible by any thread. If you want each thread to have its own local copy of a variable, how can you do that?

ThreadLocal is designed to solve exactly this: it binds a value to each thread independently. ThreadLocal can be imagined as a small box to store data local to each thread. If you create a ThreadLocal variable, each thread that accesses it will have its own local copy, avoiding thread-safety issues.

How to use ThreadLocal?#

Below is a simple example showing how to use ThreadLocal in a project.

    import java.text.SimpleDateFormat;
    import java.util.Random;

    public class ThreadLocalExample implements Runnable {

        // SimpleDateFormat is not thread-safe, so each thread should have its own copy
        private static final ThreadLocal<SimpleDateFormat> formatter = ThreadLocal.withInitial(() -> new SimpleDateFormat("yyyyMMdd HHmm"));

        public static void main(String[] args) throws InterruptedException {
            ThreadLocalExample obj = new ThreadLocalExample();
            for (int i = 0; i < 10; i++) {
                Thread t = new Thread(obj, "" + i);
                Thread.sleep(new Random().nextInt(1000));
                t.start();
            }
        }

        @Override
        public void run() {
            System.out.println("Thread Name= " + Thread.currentThread().getName() + " default Formatter = " + formatter.get().toPattern());
            try {
                Thread.sleep(new Random().nextInt(1000));
            } catch (InterruptedException e) {
                e.printStackTrace();
            }
            // formatter pattern is changed here by thread, but it won't reflect to other threads
            formatter.set(new SimpleDateFormat());

            System.out.println("Thread Name= " + Thread.currentThread().getName() + " formatter = " + formatter.get().toPattern());
        }

    }

From the output you can see that although Thread-0 has changed the value of formatter, Thread-1’s default formatter is still the initial value, and the same holds for the other threads.

The code above creates the ThreadLocal variable with a Java 8 feature: the ThreadLocal class was extended in Java 8 with the withInitial() method, which takes a Supplier. It is equivalent to the following anonymous-class form, which IDEA would suggest converting to the Java 8 style:

    private static final ThreadLocal<SimpleDateFormat> formatter = new ThreadLocal<SimpleDateFormat>() {
        @Override
        protected SimpleDateFormat initialValue() {
            return new SimpleDateFormat("yyyyMMdd HHmm");
        }
    };

Do you understand ThreadLocal’s mechanism?#

Starting from the Thread class source code.

    public class Thread implements Runnable {
        //......
        // ThreadLocal values related to this thread. Maintained by ThreadLocal
        ThreadLocal.ThreadLocalMap threadLocals = null;

        // InheritableThreadLocal values related to this thread. Maintained by InheritableThreadLocal
        ThreadLocal.ThreadLocalMap inheritableThreadLocals = null;
        //......
    }

From the Thread class source, you can see that the Thread class has a threadLocals and an inheritableThreadLocals, both of type ThreadLocalMap. You can think of ThreadLocalMap as a specialized HashMap implemented by ThreadLocal. By default, these two variables are null and are created only when the current thread calls the ThreadLocal set or get methods. When these methods are called, you’re actually operating on the mapping within ThreadLocalMap.

ThreadLocalMap stores key-value pairs in which the key is a ThreadLocal and the value is the object set via that ThreadLocal’s set method.

    ThreadLocalMap(ThreadLocal<?> firstKey, Object firstValue) {
        //......
    }

For example, if you declare two ThreadLocal objects in the same thread, the ThreadLocalMap inside the Thread stores data for both ThreadLocals; the key is the ThreadLocal instance, and the value is what was set via the ThreadLocal’s set call.

The ThreadLocal data structure is shown here:

ThreadLocalMap is a static inner class of ThreadLocal.
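To make the per-thread map concrete, here is a minimal sketch. The class name ThreadLocalIsolationDemo and the helper readUserInOtherThread are made up for illustration; only ThreadLocal itself comes from the JDK. The main thread sets two ThreadLocals (both end up in its own ThreadLocalMap), while a freshly started thread still sees the initial value.

```java
import java.util.concurrent.atomic.AtomicReference;

class ThreadLocalIsolationDemo {
    // Two independent ThreadLocals; within one thread both are keys in that thread's ThreadLocalMap.
    private static final ThreadLocal<String> USER = ThreadLocal.withInitial(() -> "anonymous");
    private static final ThreadLocal<Integer> COUNTER = ThreadLocal.withInitial(() -> 0);

    /** Reads USER from a new thread, which has its own (initially empty) map. */
    static String readUserInOtherThread() {
        AtomicReference<String> seen = new AtomicReference<>();
        Thread t = new Thread(() -> seen.set(USER.get()));
        t.start();
        try {
            t.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return seen.get();
    }

    public static void main(String[] args) {
        USER.set("alice");   // stored in the main thread's map under key USER
        COUNTER.set(42);     // stored in the same map under key COUNTER
        System.out.println(USER.get() + " / " + COUNTER.get());
        System.out.println(readUserInOtherThread()); // the other thread never saw "alice"
        USER.remove();
        COUNTER.remove();
    }
}
```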

How does a ThreadLocal memory leak occur?#

ThreadLocalMap holds its key (the ThreadLocal) through a weak reference but holds the value through a strong reference. If a ThreadLocal is no longer strongly referenced anywhere else, the key is reclaimed by GC while the value is not, because the entry, and therefore the thread, still references it.

As a result, ThreadLocalMap can contain entries whose key is null. If nothing cleans them up, the values are never collected for as long as the thread lives, which is a memory leak. The ThreadLocalMap implementation accounts for this: set(), get(), and remove() clear entries with null keys that they encounter. It is still best to call remove() explicitly once you are done with a ThreadLocal.

static class Entry extends WeakReference<ThreadLocal<?>> {
    /** The value associated with this ThreadLocal. */
    Object value;

    Entry(ThreadLocal<?> k, Object v) {
        super(k);
        value = v;
    }
}

Weak references:

An object that is reachable only through weak references is treated as dispensable. Weak references differ from soft references in that such an object has a shorter lifetime: when the garbage collector scans the memory it manages and finds an object that is only weakly reachable, it reclaims the object whether or not memory is currently tight.

Weak references can be used with a ReferenceQueue; if the object referenced by a weak reference is garbage collected, the VM will enqueue the weak reference into the associated reference queue.
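The advice to call remove() matters most on pooled threads, which outlive individual tasks. The sketch below is an illustration under assumptions: the class ThreadLocalCleanupDemo, the field REQUEST_ID, and the value "req-1" are invented. It shows the visible symptom of a missing remove() (a value surviving into the next task on the same thread) rather than measuring memory, but the try/finally pattern is the same one that prevents the leak.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

class ThreadLocalCleanupDemo {
    private static final ThreadLocal<String> REQUEST_ID = new ThreadLocal<>();

    /** Runs two tasks on the same pooled thread and returns what the second one reads. */
    static String valueSeenBySecondTask(boolean cleanUp) {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        try {
            Runnable first = () -> {
                REQUEST_ID.set("req-1");
                try {
                    // ... business logic that reads REQUEST_ID ...
                } finally {
                    if (cleanUp) {
                        REQUEST_ID.remove(); // drop the entry from this thread's ThreadLocalMap
                    }
                }
            };
            Callable<String> second = REQUEST_ID::get;
            pool.submit(first).get();
            return pool.submit(second).get();
        } catch (Exception e) {
            throw new IllegalStateException(e);
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        System.out.println(valueSeenBySecondTask(false)); // stale value from the first task
        System.out.println(valueSeenBySecondTask(true));  // null: the entry was removed
    }
}
```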

Thread pools#

What is a thread pool?#

A thread pool is a resource pool that manages a set of threads. When there is a task to process, you take a thread from the pool; after the task finishes, the thread is not destroyed immediately but waits for the next task.

Why use a thread pool?#

Pooling reduces the overhead of creating and destroying threads and improves resource utilization. A thread pool also provides centralized management, tuning, and monitoring of threads.

How to create a thread pool?#

Create via the ThreadPoolExecutor constructor (recommended).
Create via the Executor framework’s utility class Executors.

The Executors utility class can create several kinds of thread pool:

FixedThreadPool: Returns a thread pool with a fixed number of threads. The number of threads remains constant. When a new task is submitted, if there are idle threads, they execute immediately; otherwise, the task is queued until a thread becomes available.
SingleThreadExecutor: Returns a thread pool with only one thread. If more than one task is submitted, tasks are queued and executed in FIFO order when the single thread is available.
CachedThreadPool: Returns a thread pool that adjusts the number of threads to demand. The initial size is 0. When a new task arrives and no idle thread is available, a new thread is created. Threads that stay idle for a while (60 seconds by default) time out and are terminated, which shrinks the pool.
ScheduledThreadPool: Returns a thread pool for executing tasks after a given delay or periodically.

Why are the built-in thread pools not recommended?#

Alibaba Java Development Manual’s concurrency chapter explicitly states that thread resources must be provided by thread pools; applications should not create threads directly.

The benefit of thread pools is to reduce the cost of creating/destroying threads and resource usage, and to avoid resource exhaustion. If you don’t use a thread pool, the system may create a large number of similar threads, consuming memory or causing excessive context switching.

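The manual additionally requires creating pools through the ThreadPoolExecutor constructors rather than through Executors, so that whoever writes the code is explicit about how the pool behaves and avoids the risk of resource exhaustion.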

The downsides of Executors wrappers:

FixedThreadPool and SingleThreadExecutor use an unbounded LinkedBlockingQueue; the queue can grow to Integer.MAX_VALUE, potentially causing OOM with many requests.
CachedThreadPool uses SynchronousQueue; the maximum threads can reach Integer.MAX_VALUE, potentially causing OOM if tasks are numerous and slow.
ScheduledThreadPool and SingleThreadScheduledExecutor use an unbounded DelayedWorkQueue; the queue can grow to Integer.MAX_VALUE, potentially causing OOM.

// Unbounded LinkedBlockingQueue
public static ExecutorService newFixedThreadPool(int nThreads) {
    return new ThreadPoolExecutor(nThreads, nThreads, 0L, TimeUnit.MILLISECONDS, new LinkedBlockingQueue<Runnable>());
}

// Unbounded LinkedBlockingQueue
public static ExecutorService newSingleThreadExecutor() {
    return new FinalizableDelegatedExecutorService(new ThreadPoolExecutor(1, 1, 0L, TimeUnit.MILLISECONDS, new LinkedBlockingQueue<Runnable>()));
}

// SynchronousQueue, no capacity, maximum threads Integer.MAX_VALUE
public static ExecutorService newCachedThreadPool() {
    return new ThreadPoolExecutor(0, Integer.MAX_VALUE, 60L, TimeUnit.SECONDS, new SynchronousQueue<Runnable>());
}

// DelayedWorkQueue (delayed blocking queue)
public static ScheduledExecutorService newScheduledThreadPool(int corePoolSize) {
    return new ScheduledThreadPoolExecutor(corePoolSize);
}
public ScheduledThreadPoolExecutor(int corePoolSize) {
    super(corePoolSize, Integer.MAX_VALUE, 0, NANOSECONDS,
          new DelayedWorkQueue());
}
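For contrast with the factory methods above, here is a minimal sketch of a pool built directly with the ThreadPoolExecutor constructor, so that both the queue and the thread count are bounded. The class name BoundedPoolDemo and the concrete sizes (4, 8, 100) are illustrative assumptions, not recommendations.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

class BoundedPoolDemo {
    /** Builds a pool whose queue and thread count are both bounded (sizes are illustrative). */
    static ThreadPoolExecutor newBoundedPool() {
        return new ThreadPoolExecutor(
                4,                                          // corePoolSize
                8,                                          // maximumPoolSize
                60L, TimeUnit.SECONDS,                      // idle time before extra threads are reclaimed
                new ArrayBlockingQueue<>(100),              // bounded queue: at most 100 waiting tasks
                new ThreadPoolExecutor.CallerRunsPolicy()); // slow the submitter down instead of piling up work
    }

    public static void main(String[] args) {
        ThreadPoolExecutor pool = newBoundedPool();
        pool.execute(() -> System.out.println("running on " + Thread.currentThread().getName()));
        pool.shutdown();
    }
}
```

Because every limit is spelled out at the call site, a reader can see at a glance how much work the pool may hold before it pushes back.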

What are the common thread pool parameters, and what do they mean?#

/**
 * Create a new ThreadPoolExecutor with the given initial parameters.
 */
public ThreadPoolExecutor(int corePoolSize, // core pool size
                          int maximumPoolSize, // max pool size
                          long keepAliveTime, // keep-alive time for extra threads
                          TimeUnit unit, // time unit for keepAliveTime
                          BlockingQueue<Runnable> workQueue, // task queue
                          ThreadFactory threadFactory, // thread factory
                          RejectedExecutionHandler handler // rejection policy
                           ) {
    if (corePoolSize < 0 ||
        maximumPoolSize <= 0 ||
        maximumPoolSize < corePoolSize ||
        keepAliveTime < 0)
        throw new IllegalArgumentException();
    if (workQueue == null || threadFactory == null || handler == null)
        throw new NullPointerException();
    this.corePoolSize = corePoolSize;
    this.maximumPoolSize = maximumPoolSize;
    this.workQueue = workQueue;
    this.keepAliveTime = unit.toNanos(keepAliveTime);
    this.threadFactory = threadFactory;
    this.handler = handler;
}

Three most important parameters of ThreadPoolExecutor:

corePoolSize: When the task queue has not reached capacity, the maximum number of threads that can run concurrently.
maximumPoolSize: When the queue has reached capacity, the number of concurrently running threads is capped at this maximum.
workQueue: When a new task arrives, the pool first checks whether the number of running threads has reached corePoolSize; if so, the task is stored in the queue.

Other common parameters:

keepAliveTime: when the pool has more than corePoolSize threads and no new tasks are arriving, the extra (non-core) threads are not destroyed as soon as they become idle; they wait up to keepAliveTime and are then terminated, shrinking the pool back to corePoolSize. Core threads are kept alive by default; calling allowCoreThreadTimeOut(true) applies the same timeout to them.
unit: Time unit for keepAliveTime.
threadFactory: Used to create new threads for the executor.
handler: RejectedExecutionHandler when the pool is saturated.

[Diagram showing the relationships between thread pool parameters]

What are the thread pool saturation policies?#

If the number of running threads reaches maximum and the queue is full, ThreadPoolExecutor defines several policies:

ThreadPoolExecutor.AbortPolicy: throws a RejectedExecutionException to reject the new task.
ThreadPoolExecutor.CallerRunsPolicy: the task is run by the thread that invoked execute; if the executor has shut down, the task is discarded. This policy reduces the rate of new task submissions and can affect overall performance. If your application can tolerate this delay and you want to ensure every task is executed, you can choose this policy.
ThreadPoolExecutor.DiscardPolicy: discards the new task.
ThreadPoolExecutor.DiscardOldestPolicy: discards the oldest unprocessed task in the queue and then retries submitting the new one.

For example, when Spring creates a thread pool through ThreadPoolTaskExecutor, or when you call a ThreadPoolExecutor constructor directly without specifying a RejectedExecutionHandler, the default saturation policy is AbortPolicy. Under this policy, once the pool is saturated ThreadPoolExecutor throws a RejectedExecutionException to reject the new task, which means that task is never processed. If you cannot afford to drop tasks, use CallerRunsPolicy instead. Unlike the other policies, it neither throws nor discards: it hands the task back to the caller, which runs it on its own thread.

public static class CallerRunsPolicy implements RejectedExecutionHandler {

    public CallerRunsPolicy() { }

    public void rejectedExecution(Runnable r, ThreadPoolExecutor e) {
        if (!e.isShutdown()) {
            // Run directly on the thread that called execute(), not on a pool thread
            r.run();
        }
    }
}
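A small experiment makes the behavior of CallerRunsPolicy visible. In this sketch (the class CallerRunsDemo and its method name are invented for illustration) a pool with one thread and a one-slot queue is saturated on purpose, and the third task records which thread actually ran it.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicReference;

class CallerRunsDemo {
    /** Returns the name of the thread that ran the task submitted to a saturated pool. */
    static String threadThatRanOverflowTask() {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                1, 1, 0L, TimeUnit.SECONDS, new ArrayBlockingQueue<>(1),
                new ThreadPoolExecutor.CallerRunsPolicy());
        CountDownLatch release = new CountDownLatch(1);
        Runnable blocker = () -> {
            try {
                release.await();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        };
        AtomicReference<String> ranOn = new AtomicReference<>();
        pool.execute(blocker); // occupies the only worker thread
        pool.execute(blocker); // fills the single queue slot
        // The pool is now saturated, so this task runs synchronously on the submitting thread.
        pool.execute(() -> ranOn.set(Thread.currentThread().getName()));
        release.countDown();
        pool.shutdown();
        return ranOn.get();
    }

    public static void main(String[] args) {
        System.out.println("overflow task ran on: " + threadThatRanOverflowTask());
    }
}
```

Since the submitter is busy running the task itself, it cannot submit more work in the meantime, which is the back-pressure effect described above.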

Which common blocking queues are used with thread pools?#

When a new task arrives, the pool first checks if the number of running threads has reached the core pool size, and if so, the task is placed into the queue.

Different thread pools use different blocking queues:

A LinkedBlockingQueue with capacity Integer.MAX_VALUE (unbounded): FixedThreadPool and SingleThreadExecutor. The pool’s thread count never exceeds the core pool size (as the queue will never be full).
A SynchronousQueue: CachedThreadPool. No capacity; it ensures a new thread is created if there is no idle thread. The pool can grow to Integer.MAX_VALUE.
DelayedWorkQueue (delay queue): used by ScheduledThreadPool and SingleThreadScheduledExecutor. Its elements are ordered by delay rather than by insertion time; it is backed by a heap, so the task due earliest is always at the head. When it fills up it grows automatically instead of blocking, up to Integer.MAX_VALUE, so the queue never becomes full and the pool never creates more than corePoolSize threads.

Do you understand the flow of processing tasks in a thread pool?#

If the number of running threads is less than corePoolSize, a new thread is created to execute the task.
If the number of running threads is equal to or greater than corePoolSize, the task is placed in the queue to wait for execution.
If the task cannot be queued because the queue is full and the number of running threads is less than maximumPoolSize, a new thread is created to execute the task.
If the queue is full and the number of running threads has already reached maximumPoolSize, the task is rejected and handed to the saturation policy.
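The four steps can be observed with a deliberately tiny pool. The sketch below is an assumption-laden illustration (the class PoolFlowDemo, the sizes 1/2/1, and the snapshot format are all made up): it submits four blocking tasks to a pool with corePoolSize 1, maximumPoolSize 2, and a one-slot queue, then reports the pool size, the queue length, and whether the fourth task was rejected.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.RejectedExecutionException;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

class PoolFlowDemo {
    /** Submits four blocking tasks and returns "poolSize/queued/rejected". */
    static String submitFour() {
        // No handler given, so the default AbortPolicy applies.
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                1, 2, 30L, TimeUnit.SECONDS, new ArrayBlockingQueue<>(1));
        CountDownLatch release = new CountDownLatch(1);
        Runnable blocker = () -> {
            try {
                release.await();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        };
        boolean rejected = false;
        try {
            pool.execute(blocker); // 1: below corePoolSize, so a core thread is created
            pool.execute(blocker); // 2: core thread busy, so the task is queued
            pool.execute(blocker); // 3: queue full, below maximumPoolSize, so an extra thread is created
            pool.execute(blocker); // 4: queue full, at maximumPoolSize, so the task is rejected
        } catch (RejectedExecutionException e) {
            rejected = true;
        }
        String snapshot = pool.getPoolSize() + "/" + pool.getQueue().size() + "/" + rejected;
        release.countDown(); // let the blocked tasks finish
        pool.shutdown();
        return snapshot;
    }

    public static void main(String[] args) {
        System.out.println(submitFour());
    }
}
```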

How to name threads in a thread pool?#

When initializing a thread pool, naming is helpful for debugging.

By default, thread names look like pool-1-thread-n, which carry no business meaning.

There are two common ways to name threads in the pool:

Using Guava’s ThreadFactoryBuilder

ThreadFactory threadFactory = new ThreadFactoryBuilder()
                        .setNameFormat(threadNamePrefix + "-%d")
                        .setDaemon(true).build();
ExecutorService threadPool = new ThreadPoolExecutor(corePoolSize, maximumPoolSize, keepAliveTime, TimeUnit.MINUTES, workQueue, threadFactory);

Implementing your own ThreadFactory

import java.util.concurrent.ThreadFactory;
import java.util.concurrent.atomic.AtomicInteger;

/**
 * Thread factory that sets thread names to help locate problems.
 */
public final class NamingThreadFactory implements ThreadFactory {

    private final AtomicInteger threadNum = new AtomicInteger();
    private final String name;

    /**
     * Create a thread pool factory with a name.
     */
    public NamingThreadFactory(String name) {
        this.name = name;
    }

    @Override
    public Thread newThread(Runnable r) {
        Thread t = new Thread(r);
        t.setName(name + " [#" + threadNum.incrementAndGet() + "]");
        return t;
    }
}

How to set the thread pool size?#

Many people assume that a larger thread pool is always better. In practice, too many threads increase the cost of context switching.

If the pool size is too small, a large number of tasks/requests may queue up, potentially leading to queue fullness or memory pressure (OOM). The CPU might not be fully utilized.
If the pool size is too large, too many threads compete for CPU resources, causing heavy context switching and slowing down overall execution.

A simple, widely applicable rule:

CPU-bound tasks (N+1): set the number of threads to N (the number of CPU cores) + 1. The extra thread covers occasional page faults or other pauses: when one thread stalls, the spare one keeps the CPU busy instead of leaving it idle.
IO-bound tasks (2N): these threads spend most of their time waiting on IO rather than using the CPU, so the CPU can be handed to other threads in the meantime. You can therefore configure more threads; 2N is a common starting point.

How to determine CPU-bound vs IO-bound tasks?

CPU-bound means tasks that primarily use CPU, e.g., sorting large data in memory. IO-bound tasks include network I/O or file I/O; these tasks spend more time waiting for IO than performing CPU work.

A more precise formula: optimal thread count = N * (1 + WT/ST), where N is the number of CPU cores, WT is the time a thread spends waiting, and ST is the time it spends computing. The larger WT/ST is, the more threads you need.

We can use VisualVM (a JDK tool) to observe WT/ST.

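For CPU-bound tasks, WT/ST is close to 0, so the thread count is about N * (1 + 0) = N, which is close to the N + 1 rule above. For IO-bound tasks, WT/ST is large, so the formula suggests many threads; 2N is the conservative rule-of-thumb value, presumably chosen to avoid creating too many.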

The formula is only a guideline; adjust dynamically based on production experience.
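As a worked example of the formula, the following sketch turns N * (1 + WT/ST) into a helper. The class PoolSizeEstimator and the sample timings are hypothetical; in practice WT and ST would come from profiling.

```java
class PoolSizeEstimator {
    /** optimal threads = N * (1 + WT/ST), rounded to the nearest whole thread, at least 1. */
    static int estimate(int cpuCores, double waitTime, double serviceTime) {
        if (cpuCores <= 0 || waitTime < 0 || serviceTime <= 0) {
            throw new IllegalArgumentException("cores and serviceTime must be positive, waitTime non-negative");
        }
        return Math.max(1, (int) Math.round(cpuCores * (1 + waitTime / serviceTime)));
    }

    public static void main(String[] args) {
        int cores = Runtime.getRuntime().availableProcessors();
        // CPU-bound: almost no waiting, so the estimate equals the core count.
        System.out.println("CPU-bound: " + estimate(cores, 0, 10));
        // IO-bound: 90 ms waiting for every 10 ms of computing, so ten threads per core.
        System.out.println("IO-bound:  " + estimate(cores, 90, 10));
    }
}
```

With 8 cores, a task that waits as long as it computes (WT/ST = 1) yields 8 * (1 + 1) = 16 threads, which is exactly the 2N rule of thumb.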

How to dynamically modify thread pool parameters?#

Meituan’s technical team documented in “Java Thread Pool Implementation Principles and Meituan’s Practice” ideas and methods for configurable thread pool parameters.

Meituan’s approach focuses on making core thread pool parameters configurable. The three core parameters are:

corePoolSize: the core thread count, which is the maximum number of threads that can run concurrently while the task queue has not reached capacity.
maximumPoolSize: When the queue reaches capacity, the maximum number of threads that can run.
workQueue: When a new task arrives, the pool first checks if the number of running threads has reached the core pool size; if so, the task is enqueued.

Why these three parameters?

These are the most important parameters for ThreadPoolExecutor; they largely determine how the pool handles tasks.

Note: corePoolSize can be changed at runtime via setCorePoolSize(); if the current number of worker threads is greater than corePoolSize, the pool will shrink by reducing workers.

Additionally, there is no dynamic method to adjust queue length in the standard API. Meituan’s approach uses a custom queue called ResizableCapacityLinkedBlockingQueue (which removes the final modifier on the LinkedBlockingQueue’s capacity field, allowing dynamic resizing).
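The setters that ThreadPoolExecutor does provide can be exercised as follows. This is only a sketch of the JDK calls, not Meituan’s implementation; the class DynamicPoolDemo and the numbers are assumptions. Note the order: on recent JDKs setCorePoolSize() rejects a value above the current maximum, so the maximum is raised first.

```java
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

class DynamicPoolDemo {
    /** Resizes a live pool and returns "corePoolSize/maximumPoolSize" afterwards. */
    static String resize() {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                2, 4, 60L, TimeUnit.SECONDS, new LinkedBlockingQueue<>(100));
        // Raise the maximum first so corePoolSize never exceeds maximumPoolSize.
        pool.setMaximumPoolSize(16);
        pool.setCorePoolSize(8);
        pool.setKeepAliveTime(30L, TimeUnit.SECONDS);
        String result = pool.getCorePoolSize() + "/" + pool.getMaximumPoolSize();
        pool.shutdown();
        return result;
    }

    public static void main(String[] args) {
        System.out.println(resize());
    }
}
```

In a real system these calls would typically be triggered by a configuration-center listener rather than hard-coded.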

If your project also wants to achieve this, you can leverage these open-source projects:

Hippo4j: asynchronous thread pool framework; supports dynamic changes, monitoring, and alerting; easy to use without code changes. Supports multiple usage modes and aims to improve system reliability.
Dynamic TP: lightweight dynamic thread pool; built-in monitoring and alarm; integrates with middleware thread pool management; based on mainstream config centers (Nacos, Apollo, Zookeeper, etc., SPI extendable).

How to design a priority-based thread pool?#

This is a common interview question, essentially testing the candidate’s grasp of thread pools and blocking queues.

Different thread pools use different blocking queues for task queues. For example, FixedThreadPool uses LinkedBlockingQueue (unbounded), so the queue will never be full, and the pool can only create core pool size threads.

If you need to implement a priority task thread pool, you can consider using PriorityBlockingQueue (priority blocking queue) as the task queue (ThreadPoolExecutor’s constructor accepts a workQueue parameter).

PriorityBlockingQueue is an unbounded blocking queue that supports priorities. You can regard it as a thread-safe PriorityQueue: both are implemented on top of a binary min-heap, but PriorityQueue does not support blocking operations.

To enable PriorityBlockingQueue to sort tasks, the tasks placed in it must be comparable, in one of two ways:

The tasks submitted to the thread pool implement Comparable and override compareTo to define the priority.
Provide a Comparator when constructing the PriorityBlockingQueue to define the priority rules (recommended).

There are some risks and drawbacks:

PriorityBlockingQueue is unbounded, which can lead to a large backlog of requests and possible OOM.
It can cause starvation for low-priority tasks.
Since it sorts elements and ensures thread safety (via locks), performance can degrade.

A simple mitigation for OOM is to subclass PriorityBlockingQueue and override the offer method to bound the queue; when the number of elements exceeds a limit, return false.

Starvation can be mitigated with design choices (e.g., removing long-waiting tasks and re-adding with higher priority). Performance impact from sorting is unavoidable; for most business scenarios, this cost is acceptable.
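The Comparator approach can be sketched as follows. Everything named here (PriorityPoolDemo, PriorityTask, the rule that lower numbers run first) is an illustrative assumption. Tasks are submitted with execute() rather than submit(), because submit() would wrap them in a FutureTask that the comparator cannot cast.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Comparator;
import java.util.List;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.PriorityBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

class PriorityPoolDemo {
    /** A task carrying an explicit priority; lower numbers run first (illustrative type). */
    static final class PriorityTask implements Runnable {
        final int priority;
        final Runnable body;

        PriorityTask(int priority, Runnable body) {
            this.priority = priority;
            this.body = body;
        }

        @Override
        public void run() {
            body.run();
        }
    }

    static void awaitQuietly(CountDownLatch latch) {
        try {
            latch.await();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    /** Queues tasks with priorities 3, 1, 2 behind a blocked worker and returns the execution order. */
    static List<Integer> runInPriorityOrder() {
        Comparator<Runnable> byPriority = Comparator.comparingInt(r -> ((PriorityTask) r).priority);
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                1, 1, 0L, TimeUnit.SECONDS, new PriorityBlockingQueue<>(11, byPriority));
        List<Integer> order = Collections.synchronizedList(new ArrayList<>());
        CountDownLatch gate = new CountDownLatch(1);
        pool.execute(new PriorityTask(0, () -> awaitQuietly(gate))); // keeps the single worker busy
        for (int p : new int[] {3, 1, 2}) {
            pool.execute(new PriorityTask(p, () -> order.add(p)));   // waits in the priority queue
        }
        gate.countDown(); // the worker now drains the queue in priority order
        pool.shutdown();
        try {
            pool.awaitTermination(5, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return order;
    }

    public static void main(String[] args) {
        System.out.println(runInPriorityOrder());
    }
}
```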

Future#

What is the Future class used for?#

Future is a typical application of the asynchronous paradigm, used in scenarios where long-running tasks must be executed, avoiding blocking the program while waiting for results. Specifically, when we perform a time-consuming task, we can hand it off to a child thread for asynchronous execution and do other work in the meantime. Later, we can retrieve the result via Future.

This is the classic Future pattern in multithreading. You can think of it as a design pattern whose core idea is asynchronous invocation; it is used mainly in multithreaded code and is not specific to Java.

In Java, Future is simply a generic interface in the java.util.concurrent package. It defines five methods that cover four capabilities: cancelling a task, checking whether the task was cancelled, checking whether the task has completed, and retrieving the result (with or without a timeout).

// V represents the return type of the Future task
public interface Future<V> {
    boolean cancel(boolean mayInterruptIfRunning);
    boolean isCancelled();
    boolean isDone();
    V get() throws InterruptedException, ExecutionException;
    V get(long timeout, TimeUnit unit)
        throws InterruptedException, ExecutionException, TimeoutException;
}

Put simply: I hand a task to a Future. While the task runs, I am free to do other things, and I can cancel the task or check its status along the way. After some time, I retrieve the task’s result from the Future.
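In code, that workflow might look like the sketch below. The class FutureDemo and the toy computation (summing 1 to 100) are stand-ins for a genuinely slow task.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

class FutureDemo {
    /** Submits a computation to a worker thread and collects its result through a Future. */
    static int computeAsync() {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        try {
            Callable<Integer> sumTask = () -> {
                int sum = 0;
                for (int i = 1; i <= 100; i++) {
                    sum += i;
                }
                return sum;
            };
            Future<Integer> future = pool.submit(sumTask);
            // ... the caller is free to do other work here ...
            return future.get(5, TimeUnit.SECONDS); // blocks until the result is ready or the timeout expires
        } catch (InterruptedException | ExecutionException | TimeoutException e) {
            throw new IllegalStateException(e);
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        System.out.println(computeAsync());
    }
}
```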

What is the relationship between Callable and Future?#

We can understand the relationship through FutureTask.

FutureTask provides a basic implementation of Future and is commonly used to wrap a Callable or Runnable. It can cancel a task, report whether the task has finished, and return its result. The Future returned by ExecutorService.submit() on the standard thread pools is in fact a FutureTask.

FutureTask not only implements Future but also Runnable, so it can be executed by a thread directly.

FutureTask has two constructors: one takes a Callable, the other takes a Runnable. In fact, passing a Runnable internally converts it to a Callable.

public FutureTask(Callable<V> callable) {
    if (callable == null)
        throw new NullPointerException();
    this.callable = callable;
    this.state = NEW;
}
public FutureTask(Runnable runnable, V result) {
    // Convert Runnable to Callable via RunnableAdapter
    this.callable = Executors.callable(runnable, result);
    this.state = NEW;
}

FutureTask is effectively a wrapper around a Callable, managing task execution and storing the result of the Callable’s call method.
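Both constructors can be tried without a thread pool, since FutureTask is itself a Runnable. In this sketch the class FutureTaskDemo, its method names, and the string values are invented for illustration.

```java
import java.util.concurrent.ExecutionException;
import java.util.concurrent.FutureTask;

class FutureTaskDemo {
    /** Wraps a Callable and runs it on a plain Thread. */
    static String fromCallable() {
        FutureTask<String> task = new FutureTask<>(() -> "hello");
        new Thread(task).start(); // works because FutureTask implements Runnable
        return getQuietly(task);
    }

    /** Wraps a Runnable plus a fixed result; internally it is adapted to a Callable. */
    static String fromRunnable() {
        FutureTask<String> task = new FutureTask<>(() -> { /* side effects only */ }, "done");
        task.run(); // can also be run directly on the current thread
        return getQuietly(task);
    }

    private static String getQuietly(FutureTask<String> task) {
        try {
            return task.get(); // blocks until run() has stored the result
        } catch (InterruptedException | ExecutionException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(fromCallable());
        System.out.println(fromRunnable());
    }
}
```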

What is CompletableFuture good for?#

Future has some limitations in practice: it offers no way to compose asynchronous tasks, and get() blocks the caller until the result is ready. Java 8 introduced CompletableFuture to address them. Besides a more capable and easier-to-use Future, it supports a functional programming style and asynchronous task orchestration (chaining several asynchronous tasks into a complete pipeline).

Here is a simplified definition:

public class CompletableFuture<T> implements Future<T>, CompletionStage<T> {
}

As you can see, CompletableFuture implements both Future and CompletionStage.

CompletionStage describes a stage of an asynchronous computation. Many computations can be broken into multiple stages; you can compose all the steps using this interface to form an asynchronous computation pipeline.

The methods in CompletionStage are numerous; CompletableFuture inherits and uses a lot of Java 8’s functional programming features.
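A small pipeline shows what orchestration means in practice. The sketch below (class name CompletableFutureDemo and the price/tax values are made up) starts two asynchronous stages, combines their results, transforms the outcome, and supplies a fallback for failures.

```java
import java.util.concurrent.CompletableFuture;

class CompletableFutureDemo {
    /** Runs two async stages, combines them, and formats the outcome. */
    static String pipeline() {
        CompletableFuture<Integer> price = CompletableFuture.supplyAsync(() -> 40);
        CompletableFuture<Integer> tax = CompletableFuture.supplyAsync(() -> 2);
        return price
                .thenCombine(tax, Integer::sum)                      // runs when both stages are done
                .thenApply(total -> "total=" + total)                // transforms the combined result
                .exceptionally(ex -> "failed: " + ex.getMessage())   // fallback if any stage threw
                .join();                                             // wait for the pipeline to finish
    }

    public static void main(String[] args) {
        System.out.println(pipeline());
    }
}
```

With no executor argument, supplyAsync uses the common ForkJoinPool; production code often passes a dedicated executor instead.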

AQS#

What is AQS?#

AQS stands for AbstractQueuedSynchronizer, a class in java.util.concurrent.locks. It is an abstract class used to build locks and synchronizers.

public abstract class AbstractQueuedSynchronizer extends AbstractOwnableSynchronizer implements java.io.Serializable {
}

AQS provides reusable building blocks for locks and synchronizers. With it, widely used synchronizers can be built simply and efficiently: ReentrantLock, Semaphore, ReentrantReadWriteLock, CountDownLatch, and others are all based on AQS.

What is the principle of AQS?#

The core idea of AQS is this: if the requested shared resource is free, the requesting thread becomes the active worker thread and the resource is marked as locked. If the resource is already held, there must be a mechanism for blocking the waiting thread and for handing it the lock when it is woken up. AQS implements that mechanism with a CLH queue lock: threads that cannot get the lock for the moment are added to the queue.

The CLH (Craig, Landin, and Hagersten) queue is a virtual doubly linked queue; virtual means that no queue object exists, only the links between nodes. AQS wraps every thread requesting the shared resource in a Node of the CLH queue and hands out the lock through that queue. Each node represents one thread and stores the thread reference, the node’s state in the queue (waitStatus), and pointers to its predecessor (prev) and successor (next).

CLH queue structure is shown here:

[image]

AQS core diagram:

[image]

AQS uses an int member variable state to represent the synchronization state, and a built-in thread wait queue to handle waiting threads.

The state variable is declared volatile to reflect the current lock status.

// Shared variable, volatile to ensure visibility
private volatile int state;

Additionally, the state can be read/written via protected methods getState(), setState(), and compareAndSetState(), all of which are final and cannot be overridden.

// Return the current synchronization state
protected final int getState() {
    return state;
}
// Set the synchronization state
protected final void setState(int newState) {
    state = newState;
}
// Atomically set the synchronization state to update if it currently equals the expected value
protected final boolean compareAndSetState(int expect, int update) {
    return unsafe.compareAndSwapInt(this, stateOffset, expect, update);
}

Take ReentrantLock as an example. state starts at 0, meaning unlocked. When thread A calls lock(), it calls tryAcquire() to take the lock exclusively and sets state to state + 1. From then on other threads fail in tryAcquire() until thread A calls unlock() and state returns to 0 (the lock is released); only then do they get a chance to acquire it. Before releasing, thread A itself may acquire the lock again (state keeps accumulating); this is what reentrancy means. It must release the lock as many times as it acquired it so that state can return to 0.

As another example, CountDownLatch splits work into N parts run by N threads and initializes state to N (matching the number of threads). The N threads run in parallel; each calls countDown() once when it finishes, which decrements state by 1 via CAS. When all of them have finished (state is 0), the waiting thread (typically the main thread) is unparked and returns from await() to continue.
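To see how little code a custom synchronizer needs, here is a sketch of a non-reentrant mutex built on AQS, modeled loosely on the example in the AbstractQueuedSynchronizer Javadoc. The class SimpleMutex and the helper countWithMutex are illustrative names. state is 0 when unlocked and 1 when locked; AQS supplies all the queueing, parking, and waking.

```java
import java.util.concurrent.locks.AbstractQueuedSynchronizer;

class SimpleMutex {
    private static final class Sync extends AbstractQueuedSynchronizer {
        @Override
        protected boolean tryAcquire(int ignored) {
            // CAS state from 0 (free) to 1 (held); losers are queued and parked by AQS.
            if (compareAndSetState(0, 1)) {
                setExclusiveOwnerThread(Thread.currentThread());
                return true;
            }
            return false;
        }

        @Override
        protected boolean tryRelease(int ignored) {
            if (getState() == 0) {
                throw new IllegalMonitorStateException();
            }
            setExclusiveOwnerThread(null);
            setState(0); // volatile write publishes the unlock
            return true; // tells AQS to wake the next queued thread
        }
    }

    private final Sync sync = new Sync();

    void lock() { sync.acquire(1); }
    boolean tryLock() { return sync.tryAcquire(1); }
    void unlock() { sync.release(1); }

    /** Increments a shared counter from several threads under the mutex. */
    static int countWithMutex(int threads, int incrementsPerThread) {
        SimpleMutex mutex = new SimpleMutex();
        int[] counter = {0};
        Thread[] workers = new Thread[threads];
        for (int i = 0; i < threads; i++) {
            workers[i] = new Thread(() -> {
                for (int j = 0; j < incrementsPerThread; j++) {
                    mutex.lock();
                    try {
                        counter[0]++;
                    } finally {
                        mutex.unlock();
                    }
                }
            });
            workers[i].start();
        }
        for (Thread worker : workers) {
            try {
                worker.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
        return counter[0];
    }

    public static void main(String[] args) {
        System.out.println(countWithMutex(4, 10_000));
    }
}
```

Unlike ReentrantLock, this mutex never lets state exceed 1, so a thread that calls lock() twice without unlocking would block itself.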

What is Semaphore used for?#

synchronized and ReentrantLock allow only one thread at a time to access a resource, whereas Semaphore controls how many threads may access a particular resource at the same time.

Semaphore usage is straightforward:

// Initial number of resources
final Semaphore semaphore = new Semaphore(5);
// Acquire one permit
semaphore.acquire();
// Release one permit
semaphore.release();

When the initial resource count is 1, Semaphore degrades to an exclusive lock.

Semaphore has two modes:

Fair mode: the order of acquire() is the order of permit acquisition (FIFO).
Non-fair mode: permit acquisition may be opportunistic.

Semaphore corresponds to two constructors:

public Semaphore(int permits) {
    sync = new NonfairSync(permits);
}

public Semaphore(int permits, boolean fair) {
    sync = fair ? new FairSync(permits) : new NonfairSync(permits);
}

Both constructors require the number of permits; the second constructor can specify fair or non-fair mode (default non-fair).

Semaphore is typically used in scenarios where there are clear limits on how many threads can access a resource, e.g., rate limiting (for distributed limits, Redis + Lua is often recommended).
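The limiting effect can be checked with a small experiment. In this sketch (class SemaphoreLimitDemo, the 20 ms sleep, and the task counts are assumptions) many tasks compete for a few permits while a counter records the highest number of tasks that were inside the guarded section at once.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Semaphore;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

class SemaphoreLimitDemo {
    /** Runs `tasks` tasks guarded by `permits` permits and returns the peak concurrency observed. */
    static int maxConcurrent(int permits, int tasks) {
        Semaphore semaphore = new Semaphore(permits);
        AtomicInteger running = new AtomicInteger();
        AtomicInteger peak = new AtomicInteger();
        ExecutorService pool = Executors.newFixedThreadPool(tasks);
        for (int i = 0; i < tasks; i++) {
            pool.execute(() -> {
                try {
                    semaphore.acquire(); // blocks while all permits are taken
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    return;
                }
                try {
                    peak.accumulateAndGet(running.incrementAndGet(), Math::max);
                    Thread.sleep(20); // simulated work on the protected resource
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                } finally {
                    running.decrementAndGet();
                    semaphore.release(); // always return the permit
                }
            });
        }
        pool.shutdown();
        try {
            pool.awaitTermination(10, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return peak.get();
    }

    public static void main(String[] args) {
        System.out.println("peak concurrency: " + maxConcurrent(3, 10));
    }
}
```

The counter is decremented before the permit is released, so the recorded peak can never exceed the number of permits.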

What is the principle of Semaphore?#

Semaphore is a form of a shared lock; its internal state (state) is initialized to permits. You can interpret permits as the number of available tokens; only threads that hold a permit can proceed.

Calling semaphore.acquire() makes a thread try to obtain a permit. If at least one permit is available (state - 1 would still be >= 0), the acquire succeeds and state is decremented via CAS (state = state - 1). Otherwise there are not enough permits: a Node is created and added to the blocking queue, and the current thread is parked.

/**
 *  Get a permit
 */
public void acquire() throws InterruptedException {
    sync.acquireSharedInterruptibly(1);
}
/**
 * In shared mode, get a permit; success returns; failure adds to the blocked queue and blocks
 */
public final void acquireSharedInterruptibly(int arg)
    throws InterruptedException {
    if (Thread.interrupted())
      throw new InterruptedException();
    if (tryAcquireShared(arg) < 0)
      doAcquireSharedInterruptibly(arg);
}

Calling semaphore.release() releases a permit by incrementing state via CAS (state = state + 1). After a successful release, a waiting thread in the sync queue is woken up and retries the acquire: if a permit is available it decrements state and proceeds; otherwise it goes back into the blocking queue and parks again.

// Release a permit
public void release() {
    sync.releaseShared(1);
}

// Release a shared lock and wake up a thread waiting in the sync queue.
public final boolean releaseShared(int arg) {
    // Release shared lock
    if (tryReleaseShared(arg)) {
      // Wake up a thread in the sync queue
      doReleaseShared();
      return true;
    }
    return false;
}

What is CountDownLatch used for?#

CountDownLatch lets one or more threads block in await() until count operations running in other threads have all completed.

CountDownLatch is a one-shot synchronization aid; the count cannot be reset after construction.

What is the principle of CountDownLatch?#

CountDownLatch uses a shared lock that initializes the AQS state to count. When a thread calls countDown(), it decrements the state via CAS. If state is not zero, await() blocks; when the count reaches zero, awaiting threads proceed.

Have you used CountDownLatch? In what scenarios?#

CountDownLatch lets a thread wait until a group of tasks has finished. For example, read and process six files in parallel, and once all six are done, aggregate the results and proceed.

Pseudo-code:

import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class CountDownLatchExample1 {
    // Number of files to process
    private static final int threadCount = 6;

    public static void main(String[] args) throws InterruptedException {
        // Create a fixed-size thread pool (in production, prefer the ThreadPoolExecutor constructor)
        ExecutorService threadPool = Executors.newFixedThreadPool(10);
        final CountDownLatch countDownLatch = new CountDownLatch(threadCount);
        for (int i = 0; i < threadCount; i++) {
            final int threadnum = i;
            threadPool.execute(() -> {
                try {
                    // File processing logic
                    //......
                } catch (Exception e) {
                    // Catch Exception so the placeholder body compiles; narrow it for real logic
                    e.printStackTrace();
                } finally {
                    // One file finished
                    countDownLatch.countDown();
                }

            });
        }
        // Block until all six tasks have counted down
        countDownLatch.await();
        threadPool.shutdown();
        System.out.println("finish");
    }
}

Improvements?

You can use CompletableFuture for more elegant asynchronous programming. Java 8’s CompletableFuture offers many multithreading-friendly methods to compose asynchronous tasks (asynchronous, serial, parallel, or waiting for all tasks to finish) conveniently.

// runAsync takes a Runnable and yields CompletableFuture<Void>;
// supplyAsync would require the lambda to return a value.
CompletableFuture<Void> task1 =
    CompletableFuture.runAsync(()->{
        // custom operation
    });
......
CompletableFuture<Void> task6 =
    CompletableFuture.runAsync(()->{
        // custom operation
    });
......
CompletableFuture<Void> headerFuture=CompletableFuture.allOf(task1,.....,task6);

try {
    // Wait for all tasks to finish
    headerFuture.join();
} catch (Exception ex) {
    //......
}
System.out.println("all done. ");

The above can be further optimized; when there are many tasks, listing each task is impractical; you can loop to add tasks.

// File paths
List<String> filePaths = Arrays.asList(...);
// Asynchronously process all files (doSomeThing is assumed to return a CompletableFuture<String>)
List<CompletableFuture<String>> fileFutures = filePaths.stream()
    .map(filePath -> doSomeThing(filePath))
    .collect(Collectors.toList());
// Combine them
CompletableFuture<Void> allFutures = CompletableFuture.allOf(
    fileFutures.toArray(new CompletableFuture[fileFutures.size()])
);

What is CyclicBarrier used for?#

CyclicBarrier is very similar to CountDownLatch: it can also make threads wait for one another, but it is more complex and more powerful. Its main use cases overlap with those of CountDownLatch.

CountDownLatch is based on AQS; CyclicBarrier is based on ReentrantLock (which also belongs to AQS synchronizers) and Condition.

CyclicBarrier means a barrier that can be reused (cyclic). It ensures a group of threads arriving at the barrier are blocked until the last thread arrives, at which point the barrier opens and all waiting threads proceed.

What is the principle of CyclicBarrier?#

Internally, CyclicBarrier keeps a count variable as its counter, initialized to parties. Each thread that arrives at the barrier decrements count by one. When count reaches 0, the last thread of the current generation has arrived, and that thread runs the barrier action supplied to the constructor (if any) before the waiting threads are released.

The basic constructor CyclicBarrier(int parties) takes the number of threads the barrier intercepts. Each thread calls await() to signal that it has reached the barrier and is then blocked. The parties value is how many threads must arrive before the barrier opens.
await() simply calls dowait(false, 0L). The calling thread blocks until the number of arrived threads reaches parties, at which point the barrier opens and all of them continue.
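The reuse that gives CyclicBarrier its name can be demonstrated with a short sketch. The class CyclicBarrierDemo and the method runRounds are illustrative names; the barrier action merely counts how many times the barrier has opened.

```java
import java.util.concurrent.BrokenBarrierException;
import java.util.concurrent.CyclicBarrier;
import java.util.concurrent.atomic.AtomicInteger;

class CyclicBarrierDemo {
    /** Lets `parties` threads meet at the same barrier `rounds` times; returns how often it opened. */
    static int runRounds(int parties, int rounds) {
        AtomicInteger barrierTrips = new AtomicInteger();
        // The barrier action runs once per round, on the last thread to arrive.
        CyclicBarrier barrier = new CyclicBarrier(parties, barrierTrips::incrementAndGet);
        Thread[] workers = new Thread[parties];
        for (int i = 0; i < parties; i++) {
            workers[i] = new Thread(() -> {
                try {
                    for (int r = 0; r < rounds; r++) {
                        barrier.await(); // blocks until `parties` threads have arrived, then resets
                    }
                } catch (InterruptedException | BrokenBarrierException e) {
                    Thread.currentThread().interrupt();
                }
            });
            workers[i].start();
        }
        for (Thread worker : workers) {
            try {
                worker.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
        return barrierTrips.get();
    }

    public static void main(String[] args) {
        System.out.println("barrier opened " + runRounds(3, 2) + " times");
    }
}
```

A CountDownLatch could not be reused for the second round, because its count cannot be reset after construction.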


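Java Concurrent Programming#

What are threads and processes?#

What is a process?#

A process is one execution of a program and the basic unit by which the system runs programs, so a process is dynamic. Running a program is the life of a process from creation through execution to termination.

In Java, starting the main function actually starts a JVM process, and the thread that runs main is one thread within that process, known as the main thread.

What is a thread?#

A thread is similar to a process but is a smaller unit of execution. A process can create multiple threads while it runs. Unlike processes, the threads of one process share that process’s heap and method area, while each thread has its own program counter, JVM stack, and native method stack. Creating a thread or switching between threads therefore costs the system far less than the same operations on processes, which is why threads are also called lightweight processes.

Java programs are inherently multithreaded. The following code uses JMX to show which threads an ordinary Java program has.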

1
public class MultiThread {
2
 public static void main(String[] args) {
3
  // 获取 Java 线程管理 MXBean
4
 ThreadMXBean threadMXBean = ManagementFactory.getThreadMXBean();
5
  // 不需要获取同步的 monitor 和 synchronizer 信息，仅获取线程和线程堆栈信息
6
  ThreadInfo[] threadInfos = threadMXBean.dumpAllThreads(false, false);
7
  // 遍历线程信息，仅打印线程 ID 和线程名称信息
8
  for (ThreadInfo threadInfo : threadInfos) {
9
   System.out.println("[" + threadInfo.getThreadId() + "] " + threadInfo.getThreadName());
10
  }
11
 }
12
}

上述程序输出如下（输出内容可能不同，不用太纠结下面每个线程的作用，只用知道 main 线程执行 main 方法即可）：

1
[5] Attach Listene r //添加事件
2
[4] Signal Dispatcher // 分发处理给 JVM 信号的线程
3
[3] Finalizer //调用对象 finalize 方法的线程
4
[2] Reference Handler //清除 reference 线程
5
[1] main //main 线程,程序入口

このように、上の出力から見てわかるのは：Java プログラムの実行は main スレッドと他の複数のスレッドが同時に動作しているということです。

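What is the difference between Java threads and operating system threads?#

Before JDK 1.2, Java threads were implemented as green threads, a kind of user-level thread: the JVM simulated multithreading itself and did not depend on the operating system. Green threads had limitations compared with native threads. They could not directly use operating system features such as asynchronous I/O, and they ran on a single kernel thread, so they could not exploit multiple cores. From JDK 1.2 onward, Java threads are therefore implemented on native threads: the JVM uses the operating system's kernel-level threads directly, and the kernel schedules and manages them.

User threads and kernel threads differ as follows:

User thread: a thread managed and scheduled by a user-space program. It runs in user space, the area reserved for applications.
Kernel thread: a thread managed and scheduled by the operating system kernel. It runs in kernel space, which only kernel code can access.

In one sentence: today's Java threads are essentially operating system threads.

A thread model describes how user threads are associated with kernel threads. There are three common models:

One-to-one (one user thread maps to one kernel thread)
Many-to-one (multiple user threads map to one kernel thread)
Many-to-many (multiple user threads map to multiple kernel threads)

On mainstream operating systems such as Windows and Linux, Java threads use the one-to-one model: one Java thread corresponds to one kernel thread. Solaris is a special case. Solaris itself supports the many-to-many model, and HotSpot VM supports both many-to-many and one-to-one on Solaris.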

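Briefly describe the relationship, differences, and trade-offs between threads and processes#

The relationship between processes and threads, seen from the JVM.

The figure below shows the Java memory areas and is used to explain the relationship between threads and processes from the JVM's point of view.

As the figure shows, a process can contain multiple threads. The threads share the process's heap and method area (metaspace since JDK 1.8), but each thread has its own program counter, JVM stack, and native method stack.

Summary: a thread is a smaller unit of execution carved out of a process. The biggest difference is that processes are basically independent of one another, whereas threads in the same process can easily affect each other. Threads have low execution overhead but make resource management and protection harder; processes are the opposite.

The rest of this section extends this point.

Consider the following questions: why are the program counter, the JVM stack, and the native method stack private to each thread? Why are the heap and the method area shared between threads?

Why is the program counter private?#

The program counter has two main roles:

The bytecode interpreter reads instructions one after another by changing the program counter, which implements control flow such as sequential execution, branches, loops, and exception handling.
With multiple threads, the program counter records where the current thread is executing, so that when the thread is switched back in it can resume from where it left off.

Note that while a native method is executing, the program counter holds an undefined address. Only while Java code is executing does it hold the address of the next instruction.

The program counter is therefore private mainly so that a thread can be restored to the correct execution position after a thread switch.

Why are the JVM stack and the native method stack private?#

JVM stack: before each Java method executes, a stack frame is created to hold the local variable table, the operand stack, the constant pool reference, and similar information. The span from a method call to its completion corresponds to a stack frame being pushed onto and popped from the JVM stack.
Native method stack: its role is very similar to that of the JVM stack. The difference is that the JVM stack serves the execution of Java methods (that is, bytecode), while the native method stack serves the native methods used by the JVM. In HotSpot, the two are merged.

The JVM stack and the native method stack are therefore private so that a thread's local variables cannot be accessed by other threads.

The heap and the method area in one sentence#

The heap and the method area are resources shared by all threads. The heap is the largest memory area in the process and mainly holds newly created objects (almost all objects are allocated there). The method area holds loaded class information, constants, static variables, JIT-compiled code, and similar data.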

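The difference between concurrency and parallelism#

Concurrency: two or more tasks execute within the same time period.
Parallelism: two or more tasks execute at the same instant.

The key point is whether the tasks really run simultaneously.

The difference between synchronous and asynchronous#

Synchronous: after a call is issued, it does not return until the result is available; the caller waits.
Asynchronous: after a call is issued, it returns immediately without waiting for the result.

Why use multithreading?#

First, the big picture:

From the computer's low-level perspective: a thread can be seen as a lightweight process and is the smallest unit of program execution. Switching and scheduling threads costs far less than doing so for processes. In the multi-core era, multiple threads can also run at the same time, which reduces the overhead of thread context switching.
From the trend of today's internet: systems routinely need to handle millions or even tens of millions of concurrent requests. Multithreaded concurrent programming is the foundation for building highly concurrent systems, and good use of threads can greatly improve a system's overall concurrency and performance.

Looking deeper at the low level:

Single-core era: multithreading mainly improves the utilization of the CPU and the I/O system within a single process. If only one thread is running and it blocks on an I/O request, the whole process blocks. Because only one of the CPU and the I/O device is busy at a time, overall efficiency is roughly 50%. With multiple threads, other threads can keep using the CPU while one is blocked on I/O, which improves resource utilization.
Multi-core era: multithreading mainly improves a process's ability to use multiple CPU cores. For a complex computation, a single thread can use only one core. If the work is split across threads that are mapped onto several CPUs, and there is no resource contention, execution efficiency improves markedly. In theory the running time approaches (single-core execution time) / (number of CPU cores).

What problems can multithreading cause?#

Concurrent programming aims to make programs more efficient and faster, but it does not always speed things up. It can also lead to problems such as memory leaks, deadlocks, and thread-safety issues.

How should thread-safe and thread-unsafe be understood?#

Both terms describe whether access to the same data stays correct and consistent in a multithreaded environment.

Thread-safe means that the correctness and consistency of the data are guaranteed even when multiple threads access the same data at the same time.
Thread-unsafe means that simultaneous access can leave the data corrupted, wrong, or incomplete.

Does running multiple threads on a single-core CPU always improve efficiency?#

Whether multiple threads help on a single-core CPU depends on the kind of work the threads do. There are two kinds of task: CPU-intensive and I/O-intensive.

CPU-intensive tasks occupy a lot of CPU. Running several of them at once causes frequent thread switches, which adds overhead and lowers efficiency.
I/O-intensive tasks spend much of their time waiting for I/O and do not occupy the CPU while waiting. Multiple threads can use the CPU time that would otherwise be idle during those waits, which raises efficiency.

So with a single core, more threads tend to reduce efficiency for CPU-intensive work and increase it for I/O-intensive work, up to a limit set by the system's capacity.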

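Describe the life cycle and states of a thread#

At any given moment in its life cycle, a Java thread is in exactly one of the following six states.

NEW: initial state. The thread has been created but start() has not been called.
RUNNABLE: runnable state. start() has been called, and the thread is either running or waiting to be scheduled.
BLOCKED: blocked state, waiting for a lock to be released.
WAITING: waiting state. The thread waits for another thread to take some action, such as a notification or an interrupt.
TIMED_WAITING: timed waiting state. The thread waits for a specified time and returns on its own when the time is up, rather than waiting indefinitely as in WAITING.
TERMINATED: terminated state. The thread has finished running.

A thread does not stay in one state or move through them in a fixed order; it switches between states as the code executes.

From the figure above: a thread is in the NEW state after creation. After start() is called it becomes READY, and once it gets a CPU time slice it is RUNNING. The JVM reports both READY and RUNNING as RUNNABLE.

When a thread executes wait(), it enters the WAITING state and must be notified by another thread before it can return to the running state.
TIMED_WAITING adds a timeout to the waiting state. A thread enters it through sleep(long) or wait(long) and returns to RUNNABLE when the timeout expires.
When a thread tries to enter a synchronized method or block while another thread holds the same lock, it enters the BLOCKED state.
After run() finishes, the thread enters the TERMINATED state.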
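
The state transitions can be observed with Thread.getState(). The following is a small sketch rather than a complete tour of all six states: it samples a thread before start(), while it is parked in wait(), and after join(). The class name ThreadStateSketch and the helper observe() are hypothetical.

```java
import java.util.Arrays;

class ThreadStateSketch {
    // Returns the states seen before start(), while parked in wait(), and after join().
    static Thread.State[] observe() throws InterruptedException {
        final Object lock = new Object();
        final boolean[] released = {false}; // guarded by lock
        Thread worker = new Thread(() -> {
            synchronized (lock) {
                while (!released[0]) { // the loop guards against spurious wakeups
                    try {
                        lock.wait();
                    } catch (InterruptedException e) {
                        Thread.currentThread().interrupt();
                        return;
                    }
                }
            }
        });
        Thread.State beforeStart = worker.getState();
        worker.start();
        // Poll until the worker has reached wait().
        Thread.State observed = worker.getState();
        while (observed != Thread.State.WAITING) {
            Thread.sleep(5);
            observed = worker.getState();
        }
        synchronized (lock) {
            released[0] = true;
            lock.notifyAll();
        }
        worker.join();
        return new Thread.State[] {beforeStart, observed, worker.getState()};
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(Arrays.toString(observe())); // [NEW, WAITING, TERMINATED]
    }
}
```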

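What is a thread context switch?#

While it runs, a thread has its own execution conditions and state, known as its context. A thread gives up the CPU in the following situations:

It yields the CPU voluntarily, for example by calling sleep() or wait().
Its time slice is used up.
It blocks, for example on an I/O request.
It terminates.

In the first three cases a thread switch occurs: the context of the current thread is saved and the context of the next thread to run is restored. This is called a context switch.

Context switching is a basic feature of modern operating systems. Each save and restore consumes CPU, memory, and other resources, so it affects efficiency, and frequent switching lowers overall performance.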

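What is a thread deadlock, and how can it be avoided?#

Understanding deadlock#

A deadlock is a state in which multiple threads are blocked at the same time, with one or all of them waiting for a resource to be released. Because the threads are blocked indefinitely, the program cannot terminate normally.

For example, suppose thread A holds resource 2 and thread B holds resource 1. Each requests the resource held by the other, so they wait for each other and deadlock.

The four necessary conditions for deadlock:

Mutual exclusion: a resource can be held by only one thread at a time.
Hold and wait: a thread that blocks while requesting a resource keeps the resources it already holds.
No preemption: a resource held by a thread cannot be taken away by another thread before the holder has finished with it.
Circular wait: several threads form a cycle in which each waits for a resource held by the next.

How can deadlock be prevented and avoided?#

To prevent deadlock, break one of the conditions required for it.
1. Break hold and wait: request all resources at once.
2. Break no preemption: a thread that holds some resources and cannot obtain the others releases the ones it holds.
3. Break circular wait: request resources in a fixed order and release them in reverse order.
To avoid deadlock, evaluate each allocation with an algorithm (for example the banker's algorithm) so that the system stays in a safe state.

A safe state is one in which the system can allocate resources to the threads in some order (P1, P2, P3, ..., Pn) such that every thread can obtain its maximum resource requirement and run to completion. The sequence <P1, P2, P3, ..., Pn> is called a safe sequence.

In the following code, thread 2 requests the locks in the same order as thread 1, so no deadlock occurs.

// resource1 and resource2 are lock objects shared by both threads,
// e.g. two static final Object fields of the enclosing class.
new Thread(() -> {
    synchronized (resource1) {
        System.out.println(Thread.currentThread() + "get resource1");
        try {
            Thread.sleep(1000);
        } catch (InterruptedException e) {
            e.printStackTrace();
        }
        System.out.println(Thread.currentThread() + "waiting get resource2");
        synchronized (resource2) {
            System.out.println(Thread.currentThread() + "get resource2");
        }
    }
}, "Thread 1").start();

new Thread(() -> {
    synchronized (resource1) {
        System.out.println(Thread.currentThread() + "get resource1");
        try {
            Thread.sleep(1000);
        } catch (InterruptedException e) {
            e.printStackTrace();
        }
        System.out.println(Thread.currentThread() + "waiting get resource2");
        synchronized (resource2) {
            System.out.println(Thread.currentThread() + "get resource2");
        }
    }
}, "Thread 2").start();

Why does this code avoid deadlock?

Thread 1 first acquires the monitor lock of resource1, so thread 2 cannot acquire it. Thread 1 then acquires the monitor lock of resource2 without contention. Once thread 1 releases both locks, thread 2 can acquire them and run. The circular wait condition is broken, so deadlock is avoided.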

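sleep() compared with wait()#

What they have in common#

Both pause the execution of a thread.

Differences#

sleep() does not release the lock, whereas wait() does.
wait() is normally used for communication and coordination between threads, whereas sleep() is used simply to pause execution.
After wait() is called, the thread does not wake up on its own; another thread must call notify() or notifyAll() on the same object. After sleep(), the thread wakes up automatically when the time is up. wait(long timeout) also wakes the thread automatically when the timeout expires.
sleep() is a static native method of the Thread class, whereas wait() is a native method of the Object class.

Why is wait() not defined in Thread?#

wait() makes the thread that holds an object's lock wait and automatically releases the object lock held by the current thread. Every object (Object) has a lock, and to release it and put the current thread into the WAITING state, the operation has to act on that object, not on the current thread (Thread).

A similar question: why is sleep() defined in Thread?

Because sleep() only pauses the current thread. It does not involve any object and does not need to acquire an object lock.

Can the run method of the Thread class be called directly?#

Creating a new Thread puts the thread in the new state. Calling start() starts the thread and makes it runnable; it begins to run once it is given a time slice. start() performs the preparation for the thread and then automatically executes the body of run(), which is real multithreaded work. Calling run() directly, however, executes run() as an ordinary method on the calling thread (for example the main thread) and not on a separate thread, so it is not multithreaded work.

Summary: call start() to start a thread and make it runnable. Calling run() directly does not run the code on a new thread.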
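
The difference is easy to see by recording which thread actually executes the body. This is a small illustrative sketch; StartVsRunSketch and the thread names "worker-run" and "worker-start" are made up for the example.

```java
import java.util.Arrays;

class StartVsRunSketch {
    // names[0]: thread that executed the body via run(); names[1]: via start().
    static String[] executingThreads() throws InterruptedException {
        String[] names = new String[2];

        Thread viaRun = new Thread(
                () -> names[0] = Thread.currentThread().getName(), "worker-run");
        viaRun.run(); // ordinary method call: the body runs on the calling thread

        Thread viaStart = new Thread(
                () -> names[1] = Thread.currentThread().getName(), "worker-start");
        viaStart.start(); // new thread: the body runs on "worker-start"
        viaStart.join();  // join() also makes the write to names[1] visible here

        return names;
    }

    public static void main(String[] args) throws InterruptedException {
        // When launched from main, the first entry is "main", the second "worker-start".
        System.out.println(Arrays.toString(executingThreads()));
    }
}
```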

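The volatile keyword#

How is the visibility of a variable guaranteed?#

In Java, the volatile keyword guarantees the visibility of a variable. Declaring a variable volatile tells the JVM that the variable is shared and unstable and must be read from main memory every time it is used.

JMM (Java Memory Model)

The volatile keyword is not unique to Java; C has it as well, where its most basic meaning is to disable the CPU cache. Marking a variable volatile tells the compiler that the variable must be read from main memory whenever it is used.

volatile guarantees the visibility of data but not its atomicity. The synchronized keyword guarantees both.

How is instruction reordering prevented?#

Besides guaranteeing visibility, volatile also plays an important role in preventing the JVM from reordering instructions. When a variable is declared volatile, specific memory barriers are inserted around reads and writes of the variable, which prohibits reordering.

Java's Unsafe class exposes three memory-barrier methods that hide the differences between operating systems:

public native void loadFence();
public native void storeFence();
public native void fullFence();

In theory these three methods can achieve the same reordering guarantees as volatile, but they are more cumbersome to use.

A common interview question illustrates how volatile prevents reordering:

"Do you know the singleton pattern? Write one by hand, and explain how the double-checked locking singleton works."

A singleton implemented with double-checked locking (thread-safe):

public class Singleton {

    private volatile static Singleton uniqueInstance;

    private Singleton() {
    }

    public static Singleton getUniqueInstance() {
        // First check whether the instance has already been created
        if (uniqueInstance == null) {
            // Lock on the class object
            synchronized (Singleton.class) {
                if (uniqueInstance == null) {
                    uniqueInstance = new Singleton();
                }
            }
        }
        return uniqueInstance;
    }
}

Declaring uniqueInstance as volatile is essential. The statement uniqueInstance = new Singleton(); actually executes in three steps:

1. Allocate memory for uniqueInstance
2. Initialize uniqueInstance
3. Point uniqueInstance at the allocated memory address

Because the JVM may reorder instructions, the execution order can become 1 -> 3 -> 2. This is harmless in a single thread, but with multiple threads one thread may obtain an instance that has not yet been initialized. For example, if T1 has executed steps 1 and 3 and T2 then calls getUniqueInstance(), T2 sees that uniqueInstance is not null and returns it, even though it has not been initialized yet.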

Does volatile guarantee atomicity?#

volatile guarantees the visibility of a variable, but it does not make operations on the variable atomic.

The following code demonstrates this.

import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class VolatileAtomicityDemo {
    public volatile static int inc = 0;

    public void increase() {
        inc++;
    }

    public static void main(String[] args) throws InterruptedException {
        ExecutorService threadPool = Executors.newFixedThreadPool(5);
        VolatileAtomicityDemo volatileAtomicityDemo = new VolatileAtomicityDemo();
        for (int i = 0; i < 5; i++) {
            threadPool.execute(() -> {
                for (int j = 0; j < 500; j++) {
                    volatileAtomicityDemo.increase();
                }
            });
        }
        // Wait 1.5 s so the tasks above can finish
        Thread.sleep(1500);
        System.out.println(inc);
        threadPool.shutdown();
    }
}

In theory this code should print 2500, but in practice the result is often smaller than 2500.

The reason is that volatile guarantees visibility, while inc++ is a compound operation made of three steps and is not atomic:

Read the value of inc
Add 1 to it
Write the new value back to memory

volatile cannot make these three steps atomic as a unit. To fix this, use synchronized, Lock, or AtomicInteger.

Fix with synchronized:

public synchronized void increase() {
    inc++;
}

Fix with AtomicInteger:

public AtomicInteger inc = new AtomicInteger();

public void increase() {
    inc.getAndIncrement();
}

Fix with ReentrantLock:

Lock lock = new ReentrantLock();
public void increase() {
    lock.lock();
    try {
        inc++;
    } finally {
        lock.unlock();
    }
}

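Optimistic locks and pessimistic locks#

What is a pessimistic lock?#

A pessimistic lock assumes the worst: it expects every access to a shared resource to cause trouble (for example, that the data will be modified), so it locks the resource every time it is acquired. Other threads that want the resource must wait. In other words, the shared resource is used by one thread at a time, the others block, and the resource is handed over when the holder is done.

Exclusive locks such as synchronized and ReentrantLock in Java implement the pessimistic approach.

public void performSynchronisedTask() {
    synchronized (this) {
        // operation that needs synchronization
    }
}

private Lock lock = new ReentrantLock();
lock.lock();
try {
    // operation that needs synchronization
} finally {
    lock.unlock();
}

Under high concurrency, heavy lock contention blocks threads, and a large number of blocked threads increases context switching and therefore performance overhead. Pessimistic locks can also lead to deadlock, which stops the code from running normally.

What is an optimistic lock?#

An optimistic lock assumes the best: it expects accesses to a shared resource not to conflict. Threads do not take a lock. Instead, when committing a change, a thread verifies that the resource (the data) has not been modified by another thread, using a version number mechanism or the CAS algorithm.

The atomic variable classes in the java.util.concurrent.atomic package (for example AtomicInteger and LongAdder) are one implementation of optimistic locking based on CAS (Compare And Swap).

// Under high concurrency, LongAdder can outperform AtomicInteger and AtomicLong.
// The price is higher memory use: it trades space for time.
LongAdder sum = new LongAdder();
sum.increment();

Under high concurrency, optimistic locking works well when reads dominate and conflicts are rare. When conflicts are frequent (write-heavy workloads), failures and retries become frequent and can put heavy load on the CPU. Classes such as LongAdder are one way to reduce the number of failed retries.

Pessimistic locks suit write-heavy workloads, where they avoid repeated failures and retries and keep performance stable. Optimistic locks suit read-heavy workloads with little contention.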

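How is an optimistic lock implemented?#

Optimistic locks are usually implemented with a version number mechanism or with CAS. The general ideas are as follows.

Version number mechanism#

A version field is added to the database table, and version is incremented every time the data is modified. When thread A wants to update the data, it reads version together with the data. At commit time the update is applied only if the version it read still equals the current version of the data; otherwise the operation is retried.

A simple example: an account table has a version field, the current version is 1, and the current balance is $100.

Operator A reads the row (version = 1) and deducts $50 from the balance ($100 - $50).
Operator B also reads the row (version = 1) and deducts $20 from the balance ($100 - $20).
A submits the update. The version is still 1, so the update succeeds and version becomes 2.
B submits the update, but the current version in the database is 2 while B submitted version 1, so the update is rejected.

This prevents an update based on stale data from overwriting a newer result.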
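
The account example can be sketched in memory as follows. This is only an illustration of the check-then-bump rule, under the assumption that a synchronized method stands in for the database's atomic conditional update; the class VersionedAccountSketch and its methods are hypothetical.

```java
class VersionedAccountSketch {
    private long balance;
    private int version = 1;

    VersionedAccountSketch(long initialBalance) {
        this.balance = initialBalance;
    }

    synchronized long balance() {
        return balance;
    }

    synchronized int version() {
        return version;
    }

    // Applies the update only if the caller read the current version, in the
    // spirit of "UPDATE ... SET balance = ?, version = version + 1 WHERE version = ?".
    synchronized boolean tryUpdate(long newBalance, int expectedVersion) {
        if (version != expectedVersion) {
            return false; // someone else committed first; the caller should re-read and retry
        }
        balance = newBalance;
        version++;
        return true;
    }

    public static void main(String[] args) {
        VersionedAccountSketch account = new VersionedAccountSketch(100);
        int versionSeenByA = account.version(); // 1
        int versionSeenByB = account.version(); // 1
        System.out.println(account.tryUpdate(100 - 50, versionSeenByA)); // true
        System.out.println(account.tryUpdate(100 - 20, versionSeenByB)); // false, version is now 2
        System.out.println(account.balance());                           // 50
    }
}
```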

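The CAS (Compare And Swap) algorithm#

CAS is an atomic operation that updates a variable to a new value only if its current value matches an expected value. It involves three operands:

V: the variable to update (Var)
E: the expected value (Expected)
N: the new value (New)

V is atomically updated to N only if V equals E. If they differ, another thread has already changed V and the update fails.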
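
A typical way to use CAS from application code is a retry loop on top of AtomicInteger.compareAndSet. The sketch below is illustrative (CasLoopSketch, addWithCas, and countInParallel are made-up names); AtomicInteger already offers addAndGet(), which does the same thing, so the explicit loop is only there to make the V/E/N roles visible.

```java
import java.util.concurrent.atomic.AtomicInteger;

class CasLoopSketch {
    // Adds delta to value using an explicit compare-and-set retry loop.
    static int addWithCas(AtomicInteger value, int delta) {
        while (true) {
            int expected = value.get();      // E: the value we believe V holds
            int next = expected + delta;     // N: the value we want to install
            if (value.compareAndSet(expected, next)) {
                return next;                 // V == E held, so V is now N
            }
            // Another thread changed V in the meantime; re-read and retry.
        }
    }

    // Increments a shared counter from several threads and returns the final value.
    static int countInParallel(int threads, int incrementsPerThread) throws InterruptedException {
        AtomicInteger counter = new AtomicInteger();
        Thread[] workers = new Thread[threads];
        for (int i = 0; i < threads; i++) {
            workers[i] = new Thread(() -> {
                for (int j = 0; j < incrementsPerThread; j++) {
                    addWithCas(counter, 1);
                }
            });
            workers[i].start();
        }
        for (Thread worker : workers) {
            worker.join();
        }
        return counter.get();
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(countInParallel(4, 1000)); // 4000: no increment is lost
    }
}
```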

CAS is an atomic operation that relies on an atomic CPU instruction. Java does not implement CAS directly; it is implemented through C++ inline assembly (via JNI). The sun.misc.Unsafe class provides CAS operations such as compareAndSwapObject, compareAndSwapInt, and compareAndSwapLong.

public final native boolean compareAndSwapObject(Object o, long offset, Object expected, Object update);

public final native boolean compareAndSwapInt(Object o, long offset, int expected, int update);

public final native boolean compareAndSwapLong(Object o, long offset, long expected, long update);

Problems with optimistic locks#

The ABA problem is the most common problem with optimistic locks.

Suppose a variable V had the value A when it was first read and still has the value A when it is checked before the update. That does not prove it was untouched: another thread may have changed it from A to another value and back to A in the meantime. This is the ABA problem. The usual solution is to attach a version number or timestamp to the variable.

The AtomicStampedReference class addresses the ABA problem. Its compareAndSet() updates the reference and the stamp only if the current reference equals the expected reference and the current stamp equals the expected stamp.

public boolean compareAndSet(V   expectedReference,
                             V   newReference,
                             int expectedStamp,
                             int newStamp) {
    Pair<V> current = pair;
    return
        expectedReference == current.reference &&
        expectedStamp == current.stamp &&
        ((newReference == current.reference &&
          newStamp == current.stamp) ||
         casPair(current, Pair.of(newReference, newStamp)));
}
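
As a usage sketch (the class AbaSketch and method staleCasSucceeds are hypothetical names), the following single-threaded example simulates the A -> B -> A sequence and shows that a compareAndSet carrying the old stamp is rejected even though the reference looks unchanged.

```java
import java.util.concurrent.atomic.AtomicStampedReference;

class AbaSketch {
    // Returns whether a CAS based on a stale (reference, stamp) pair succeeds.
    static boolean staleCasSucceeds() {
        AtomicStampedReference<String> ref = new AtomicStampedReference<>("A", 0);

        // A reader takes a snapshot of the reference together with its stamp.
        int[] stampHolder = new int[1];
        String seenValue = ref.get(stampHolder);
        int seenStamp = stampHolder[0];

        // Meanwhile the value goes A -> B -> A; each change bumps the stamp.
        ref.compareAndSet("A", "B", 0, 1);
        ref.compareAndSet("B", "A", 1, 2);

        // The reference is "A" again, but the stamp is now 2, so the stale CAS fails.
        return ref.compareAndSet(seenValue, "C", seenStamp, seenStamp + 1);
    }

    public static void main(String[] args) {
        System.out.println(staleCasSucceeds()); // false
    }
}
```

A plain AtomicReference in the same scenario would accept the final update, because it compares only the reference.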

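Long spin times are expensive#

CAS is often combined with spinning to retry a failed operation. If the operation keeps failing for a long time, the spinning places a heavy load on the CPU.

If the JVM can use the processor's pause instruction, efficiency improves. The pause instruction has two effects:

It delays pipelined execution so that the CPU does not consume excessive execution resources. The length of the delay depends on the implementation.
It prevents the CPU pipeline from being flushed because of a memory-order violation when the loop exits, which improves execution efficiency.

Atomicity is guaranteed only for a single shared variable#

CAS works on a single shared variable and cannot by itself cover an operation that spans several shared variables. Since JDK 1.5, however, AtomicReference guarantees atomicity between references, so several variables can be placed in one object and updated with a single CAS.

In short, either use locks or use AtomicReference to merge several shared variables into one.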
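
One way to merge several variables into one, sketched below under the assumption that the two values are a lower and an upper bound that must stay consistent, is to keep them in an immutable holder object and swap the whole object with AtomicReference.compareAndSet. RangeSketch, Range, and setLower are illustrative names.

```java
import java.util.concurrent.atomic.AtomicReference;

class RangeSketch {
    // Immutable holder: both bounds always change together.
    static final class Range {
        final int lower;
        final int upper;

        Range(int lower, int upper) {
            this.lower = lower;
            this.upper = upper;
        }
    }

    private final AtomicReference<Range> range = new AtomicReference<>(new Range(0, 10));

    // Atomically raises or lowers the lower bound, keeping lower <= upper.
    boolean setLower(int newLower) {
        while (true) {
            Range current = range.get();
            if (newLower > current.upper) {
                return false; // would break the invariant
            }
            Range updated = new Range(newLower, current.upper);
            if (range.compareAndSet(current, updated)) {
                return true;
            }
            // Another thread replaced the range; retry against the new snapshot.
        }
    }

    Range current() {
        return range.get();
    }

    public static void main(String[] args) {
        RangeSketch sketch = new RangeSketch();
        System.out.println(sketch.setLower(5));    // true
        System.out.println(sketch.setLower(11));   // false, 11 > upper bound 10
        System.out.println(sketch.current().lower); // 5
    }
}
```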

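The synchronized keyword#

What is synchronized, and what is it for?#

synchronized is a Java keyword that solves the problem of synchronizing access to resources across threads. It guarantees that a method or code block it modifies is executed by only one thread at any given time.

In early Java versions, synchronized was a heavyweight lock with poor efficiency. The monitor lock depends on the operating system's mutex lock, so suspending or waking a thread needs help from the operating system, and switching from user mode to kernel mode takes time.

Since Java 6, synchronized has received many optimizations, including spin locks, adaptive spin locks, lock elimination, lock coarsening, biased locking, and lightweight locking, which greatly reduce the overhead of lock operations. synchronized is therefore perfectly usable in real projects, and the JDK source code and many open-source frameworks use it widely.

Biased locking adds complexity to the JVM and does not benefit every application. In JDK 15 it is disabled by default (it can still be enabled with -XX:+UseBiasedLocking), and in JDK 18 it has been removed entirely and can no longer be enabled from the command line.

How is synchronized used?#

There are three main ways to use synchronized.

Modifying an instance method

synchronized void method() {
    // business logic
}

Modifying a static method
```
synchronized static void method() {
    // business logic
}
```
Static members do not belong to any instance; they are shared by the whole class.
Modifying a code block
```
synchronized(this) {
    // business logic
}
```
- synchronized(object) acquires the lock of the given object before entering the synchronized block.
- synchronized(SomeClass.class) acquires the lock of the given class before entering the synchronized code.

Summary:

A static synchronized method and a synchronized(class) block lock the class.
A synchronized instance method locks the object instance.
Avoid synchronized(String a), because string literals are cached in the string constant pool.

Can a constructor be modified with synchronized?#

In short: no, a constructor cannot be declared synchronized. A synchronized block can still be used inside the constructor body.

A constructor is itself thread-safe, and there is no such thing as a synchronized constructor.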

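What is the underlying principle of synchronized?#

The underlying principle of synchronized belongs to the JVM level.

The synchronized block case

public class SynchronizedDemo {
    public void method() {
        synchronized (this) {
            System.out.println("synchronized block");
        }
    }
}

Inspecting the bytecode of this class with javap shows monitorenter and monitorexit instructions. monitorenter marks where the synchronized block starts, and monitorexit marks where it ends.

The bytecode contains one monitorenter and two monitorexit instructions. This ensures that the lock is released correctly both on normal completion and when an exception is thrown.

When monitorenter executes, the thread tries to acquire the object's lock. If the lock counter is 0, the lock is free, so the thread acquires it and sets the counter to 1.

Only the thread that owns the object lock can execute monitorexit to release it. Executing monitorexit decrements the lock counter, and when the counter reaches 0 the lock is released.

The synchronized method case

public class SynchronizedDemo2 {
    public synchronized void method() {
        System.out.println("synchronized method");
    }
}

In this case there are no monitorenter or monitorexit instructions. Instead, the method carries the ACC_SYNCHRONIZED flag. The JVM uses this flag to tell whether a method is synchronized and performs the corresponding synchronized call.

For an instance method the JVM acquires the lock of the instance; for a static method it acquires the lock of the class.

Summary#

A synchronized block is implemented with monitorenter and monitorexit, and a synchronized method with the ACC_SYNCHRONIZED flag. In both cases the essence is acquiring the object's monitor.

What optimizations did synchronized receive after JDK 1.6, and how does lock upgrading work?#

Since Java 6, synchronized has received many optimizations: spin locks, adaptive spin locks, lock elimination, lock coarsening, biased locking, and lightweight locking all reduce the overhead of lock operations, so synchronized is much more efficient than before. (Biased locking has been removed entirely in JDK 18.)

A lock has four states: unlocked, biased, lightweight, and heavyweight. It is upgraded step by step as contention increases. Locks can be upgraded but, as a rule, not downgraded.

What is the difference between synchronized and volatile?#

volatile is a lightweight form of thread synchronization and generally performs better than synchronized. However, volatile applies only to variables, not to methods or code blocks.
volatile guarantees visibility but not atomicity. synchronized guarantees both.
volatile mainly solves the visibility of a variable across threads, whereas synchronized solves the synchronization of resource access across threads.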

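ReentrantLock#

What is ReentrantLock?#

ReentrantLock implements the Lock interface. It is a reentrant, exclusive lock that behaves much like synchronized but is more flexible and more powerful, adding advanced features such as polling, timeouts, interruption, and a choice between fair and non-fair locking.

public class ReentrantLock implements Lock, java.io.Serializable {}

ReentrantLock has an inner class Sync, which extends AQS (AbstractQueuedSynchronizer). Most of the work of acquiring and releasing the lock is implemented in Sync. Sync has two subclasses: FairSync for the fair lock and NonfairSync for the non-fair lock.

ReentrantLock uses the non-fair lock by default. A fair lock can be requested through the constructor.

// Pass a boolean: true for a fair lock, false for a non-fair lock
public ReentrantLock(boolean fair) {
    sync = fair ? new FairSync() : new NonfairSync();
}

This shows that ReentrantLock is built on AQS.

What is the difference between a fair lock and a non-fair lock?#

Fair lock: after the lock is released, the thread that has waited longest acquires it first. Performance is lower, because guaranteeing strict time order causes more context switches.
Non-fair lock: after the lock is released, a thread that arrived later may acquire it first. Performance is higher, but some threads may be unable to acquire the lock for a long time.

What is the difference between synchronized and ReentrantLock?#

Both are reentrant locks.
synchronized is implemented by the JVM, and the optimizations made to it since JDK 1.6 live inside the virtual machine and are not exposed to the programmer. ReentrantLock is implemented at the JDK API level, so its behavior can be understood by reading its source code.
ReentrantLock offers advanced features that synchronized lacks: interruptible waiting, an optional fair lock, and selective notification through multiple Condition objects.

If you need any of these features, ReentrantLock is worth considering.

What is the difference between an interruptible lock and a non-interruptible lock?#

Interruptible lock: the thread can be interrupted while it is acquiring the lock and go on to do something else. ReentrantLock is an interruptible lock; see lockInterruptibly().
Non-interruptible lock: once a thread requests the lock, it has to wait until it gets it. synchronized is a non-interruptible lock.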
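
The sketch below shows interruptible acquisition in isolation: the calling thread holds the lock, a second thread waits in lockInterruptibly(), and an interrupt makes it give up. InterruptibleLockSketch and waiterWasInterrupted are made-up names; ReentrantLock.lockInterruptibly() is the JDK method mentioned above.

```java
import java.util.concurrent.locks.ReentrantLock;

class InterruptibleLockSketch {
    // Returns true if the waiting thread abandoned the lock attempt because of an interrupt.
    static boolean waiterWasInterrupted() throws InterruptedException {
        ReentrantLock lock = new ReentrantLock();
        boolean[] interrupted = new boolean[1];

        lock.lock(); // the caller holds the lock, so the waiter cannot acquire it
        try {
            Thread waiter = new Thread(() -> {
                try {
                    lock.lockInterruptibly(); // waits, but responds to interruption
                    lock.unlock();            // only reached if the lock was acquired
                } catch (InterruptedException e) {
                    interrupted[0] = true;    // gave up waiting for the lock
                }
            });
            waiter.start();
            waiter.interrupt();
            waiter.join(); // join() makes the write to interrupted[0] visible
        } finally {
            lock.unlock();
        }
        return interrupted[0];
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(waiterWasInterrupted()); // true
    }
}
```

With a synchronized block in place of lockInterruptibly(), the waiter would stay blocked until the lock was released, regardless of the interrupt.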

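ReentrantReadWriteLock#

What is ReentrantReadWriteLock?#

ReentrantReadWriteLock implements ReadWriteLock. It lets multiple threads read at the same time while keeping writes thread-safe. The read lock is shared and the write lock is exclusive: the read lock can be held by several threads at once, whereas the write lock can be held by only one thread.

This lock is also built on AQS.

Fair and non-fair locks#

ReentrantReadWriteLock also supports fair and non-fair modes. The default is non-fair, and the mode can be set explicitly.

// Pass a boolean: true for a fair lock, false for a non-fair lock
public ReentrantReadWriteLock(boolean fair) {
    sync = fair ? new FairSync() : new NonfairSync();
    readerLock = new ReadLock(this);
    writerLock = new WriteLock(this);
}

What scenarios suit ReentrantReadWriteLock?#

ReentrantReadWriteLock improves performance when reads are frequent and writes are rare: many threads can read at the same time, while writes remain exclusive.

What is the difference between a shared lock and an exclusive lock?#

Shared lock: one lock can be held by multiple threads at the same time.
Exclusive lock: one lock can be held by only one thread at a time.

Can a thread that holds the read lock acquire the write lock?#

A thread that holds the read lock cannot acquire the write lock. While the read lock is held, by the current thread or by any other thread, an attempt to acquire the write lock fails.
A thread that holds the write lock can go on to acquire the read lock. If the write lock is held by a different thread, however, acquiring the read lock fails.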
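
Both rules can be probed from a single thread with tryLock(), which returns immediately instead of blocking. This is a minimal sketch; ReadWriteLockSketch and probe() are hypothetical names.

```java
import java.util.Arrays;
import java.util.concurrent.locks.ReentrantReadWriteLock;

class ReadWriteLockSketch {
    // result[0]: could the write lock be taken while holding the read lock?
    // result[1]: could the read lock be taken while holding the write lock?
    static boolean[] probe() {
        ReentrantReadWriteLock rw = new ReentrantReadWriteLock();
        boolean[] result = new boolean[2];

        rw.readLock().lock();
        try {
            result[0] = rw.writeLock().tryLock(); // read -> write upgrade is refused
            if (result[0]) {
                rw.writeLock().unlock();
            }
        } finally {
            rw.readLock().unlock();
        }

        rw.writeLock().lock();
        try {
            result[1] = rw.readLock().tryLock(); // the writer may also take the read lock
            if (result[1]) {
                rw.readLock().unlock();
            }
        } finally {
            rw.writeLock().unlock();
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(probe())); // [false, true]
    }
}
```

Calling writeLock().lock() instead of tryLock() in the first half would block forever, since the thread would be waiting for its own read lock to be released.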

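Why can the read lock not be upgraded to the write lock?#

The write lock is exclusive, so upgrading the read lock to the write lock would create contention between threads and could hurt performance. It can also deadlock: if two threads both hold the read lock and both try to upgrade, each waits for the other to release its read lock. For these reasons upgrading is not supported.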

ThreadLocal#

What is ThreadLocal for?#

Normally a variable can be accessed and modified by any thread. ThreadLocal is the tool to use when each thread should have its own private local variable. It binds each thread to its own value, which avoids data races between threads.

The main purpose of the ThreadLocal class is to give each thread its own value. ThreadLocal can be thought of as a box for storing data: calling get() and set() on it reads or updates the copy that belongs to the current thread.

When a ThreadLocal variable is created, each thread that accesses it gets its own local copy of the variable, which is where the name ThreadLocal comes from.

How is ThreadLocal used?#

The following example shows how ThreadLocal is used in practice.

import java.text.SimpleDateFormat;
import java.util.Random;

public class ThreadLocalExample implements Runnable {

    // SimpleDateFormat is not thread-safe, so each thread needs its own copy
    private static final ThreadLocal<SimpleDateFormat> formatter = ThreadLocal.withInitial(() -> new SimpleDateFormat("yyyyMMdd HHmm"));

    public static void main(String[] args) throws InterruptedException {
        ThreadLocalExample obj = new ThreadLocalExample();
        for (int i = 0; i < 10; i++) {
            Thread t = new Thread(obj, "" + i);
            Thread.sleep(new Random().nextInt(1000));
            t.start();
        }
    }

    @Override
    public void run() {
        System.out.println("Thread Name= " + Thread.currentThread().getName() + " default Formatter = " + formatter.get().toPattern());
        try {
            Thread.sleep(new Random().nextInt(1000));
        } catch (InterruptedException e) {
            e.printStackTrace();
        }
        // The formatter is replaced here for the current thread only; other threads are not affected
        formatter.set(new SimpleDateFormat());

        System.out.println("Thread Name= " + Thread.currentThread().getName() + " formatter = " + formatter.get().toPattern());
    }

}

The output shows that although thread 0 has changed its formatter, thread 1 still starts with the initial default pattern, and the same holds for the other threads.

The example uses a Java 8 feature: withInitial(), which takes a Supplier. It is equivalent to the following code, which overrides initialValue().

private static final ThreadLocal<SimpleDateFormat> formatter = new ThreadLocal<SimpleDateFormat>() {
    @Override
    protected SimpleDateFormat initialValue() {
        return new SimpleDateFormat("yyyyMMdd HHmm");
    }
};

Do you understand how ThreadLocal works?#

The best place to start is the source code of the Thread class.

public class Thread implements Runnable {
    //......
    // ThreadLocal values pertaining to this thread. Maintained by the ThreadLocal class.
    ThreadLocal.ThreadLocalMap threadLocals = null;

    // InheritableThreadLocal values pertaining to this thread. Maintained by the InheritableThreadLocal class.
    ThreadLocal.ThreadLocalMap inheritableThreadLocals = null;
    //......
}

The Thread class has two fields of type ThreadLocalMap, threadLocals and inheritableThreadLocals. ThreadLocalMap can be understood as a custom hash map implemented for ThreadLocal. Both fields are null by default and are created only when the current thread calls set or get on a ThreadLocal. Those calls actually invoke the corresponding get() and set() of ThreadLocalMap.

The set() method of ThreadLocal

public void set(T value) {
    // Get the current thread
    Thread t = Thread.currentThread();
    // Get the thread's internal threadLocals field (a hash map)
    ThreadLocalMap map = getMap(t);
    if (map != null)
        // Store the value in the map
        map.set(this, value);
    else
        createMap(t, value);
}
ThreadLocalMap getMap(Thread t) {
    return t.threadLocals;
}

So the value ends up in the ThreadLocalMap of the current thread, not in the ThreadLocal itself. ThreadLocal is only a wrapper around ThreadLocalMap that passes the value along. Inside ThreadLocal, Thread.currentThread() obtains the current thread, and getMap(Thread t) then gives access to that thread's ThreadLocalMap.

Every Thread has a ThreadLocalMap, which stores key-value pairs with a ThreadLocal as the key and an Object as the value.

ThreadLocalMap(ThreadLocal<?> firstKey, Object firstValue) {
    //......
}

For example, if two ThreadLocal objects are used in the same thread, the Thread still uses its single ThreadLocalMap to store the data. The key of each entry is the ThreadLocal object and the value is whatever was set through that ThreadLocal.

The data structure of ThreadLocal is shown in the figure below.

ThreadLocalMap is a static nested class of ThreadLocal.

What causes the ThreadLocal memory leak problem?#

The key used in ThreadLocalMap is a weak reference to the ThreadLocal, while the value is a strong reference. If the ThreadLocal is no longer strongly referenced from outside, the key is cleared during garbage collection but the value is not.

ThreadLocalMap then contains entries whose key is null. If nothing is done, the value can never be reclaimed by the GC, which can cause a memory leak. The ThreadLocalMap implementation takes this into account: calls to set(), get(), and remove() clean up entries whose key is null. It is best to call remove() manually once the ThreadLocal is no longer needed.

static class Entry extends WeakReference<ThreadLocal<?>> {
    /** The value associated with this ThreadLocal. */
    Object value;

    Entry(ThreadLocal<?> k, Object v) {
        super(k);
        value = v;
    }
}

About WeakReference:

An object that is reachable only through weak references has a shorter life cycle than one held by soft references. The difference is that when the garbage collector scans memory and finds an object with only weak references, it reclaims the object whether or not memory is running low. A weak reference can be used together with a reference queue: when the referenced object is garbage-collected, the weak reference is added to the associated queue.
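
The advice to call remove() matters most with pooled threads, because a worker thread outlives the task that set the value. The following sketch assumes a single-thread pool so that two tasks are guaranteed to run on the same worker; ThreadLocalCleanupSketch, CONTEXT, and the value "request-1" are illustrative.

```java
import java.util.Arrays;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

class ThreadLocalCleanupSketch {
    private static final ThreadLocal<String> CONTEXT = new ThreadLocal<>();

    // Runs two tasks on the same pooled thread and returns what each saw in CONTEXT.
    static String[] run() throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(1);
        try {
            Callable<String> firstTask = () -> {
                CONTEXT.set("request-1");
                try {
                    return CONTEXT.get();
                } finally {
                    CONTEXT.remove(); // clear the entry before the thread is reused
                }
            };
            Callable<String> secondTask = CONTEXT::get;

            String first = pool.submit(firstTask).get();
            String second = pool.submit(secondTask).get(); // same worker thread
            return new String[] {first, second};
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(Arrays.toString(run())); // [request-1, null]
    }
}
```

Without the remove() call, the second task would see "request-1" left behind by the first one, and the value would stay reachable for as long as the worker thread lives.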

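Thread pools#

What is a thread pool?#

A thread pool is a resource pool that manages a set of threads. When a task arrives, a thread is taken from the pool to handle it. When the task is finished, the thread is not destroyed but waits for the next task.

Why use a thread pool?#

Pooling is a widely used idea: thread pools, database connection pools, and HTTP connection pools all apply it. The point of pooling is to reduce the cost of acquiring a resource and to improve resource utilization.

A thread pool provides a way to limit and manage resources. Each pool also keeps some basic statistics, such as the number of completed tasks.

The following benefits of thread pools are taken from the book "The Art of Java Concurrency Programming":

Lower resource consumption. Reusing existing threads reduces the cost of creating and destroying threads.
Faster response. When a task arrives, it can run immediately without waiting for a thread to be created.
Better manageability of threads. Threads are a scarce resource; creating them without limit consumes system resources and reduces stability. A thread pool allows unified allocation, tuning, and monitoring.

How is a thread pool created?#

Through the ThreadPoolExecutor constructor (recommended).
Through Executors, the utility class of the Executor framework.

Executors can create several kinds of thread pool:

FixedThreadPool: a pool with a fixed number of threads. When a new task arrives, it runs immediately if a thread is idle; otherwise it waits in the queue. The queue is unbounded, so it never fills up.
SingleThreadExecutor: a pool with exactly one thread. Additional tasks wait in the queue and run in first-in, first-out order.
CachedThreadPool: a pool that grows as needed. Its initial size is 0. When a new task arrives and no thread is idle, a new thread is created. If no new tasks arrive for a while, idle threads time out and are destroyed, and the pool shrinks.
ScheduledThreadPool: a pool that runs tasks after a given delay or periodically.

Why are the built-in thread pools not recommended?#

The "Concurrency" section of the Alibaba Java Development Manual states that thread resources must be provided through a thread pool and that applications must not create threads explicitly on their own.

The reason: a thread pool reduces the cost of creating and destroying threads and avoids running out of resources. Without a pool, the system may create large numbers of threads of the same kind, which can exhaust memory (OOM) or cause excessive context switching.

Using Executors directly also has drawbacks. The built-in pools can cause the following problems:

FixedThreadPool and SingleThreadExecutor use an unbounded LinkedBlockingQueue whose length can grow up to Integer.MAX_VALUE, so queued requests can pile up and cause OOM.
CachedThreadPool uses a SynchronousQueue and allows up to Integer.MAX_VALUE threads, so when there are many slow tasks it can create a huge number of threads and cause OOM.
ScheduledThreadPool uses the unbounded delayed blocking queue DelayedWorkQueue, so queued requests can pile up and cause OOM.

What do the thread pool parameters mean?#

The three most important parameters of ThreadPoolExecutor are:

corePoolSize (core pool size): the maximum number of threads that can run at the same time while the task queue has not reached its capacity.
maximumPoolSize (maximum pool size): once the task queue has reached its capacity, the number of threads that can run at the same time rises to this maximum.
workQueue (task queue): when a new task arrives, the pool checks whether the number of running threads has reached the core size. If it has, the task is placed in the queue.

Other parameters:

keepAliveTime: when the pool has more threads than corePoolSize and no new tasks arrive, the extra threads are not destroyed immediately. They are reclaimed only after they have been idle for longer than keepAliveTime.
unit: the time unit of keepAliveTime.
threadFactory: the factory the executor uses to create new threads.
handler: the rejection policy, which defines what happens when there are too many tasks to handle.

The following figure helps in understanding how the thread pool parameters relate to one another.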

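What saturation policies does a thread pool have?#

When the number of running threads has reached the maximum and the queue is also full, ThreadPoolExecutor offers the following policies:

AbortPolicy: rejects the new task and throws RejectedExecutionException.
CallerRunsPolicy: runs the rejected task directly in the thread that called execute. If the executor has already been shut down, the task is discarded. This slows down the rate at which new tasks are submitted and affects overall performance.
DiscardPolicy: silently discards the new task.
DiscardOldestPolicy: discards the oldest unhandled task in the queue.

For example, when a pool is created through Spring's ThreadPoolTaskExecutor or directly through the ThreadPoolExecutor constructor, the default policy is AbortPolicy, so a RejectedExecutionException is thrown when the queue is full. If no task may be lost, use CallerRunsPolicy.

public static class CallerRunsPolicy implements RejectedExecutionHandler {

        public CallerRunsPolicy() { }

        public void rejectedExecution(Runnable r, ThreadPoolExecutor e) {
            if (!e.isShutdown()) {
                // Run directly in the thread that submitted the task, not in a pool thread
                r.run();
            }
        }
    }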

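Which blocking queues are commonly used by thread pools?#

When a new task arrives, the pool checks whether the number of running threads has reached the core size; if so, the task is placed in the queue. Different pools use different kinds of queue, each with its own characteristics.

LinkedBlockingQueue with capacity Integer.MAX_VALUE (unbounded): used by FixedThreadPool and SingleThreadExecutor. In practice the queue never fills up.
SynchronousQueue: used by CachedThreadPool. It has no capacity and stores no elements. If an idle thread is available, the task is handed to it; otherwise a new thread is created. The number of threads can grow up to Integer.MAX_VALUE, which carries a risk of OOM.
DelayedWorkQueue (delayed blocking queue): used by ScheduledThreadPool and SingleThreadScheduledExecutor. Its elements are ordered by delay, and its maximum capacity is Integer.MAX_VALUE. Adding a task does not block; the queue grows as needed up to that limit.

How does a thread pool process a task?#

If the number of running threads is smaller than the core size, a new thread is created to run the task.
If the number of running threads is equal to or greater than the core size, the task is placed in the queue to wait.
If the task cannot be queued because the queue is full and the number of running threads is below the maximum, a new thread is created to run the task.
If the queue is full and the number of running threads has already reached the maximum, the task is rejected and the rejection policy is invoked through RejectedExecutionHandler.rejectedExecution().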
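
The four steps can be traced with a deliberately tiny pool. The sketch below assumes one core thread, a maximum of two threads, and a queue of capacity one, and submits four tasks that block until released; PoolFlowSketch and run() are illustrative names, and the sizes are chosen only to make each step visible.

```java
import java.util.Arrays;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.RejectedExecutionException;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

class PoolFlowSketch {
    // Returns {pool size, queued tasks, rejected tasks} after submitting four blocking tasks.
    static int[] run() throws InterruptedException {
        CountDownLatch release = new CountDownLatch(1);
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                1, 2,                         // corePoolSize, maximumPoolSize
                1L, TimeUnit.SECONDS,         // keepAliveTime for non-core threads
                new ArrayBlockingQueue<>(1),  // bounded work queue
                new ThreadPoolExecutor.AbortPolicy());
        Runnable blocker = () -> {
            try {
                release.await(); // keep the thread busy until released
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        };

        int rejected = 0;
        pool.execute(blocker); // step 1: below core size, a core thread is created
        pool.execute(blocker); // step 2: core thread busy, the task is queued
        pool.execute(blocker); // step 3: queue full, a non-core thread is created
        int poolSize = pool.getPoolSize();
        int queued = pool.getQueue().size();
        try {
            pool.execute(blocker); // step 4: queue full and pool at maximum
        } catch (RejectedExecutionException e) {
            rejected++;
        }

        release.countDown();
        pool.shutdown();
        pool.awaitTermination(5, TimeUnit.SECONDS);
        return new int[] {poolSize, queued, rejected};
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(Arrays.toString(run())); // [2, 1, 1]
    }
}
```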

How should thread pools be named?#

Giving a pool a name when it is created (that is, setting a prefix for its thread names) makes problems much easier to locate.

By default, threads are named like pool-1-thread-n. In practice, thread names are usually set in one of two ways:

Use Guava's ThreadFactoryBuilder
Implement ThreadFactory yourself

ThreadFactory threadFactory = new ThreadFactoryBuilder()
                        .setNameFormat(threadNamePrefix + "-%d")
                        .setDaemon(true).build();
ExecutorService threadPool = new ThreadPoolExecutor(corePoolSize, maximumPoolSize, keepAliveTime, TimeUnit.MINUTES, workQueue, threadFactory);

Alternatively, implement ThreadFactory yourself.

import java.util.concurrent.ThreadFactory;
import java.util.concurrent.atomic.AtomicInteger;

/**
 * ThreadFactory that names threads for easier debugging.
 */
public final class NamingThreadFactory implements ThreadFactory {

    private final AtomicInteger threadNum = new AtomicInteger();
    private final String name;

    /**
     * Create a thread factory with a given base name
     */
    public NamingThreadFactory(String name) {
        this.name = name;
    }

    @Override
    public Thread newThread(Runnable r) {
        Thread t = new Thread(r);
        t.setName(name + " [#" + threadNum.incrementAndGet() + "]");
        return t;
    }
}

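How should the thread pool size be set?#

Many people assume that a bigger thread pool is always better, but too many threads increase context-switching cost and overall overhead. Choosing a suitable size requires looking at how busy the CPU actually is and at the nature of the tasks.

CPU-bound tasks (N+1): with N CPU cores, about N+1 threads is a reasonable starting point. The extra thread covers occasional stalls such as page faults.
I/O-bound tasks (2N): threads spend much of their time waiting on I/O, so allocating more threads, about 2N, can improve throughput.

A more precise formula is: optimal number of threads = N (CPU cores) * (1 + WT/ST), where WT is the time a thread spends waiting and ST is the time it spends computing.

The higher WT/ST is, the more threads are needed; the lower it is, the fewer.
In practice, a tool such as VisualVM can be used to observe the WT/ST ratio.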
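As a worked example of the formula, here is a small sketch. The class and method names are hypothetical, and rounding up to a whole thread is an assumption of this sketch, not part of the formula itself.

```java
public class PoolSizeEstimator {

    /**
     * Estimates the thread count as N * (1 + WT/ST).
     * Rounding up is an assumption made for this sketch.
     */
    public static int optimalThreads(int cpuCores, double waitTime, double computeTime) {
        if (cpuCores <= 0 || waitTime < 0 || computeTime <= 0) {
            throw new IllegalArgumentException("cores and computeTime must be positive, waitTime non-negative");
        }
        return (int) Math.ceil(cpuCores * (1 + waitTime / computeTime));
    }

    public static void main(String[] args) {
        int cores = Runtime.getRuntime().availableProcessors();
        // Example figures only: 90 ms waiting for every 10 ms of computation.
        System.out.println(optimalThreads(cores, 90, 10));
    }
}
```

With 4 cores and WT/ST = 9, the estimate is 4 * (1 + 9) = 40 threads; with WT = 0 it collapses to N, which is close to the N+1 rule for CPU-bound work.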

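How can thread pool parameters be changed dynamically?#

Meituan's article "Java Thread Pool Implementation Principles and Practice at Meituan" describes a design that makes the core parameters of a thread pool configurable at run time. The three core parameters are:

corePoolSize: the core pool size, which defines the minimum number of threads that can run at the same time.
maximumPoolSize: once the queue is full, the maximum number of threads that can run at the same time.
workQueue: when a new task arrives and the number of running threads has reached the core pool size, the task is placed in this queue.

These three parameters are the most important ones in ThreadPoolExecutor and largely determine how the pool handles tasks.

corePoolSize deserves particular attention. If setCorePoolSize() is called while the program is running, the pool checks whether the current number of worker threads exceeds the new corePoolSize and, if so, reclaims the excess workers.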
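The setters that make this possible are part of ThreadPoolExecutor itself. The sketch below, with made-up sizes and a hypothetical class name, grows a pool at run time. Raising the maximum before the core size keeps corePoolSize <= maximumPoolSize at every step, which newer JDKs enforce with an IllegalArgumentException.

```java
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

public class DynamicResizeDemo {

    /** Grows a pool from core=2/max=4 to core=6/max=8 and returns {core, max}. */
    public static int[] resize() {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                2, 4, 30, TimeUnit.SECONDS, new LinkedBlockingQueue<>(100));
        try {
            // Growing: raise the maximum first, then the core size.
            // Shrinking would go the other way round: core first, then maximum.
            pool.setMaximumPoolSize(8);
            pool.setCorePoolSize(6);
            return new int[] {pool.getCorePoolSize(), pool.getMaximumPoolSize()};
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        int[] sizes = resize();
        System.out.println("core=" + sizes[0] + ", max=" + sizes[1]);
    }
}
```

In a real system the new values would typically come from a configuration center rather than being hard-coded.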

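Meituan's approach implements a custom queue with a mutable capacity, called ResizableCapacityLinkedBlockingQueue, so that the queue length can be adjusted as well. In practice you can also rely on existing open-source projects, for example:

Hippo4j: an asynchronous thread pool framework that supports dynamic changes, monitoring, and alerting for thread pools, and can be introduced without code changes.
Dynamic TP: a lightweight dynamic thread pool with built-in monitoring and alerting.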

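How would you design a thread pool that executes tasks by priority?#

Different thread pools use different blocking queues as their task queue. For example, FixedThreadPool uses an unbounded LinkedBlockingQueue; because that queue never fills up, the pool never creates more threads than its core pool size.

To handle prioritized tasks, PriorityBlockingQueue can be used as the task queue (the ThreadPoolExecutor constructor takes a workQueue parameter for exactly this).

This approach carries some risks:

PriorityBlockingQueue is unbounded, so a flood of requests can pile up and cause an OOM error.
Low-priority tasks may go unexecuted for a long time, which is a starvation problem.
Sorting the elements and keeping the queue thread-safe (it uses a ReentrantLock) can reduce performance.

The OOM risk can be mitigated by extending PriorityBlockingQueue and overriding offer() so that it returns false once the number of queued elements exceeds a threshold.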
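A minimal sketch of such a bounded queue follows. The class name BoundedPriorityBlockingQueue and the maxSize parameter are hypothetical, and the limit is only approximate because the size check and the insertion are not performed atomically.

```java
import java.util.concurrent.PriorityBlockingQueue;

public class BoundedPriorityBlockingQueue<E> extends PriorityBlockingQueue<E> {

    private static final long serialVersionUID = 1L;

    private final int maxSize; // hypothetical threshold chosen by the caller

    public BoundedPriorityBlockingQueue(int maxSize) {
        super();
        this.maxSize = maxSize;
    }

    @Override
    public boolean offer(E e) {
        // Soft limit: size() and super.offer() are separate steps, so concurrent
        // producers may overshoot maxSize slightly.
        if (size() >= maxSize) {
            return false; // lets ThreadPoolExecutor add a thread or reject the task
        }
        return super.offer(e);
    }

    public static void main(String[] args) {
        BoundedPriorityBlockingQueue<Integer> queue = new BoundedPriorityBlockingQueue<>(2);
        System.out.println(queue.offer(3)); // true
        System.out.println(queue.offer(1)); // true
        System.out.println(queue.offer(2)); // false, threshold reached
        System.out.println(queue.poll());   // 1, the smallest element comes out first
    }
}
```

When this queue backs a ThreadPoolExecutor, the tasks must be Comparable (or the queue needs a Comparator), so they should be passed to execute() as Comparable Runnable objects.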

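Starvation can be addressed by design, for example by periodically removing tasks that have waited too long and re-inserting them with a higher priority.

In production, weigh task priority against execution time and design accordingly.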

Future#

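What is Future used for?#

The Future class is a typical application of the asynchronous approach. It is mainly used for long-running tasks, so that the program does not sit idle waiting for the result. A time-consuming task is handed to a worker thread, the caller carries on with other work, and once that is done it obtains the result from the Future. This is the classic Future pattern in multithreading.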
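The pattern described above can be sketched as follows. The summation task is a stand-in for real long-running work, and the class name and the 5-second timeout are arbitrary choices for this example.

```java
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class FutureDemo {

    /** Hands a computation to a worker thread and fetches the result through a Future. */
    public static int compute() {
        ExecutorService executor = Executors.newSingleThreadExecutor();
        try {
            // submit() returns immediately; the task runs on the worker thread.
            Future<Integer> future = executor.submit(() -> {
                int sum = 0;
                for (int i = 1; i <= 100; i++) {
                    sum += i;
                }
                return sum;
            });
            // The caller is free to do other work here.
            return future.get(5, TimeUnit.SECONDS); // blocks until the result is ready
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        } catch (ExecutionException | TimeoutException e) {
            throw new IllegalStateException(e);
        } finally {
            executor.shutdown();
        }
    }

    public static void main(String[] args) {
        System.out.println(compute()); // prints 5050
    }
}
```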

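CompletableFuture, introduced in Java 8, addresses the shortcomings of Future. Beyond a more convenient and more powerful Future, it supports functional programming and the orchestration and composition of asynchronous tasks (several asynchronous tasks can be chained into a pipeline of calls).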

public class CompletableFuture<T> implements Future<T>, CompletionStage<T> {
}

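As the declaration shows, CompletableFuture implements both the Future and the CompletionStage interfaces.

CompletionStage represents one stage of an asynchronous computation. Many computations can be split into several stages, which are then combined into an asynchronous pipeline.

CompletionStage declares a large number of methods, and CompletableFuture gets its functional capabilities from this interface. Many of these methods take the functional interfaces introduced in Java 8 as parameters.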
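To make the idea of stages concrete, the sketch below chains and combines two asynchronous computations using thenApply and thenCombine, both declared on CompletionStage. The values and the class name are purely illustrative.

```java
import java.util.concurrent.CompletableFuture;

public class CompletableFutureDemo {

    /** Builds a small pipeline: two independent stages, one transformation, one combination. */
    public static String greet() {
        // Stage 1: produce a name asynchronously.
        CompletableFuture<String> name = CompletableFuture.supplyAsync(() -> "world");
        // Stage 2: produce a number, then transform it in a dependent stage.
        CompletableFuture<Integer> answer =
                CompletableFuture.supplyAsync(() -> 6).thenApply(x -> x * 7);
        // Stage 3: runs once both earlier stages have completed.
        return name.thenCombine(answer, (n, a) -> "hello " + n + " " + a).join();
    }

    public static void main(String[] args) {
        System.out.println(greet()); // prints "hello world 42"
    }
}
```

Each lambda is passed as a standard functional interface (Supplier, Function, BiFunction), which is where the functional style mentioned above comes from.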

AQS#

What is AQS?#

AQS stands for AbstractQueuedSynchronizer and lives in the java.util.concurrent.locks package.

AQS is an abstract class used mainly to build locks and synchronizers.

public abstract class AbstractQueuedSynchronizer extends AbstractOwnableSynchronizer implements java.io.Serializable {
}

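AQS provides the common functionality needed to build locks and synchronizers, so a wide range of synchronizers can be built on it simply and efficiently. ReentrantLock, Semaphore, ReentrantReadWriteLock, and CountDownLatch are all based on AQS.

How does AQS work?#

The core idea of AQS is: if the requested shared resource is free, the requesting thread becomes the active worker thread and the resource is marked as locked. If the resource is already held, the requesting thread is blocked and queued, and is woken up when the resource is released. This mechanism is implemented with a variant of the CLH (Craig, Landin, and Hagersten) queue lock: threads that cannot acquire the lock are appended to the queue.

The CLH queue is a virtual doubly linked queue (no queue object exists, only the links between nodes). Each node represents one waiting thread and stores a reference to the thread, the node's wait status, its predecessor, and its successor.

[Figure: structure of the CLH queue]

[Figure: core principle of AQS]

AQS keeps the synchronization state in an int field named state and uses its built-in wait queue to park threads that are waiting for the resource.

state is declared volatile and indicates how the critical resource is currently held.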

// Shared variable, declared volatile to guarantee visibility across threads
private volatile int state;

The state can be read and updated through the protected methods getState(), setState(), and compareAndSetState(). All three are final, so subclasses cannot override them.

// Returns the current value of the synchronization state
protected final int getState() {
     return state;
}
// Sets the value of the synchronization state
protected final void setState(int newState) {
     state = newState;
}
// Atomically sets the state to update if the current value equals expect
protected final boolean compareAndSetState(int expect, int update) {
      return unsafe.compareAndSwapInt(this, stateOffset, expect, update);
}

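Take ReentrantLock as an example. state starts at 0, meaning unlocked. When thread A calls lock(), tryAcquire() takes the lock exclusively and sets state to 1. From then on, tryAcquire() fails for every other thread; they cannot acquire the lock until A calls unlock() and state returns to 0. Before releasing, A itself can acquire the lock again, and state is incremented each time; this is what reentrancy means. The lock must be released as many times as it was acquired for state to return to 0.

Take CountDownLatch as another example. A task is split across N worker threads and state is initialized to N. Each worker calls countDown() once when it finishes, which decrements state by 1 with CAS. When all workers are done and state reaches 0, the thread waiting in await() is woken up and continues.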
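To show how a synchronizer uses state in practice, here is a minimal non-reentrant mutex built on AQS, loosely following the pattern in the AbstractQueuedSynchronizer Javadoc. It is a sketch, not how ReentrantLock is implemented; the class name SimpleMutex and the demo method are made up for illustration.

```java
import java.util.concurrent.locks.AbstractQueuedSynchronizer;

public class SimpleMutex {

    /** state == 0 means unlocked, state == 1 means locked. */
    private static final class Sync extends AbstractQueuedSynchronizer {
        private static final long serialVersionUID = 1L;

        @Override
        protected boolean tryAcquire(int ignored) {
            if (compareAndSetState(0, 1)) { // CAS 0 -> 1 takes the lock
                setExclusiveOwnerThread(Thread.currentThread());
                return true;
            }
            return false; // AQS queues and parks the caller
        }

        @Override
        protected boolean tryRelease(int ignored) {
            if (getState() == 0) {
                throw new IllegalMonitorStateException();
            }
            setExclusiveOwnerThread(null);
            setState(0); // only the owner gets here, so a plain write is enough
            return true;
        }

        boolean isLocked() {
            return getState() != 0;
        }
    }

    private final Sync sync = new Sync();

    public void lock() { sync.acquire(1); }
    public boolean tryLock() { return sync.tryAcquire(1); }
    public void unlock() { sync.release(1); }
    public boolean isLocked() { return sync.isLocked(); }

    /** Four threads increment a shared counter 1000 times each under the mutex. */
    public static int demo() {
        SimpleMutex mutex = new SimpleMutex();
        int[] counter = {0};
        Thread[] threads = new Thread[4];
        for (int i = 0; i < threads.length; i++) {
            threads[i] = new Thread(() -> {
                for (int j = 0; j < 1000; j++) {
                    mutex.lock();
                    try {
                        counter[0]++;
                    } finally {
                        mutex.unlock();
                    }
                }
            });
            threads[i].start();
        }
        for (Thread t : threads) {
            try {
                t.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
        return counter[0];
    }

    public static void main(String[] args) {
        System.out.println(demo()); // prints 4000
    }
}
```

Because tryAcquire() does not check the owner thread, a second lock() from the same thread would block; a reentrant lock would instead increment state for the owner, as described above.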

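What is Semaphore used for?#

synchronized and ReentrantLock allow only one thread at a time to access a resource. Semaphore, by contrast, controls how many threads may access a particular resource at the same time.

Semaphore is simple to use: it limits how many of the threads competing for a shared resource can hold it at the same time.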

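// Initial number of permits (shared resources): 5
final Semaphore semaphore = new Semaphore(5);
semaphore.acquire();  // acquire one permit, blocking until one is available
semaphore.release();  // release one permit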

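With an initial value of 5, at most 5 threads can hold a permit at the same time; any further threads block until a permit is released. Semaphore supports a fair mode and a non-fair mode, selected through the constructor; the default is non-fair.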
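The limiting effect can be checked with a small experiment. In this sketch (class name, task count, and sleep time are arbitrary), each task records how many tasks hold a permit at the same moment, and the method returns the peak it observed.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Semaphore;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

public class SemaphoreDemo {

    /** Runs the given number of tasks and returns the highest number seen holding a permit at once. */
    public static int maxConcurrent(int permits, int tasks) {
        Semaphore semaphore = new Semaphore(permits);
        AtomicInteger active = new AtomicInteger();
        AtomicInteger peak = new AtomicInteger();
        ExecutorService pool = Executors.newFixedThreadPool(tasks);
        for (int i = 0; i < tasks; i++) {
            pool.execute(() -> {
                try {
                    semaphore.acquire(); // blocks while all permits are taken
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    return;
                }
                try {
                    int now = active.incrementAndGet();
                    peak.accumulateAndGet(now, Math::max);
                    Thread.sleep(20); // simulate work on the shared resource
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                } finally {
                    active.decrementAndGet();
                    semaphore.release(); // always give the permit back
                }
            });
        }
        pool.shutdown();
        try {
            pool.awaitTermination(10, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return peak.get();
    }

    public static void main(String[] args) {
        // The result never exceeds the number of permits (3 here).
        System.out.println(maxConcurrent(3, 10));
    }
}
```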

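What is CountDownLatch?#

CountDownLatch lets one or more threads block at a point until count threads have all finished their work. CountDownLatch is single-use: the count is set at construction and cannot be reset afterwards, so a latch cannot be used again once it has counted down.

How does CountDownLatch work?#

CountDownLatch is built on the shared mode of AQS. state is initialized to count. Each call to countDown() decrements state by 1 with CAS, and when state reaches 0 the waiting threads are released. await() blocks until state is 0.

A CountDownLatch example#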

import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class CountDownLatchExample1 {
    // Number of files to process
    private static final int threadCount = 6;

    public static void main(String[] args) throws InterruptedException {
        // Create a fixed-size thread pool
        ExecutorService threadPool = Executors.newFixedThreadPool(10);
        final CountDownLatch countDownLatch = new CountDownLatch(threadCount);
        for (int i = 0; i < threadCount; i++) {
            final int threadnum = i;
            threadPool.execute(() -> {
                try {
                    // Business logic for processing file number threadnum
                    //......
                } catch (Exception e) {
                    // Catch Exception: an empty try block cannot throw the checked InterruptedException
                    e.printStackTrace();
                } finally {
                    // Signal that one file has been processed
                    countDownLatch.countDown();
                }

            });
        }
        countDownLatch.await();
        threadPool.shutdown();
        System.out.println("finish");
    }
}

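One possible improvement is to use CompletableFuture. Java 8's CompletableFuture offers many methods that make multithreaded code friendlier: running tasks asynchronously, chaining them, composing them, and waiting for all of them to finish are all easy to express.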

// runAsync is used because the tasks return no value (CompletableFuture<Void>)
CompletableFuture<Void> task1 =
    CompletableFuture.runAsync(()->{
        // custom business logic
    });
......
CompletableFuture<Void> task6 =
    CompletableFuture.runAsync(()->{
    // custom business logic
    });
......
CompletableFuture<Void> headerFuture=CompletableFuture.allOf(task1,.....,task6);

try {
    headerFuture.join();
} catch (Exception ex) {
    //......
}
System.out.println("all done. ");

This code can be improved further. With many tasks, listing each one by hand is impractical; the tasks can be added in a loop instead.

// File paths
List<String> filePaths = Arrays.asList(...);
// Process all files asynchronously
List<CompletableFuture<String>> fileFutures = filePaths.stream()
    .map(filePath -> doSomeThing(filePath))
    .collect(Collectors.toList());
// Combine them into one future
CompletableFuture<Void> allFutures = CompletableFuture.allOf(
    fileFutures.toArray(new CompletableFuture[fileFutures.size()])
);
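For completeness, here is a self-contained variant of the loop-based approach that also waits for allOf and collects the results. The processFile method is a hypothetical stand-in for the real per-file work (it only upper-cases the path), and the class name is likewise invented.

```java
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.stream.Collectors;

public class AllOfDemo {

    /** Hypothetical per-file task; real code would read and process the file. */
    static CompletableFuture<String> processFile(String path) {
        return CompletableFuture.supplyAsync(() -> path.toUpperCase());
    }

    /** Starts one task per path, waits for all of them, and returns the results in input order. */
    public static List<String> processAll(List<String> paths) {
        List<CompletableFuture<String>> futures = paths.stream()
                .map(AllOfDemo::processFile)
                .collect(Collectors.toList());
        CompletableFuture<Void> all =
                CompletableFuture.allOf(futures.toArray(new CompletableFuture[0]));
        // Once "all" completes, every join() below returns immediately.
        return all.thenApply(v -> futures.stream()
                        .map(CompletableFuture::join)
                        .collect(Collectors.toList()))
                .join();
    }

    public static void main(String[] args) {
        System.out.println(processAll(Arrays.asList("a.txt", "b.txt"))); // prints [A.TXT, B.TXT]
    }
}
```

allOf itself yields a CompletableFuture<Void>, so the individual results still have to be read from the original futures.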

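What is CyclicBarrier used for?#

CyclicBarrier is very similar to CountDownLatch: it also makes threads wait for one another, but it is more complex and more powerful.

CountDownLatch is implemented directly on AQS, whereas CyclicBarrier is built on ReentrantLock (itself an AQS-based synchronizer) and Condition.

CyclicBarrier literally means a barrier that can be reused cyclically. Each thread in a group blocks when it reaches the barrier (also called a synchronization point); when the last thread arrives, the barrier opens and all blocked threads continue.

How does CyclicBarrier work?#

Internally, CyclicBarrier uses a count variable as a counter. It is initialized to parties, and each thread that calls await() decrements it by 1. When count reaches 0, the last thread to arrive runs the barrier action passed to the constructor, if there is one.

The basic constructor CyclicBarrier(int parties) takes parties, the number of threads the barrier intercepts. Each thread calls await() to signal that it has reached the barrier.
await() delegates to dowait(false, 0L), which blocks the calling thread until all parties have arrived. At that point the barrier opens and the waiting threads are released.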
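The behaviour described above, including the reuse that gives CyclicBarrier its name, can be sketched as follows. The class name and the counters are illustrative; the barrier action simply counts how many times the barrier has opened.

```java
import java.util.concurrent.BrokenBarrierException;
import java.util.concurrent.CyclicBarrier;
import java.util.concurrent.atomic.AtomicInteger;

public class CyclicBarrierDemo {

    /** Returns {total arrivals, number of times the barrier opened}. */
    public static int[] run(int parties, int rounds) {
        AtomicInteger arrivals = new AtomicInteger();
        AtomicInteger trips = new AtomicInteger();
        // The barrier action runs once per round, in the last thread to arrive.
        CyclicBarrier barrier = new CyclicBarrier(parties, () -> trips.incrementAndGet());
        Thread[] workers = new Thread[parties];
        for (int i = 0; i < parties; i++) {
            workers[i] = new Thread(() -> {
                try {
                    for (int r = 0; r < rounds; r++) {
                        arrivals.incrementAndGet();
                        barrier.await(); // wait until all parties have arrived
                    }
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                } catch (BrokenBarrierException e) {
                    // Another party was interrupted or timed out; stop this worker.
                }
            });
            workers[i].start();
        }
        for (Thread worker : workers) {
            try {
                worker.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
        return new int[] {arrivals.get(), trips.get()};
    }

    public static void main(String[] args) {
        int[] result = run(3, 2);
        System.out.println(result[0] + " arrivals, " + result[1] + " trips"); // prints "6 arrivals, 2 trips"
    }
}
```

The same barrier instance serves both rounds: after it opens, the counter is reset to parties automatically, which a CountDownLatch cannot do.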

For the rest of this topic and further details, see the official documentation.


Java Concurrent Programming

https://dreaife.tokyo/posts/java-concurrency-guide/

Author

dreaife

Published

2024-01-30

License

CC BY-NC-SA 4.0

Some information may be out of date
