Post

Java -- Getting Started

Setting up Java in MacOS

MacOS is notorious for the lack of transparency of how everything works. Java is no exception. With a brand new MacBook, you have a few pre-installed commands: /usr/bin/java, /usr/bin/javac, /usr/libexec/java_home and etc. Do not try to figure out the source code of these commands. They are built-in scripts written in Objective-C or Swift using Apple’s Foundation framework. Its source code is not publicly available. You can use otool -L to list the shared libs used by them. Most likely, you can find /System/Library/PrivateFrameworks in the output.

The first step is installing a jdk. Let’s do it with brew install openjdk. The output has a caveat section.

1
2
3
==> Caveats
For the system Java wrappers to find this JDK, symlink it with
  sudo ln -sfn /opt/homebrew/opt/openjdk/libexec/openjdk.jdk /Library/Java/JavaVirtualMachines/openjdk.jdk

To understand this caveat, we need to understand how MacOS finds the correct version of Java. /usr/bin/java is a shim command, and from my observation/guess, it chooses the java version in two steps:

  1. If JAVA_HOME is set, then it delegates to $JAVA_HOME/bin/java.
  2. If JAVA_HOME is absent, then it calls /usr/libexec/java_home to find the most appropriate java home.

How does /usr/libexec/java_home find the available java homes? Again, from the information I collected online and my guess, it searches a few root folders: /Library/Java/JavaVirtualMachines/, /System/Library/Java/JavaVirtualMachines/ and ~/Library/Java/JavaVirtualMachines/. Every sub folder of them potentially contains a valid java jdk folder. /usr/libexec/java_home looks for a file called Contents/Info.plist which contains meta data about this jdk version. Then it sorts them by version reversely and the highest version would be used for JAVA_HOME if JAVA_HOME is not explicitly specified. For example,

1
2
3
4
5
6
7
8
9
/usr/libexec/java_home -V
Matching Java Virtual Machines (6):
    23.0.2 (arm64) "Homebrew" - "OpenJDK 23.0.2" /opt/homebrew/Cellar/openjdk/23.0.2/libexec/openjdk.jdk/Contents/Home
    19 (arm64) "Oracle Corporation" - "OpenJDK 19" /Users/xiongding/Library/Java/JavaVirtualMachines/openjdk-19/Contents/Home
    18.0.2.1 (arm64) "Eclipse Adoptium" - "OpenJDK 18.0.2.1" /Users/xiongding/Library/Java/JavaVirtualMachines/temurin-18.0.2.1/Contents/Home
    17.0.4 (arm64) "Amazon.com Inc." - "Amazon Corretto 17" /Users/xiongding/Library/Java/JavaVirtualMachines/corretto-17.0.4.1/Contents/Home
    15.0.10 (arm64) "Azul Systems, Inc." - "Zulu 15.46.17" /Users/xiongding/Library/Java/JavaVirtualMachines/azul-15.0.10/Contents/Home
    13.0.14 (arm64) "Azul Systems, Inc." - "Zulu 13.54.17" /Users/xiongding/Library/Java/JavaVirtualMachines/azul-13.0.14/Contents/Home
/opt/homebrew/Cellar/openjdk/23.0.2/libexec/openjdk.jdk/Contents/Home

It shows I have a few versions of jdk installed and openjdk/23.0.2 will be used for /usr/bin/java if JAVA_HOME is not specified. See result below.

1
2
3
4
5
6
7
8
9
$ JAVA_HOME='' /usr/bin/java --version
openjdk 23.0.2 2025-01-21
OpenJDK Runtime Environment Homebrew (build 23.0.2)
OpenJDK 64-Bit Server VM Homebrew (build 23.0.2, mixed mode, sharing)

$ JAVA_HOME='/Users/xiongding/Library/Java/JavaVirtualMachines/azul-13.0.14/Contents/Home' /usr/bin/java --version
openjdk 13.0.14 2023-01-17
OpenJDK Runtime Environment Zulu13.54+17-CA (build 13.0.14+5-MTS)
OpenJDK 64-Bit Server VM Zulu13.54+17-CA (build 13.0.14+5-MTS, mixed mode)

Let’s come back to the brew install message above. Homebrew is conservative. It installs jdk inside folder /opt/homebrew/opt, so it is user’s responsibility to soft link it to the /Library/Java/JavaVirtualMachines/. I think most likely we would put it under the home library folder not the root system folder.

1
ln -sfn /opt/homebrew/opt/openjdk/libexec/openjdk.jdk ~/Library/Java/JavaVirtualMachines/openjdk.jdk

To sum up, Setting up Java in MacOS needs three steps

1
2
3
brew install openjdk
ln -sfn /opt/homebrew/opt/openjdk/libexec/openjdk.jdk ~/Library/Java/JavaVirtualMachines/openjdk.jdk
echo 'export JAVA_HOME=/opt/homebrew/opt/openjdk/libexec/openjdk.jdk/Contents/Home/' >> ~/.bashrc

Reading Openjdk

openjdk is the starting point. However, IntelliJ fails to resolve symbols when open it directly. Actually, openjdk team provides some guidance on how to set up the repo in IntelliJ. See doc.

1
2
bash configure
bash bin/idea.sh

I encountered a few errors when running above two commands in a Macbook M1. In the configure step,

1
configure: error: XCode tool 'metal' neither found in path nor with xcrun

This post helps. Basically, we need to change the active developer directory of xcode. We can change it back once the configuration stage is done.

The second step requires ant: brew install ant.

However, after all these steps, it still does not work. Finally, I use a simple trick: File -> Project Structure -> Modules -> Add an existing JDK.

Debug

I am testing some Kafka Connect plugin recently. Instead of using print to debug, I really wish Java can provide some functionality similar to a dynamic language such as I insert a breakpoint() in the code, then the running process will jump to a REPL shell.

Then I realize Java provides remote debugging, or more specifically, JDWP (Java Debug Wire Protocol). Basically, when we start the Java program, we add some command line arguments to open some socket to allow remote debugger to attach. For the case of Kafka Connect, we can enable it by setting the KAFKA_DEBUG environment variable. See code.

1
-agentlib:jdwp=transport=dt_socket,server=y,address=5005

server=y means that it starts a server and waits for client to connect. server=n has reverse meaning.

Then we can use jdb to connect to this process

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
$ jdb -attach 127.0.0.1:5005

Initializing jdb ...

> help
** command list **
connectors                -- list available connectors and transports in this VM

run [class [args]]        -- start execution of application's main class

threads [threadgroup]     -- list threads in threadgroup. Use current threadgroup if none specified.
thread <thread id>        -- set default thread
suspend [thread id(s)]    -- suspend threads (default: all)
resume [thread id(s)]     -- resume threads (default: all)
where [<thread id> | all] -- dump a thread's stack
wherei [<thread id> | all]-- dump a thread's stack, with pc info
up [n frames]             -- move up a thread's stack
down [n frames]           -- move down a thread's stack
kill <thread id> <expr>   -- kill a thread with the given exception object
interrupt <thread id>     -- interrupt a thread
...

It is very similar to gdb. You can set breakpoint, print out thread stack trace. And I think it has better support for debugging multi-thread program.

A few notes about breakpoints. First, we must use full class path such as stop at org.opensearch.cluster.ClusterState:706. Second, for inner class, we should do stop at <full_class_path>$<inner_class_name>:line_number. For example, stop at org.opensearch.cluster.ClusterState$Builder:706. You can inspect the outer class by class <full_class_path>.

Cross compilation

javac has three options related to cross compilation: -source, -target and -releaes. JEP 247 has concise description about their usage.

This post is licensed under CC BY 4.0 by the author.