30 August 2006
I've been trying to tune a JBoss instance on a relatively low-memory system (192M physical RAM) for a long time now. I think my biggest problem was that the Java heap space would get swapped out to disk by the OS, and then GC of any sort would degrade to taking tens of seconds and at times minutes! For my set of applications, the most effective switches I had found were:
It took lots of trial and error to determine these numbers. Java 5 GC Ergonomics advertises that it should help eliminate the need to do all this tuning, so I thought I'd give it a try.
It seems that you need to be using the Throughput collector (
-XX:+UseParallelGC) to get adaptive sizing automatically enabled. I then removed my manual heap sizing options completely, and set the ergonomic options:
-XX:MaxGCPauseMillis=100 -XX:MaxGCMinorPauseMillis=100 -XX:GCTimeRatio=9. I chose a "high" GCTimeRatio (lesser than the 19 from other examples), since I knew that footprint is probably the biggest issue for my server. I figured if I made it easy to hit the GCTimeRatio and the maximum pauses, it could shrink the heap even smaller. I'm testing this all right now -- it seems to take a long time for performance to settle into its worst states (12-24 hours). Repeatedly spidering and touching all the apps (while also spawning lots of sessions) just doesn't seem to kick off the problems like I'd expect. It may ultimately be caused by other heavy nightly jobs that run on the box, so I just have to wait.
After 4 hours of testing the latest configuration, the server's settled into using only about 41M or heap space, of which only 640K is set aside for the young generation. I'm hoping that the JVM is able to recognize when GC times go long due to the OS swappig out heap and is automatically keeping it below that threshold. We'll see how it does in the longer term.
Update (30 August 2006): The server's been up for a week straight. I've redeployed a WebWork application a couple times, so the memory usage creeped up a bit, but it's now sitting around 46M. I get occassional long GCs (~10 seconds), but they're not common.
I looked up the proxy error I see in Apache occassionally, and it may be an issue with keepalive connections between Apache and JBoss. I'll have to read more about that and see if any of the tips alleviate it. I had always assumed this was caused the JBoss server going away.
I must say that I've been pleased with its behavior so far, and my java command line is shorter than it was -- that's always a good thing.