An approach for Logging in Spark jobs
Spark website provides three options for using a custom log4j configuration for logging. Here I am explaining about the 3rd option
Spark website provides three options for using a custom log4j configuration for logging. Here I am explaining about the 3rd option
Git is a very popular Distributed Version Control System. It has Become the choice for most of the open source projects due its simple yet powerful features. There are many online articles to get started with Git, but if you are in a hurry, here I am explaining how to set up a Eclipse project… Read More Setup Eclipse project with GitHub
Listing out some of the useful Java debugging tools jinfo prints the system properties or the command line flags that were used to start the VM. /opt/sun/jdk/java/bin/jinfo jinfo jinfo Link: http://java.sun.com/j2se/1.5.0/docs/tooldocs/share/jinfo.html jmap prints memory related statistics for a running VM or core file /opt/sun/jdk/java/bin/jmap jmap -heap… Read More Java Debugging Tools
JProbe is a licensed software from Quest. Its a really powerful tool to do Run-time performance analysis of Java applications. We had a special requirement on a use case to be completed within 25 seconds in every execution. However for about 10-20% of times it was exceeding 25s requirement. We have decided to do some… Read More Performance Analysis using JProbe