Configuring and Using LiSA

Note:
This website describes LiSA’s architecture and provides guides on how to use and extend it. It is intended to be valid for the latest stable release of LiSA, but should be compatible with versions 0.2 and later. Signatures or packages might differ in older versions, but the overall architecture and design principles should remain the same.

Using and configuring LiSA is straightforward: first, a Program (or more programs, one for each programming language to analyze) must be obtained, then a LiSAConfiguration object must be created and customized, and finally a LiSA instance must be created with the configuration and run on the program(s). For example:

Program program = ... // use a frontend to parse the code, or build the program programmatically
LiSAConfiguration conf = new LiSAConfiguration();
// set configuration options (see below)
LiSA lisa = new LiSA(conf);
LiSAReport report = lisa.run(program);
// use the report

Each configuration option can be set individually by changing the value of a field of the LiSAConfiguration object passed to the LiSA constructor. In the following sections, when an example usage is provided for a configuration option, it is assumed that the LiSAConfiguration object is stored in a variable named conf.

Available Options

Setting the Abstract Domain

Option Name: analysis

Option Type: AbstractDomain<?>

Default Value: null

Example usage:

conf.analysis = new SimpleAbstractDomain(new PointBasedHeap(), new Interval(), new InferredTypes())

The Abstract Domain to execute during the analysis can be selected through the analysis option. The value of this option decides which analysis is run, and what shape the computed states will have. If no value is set for this option, no semantic analysis will be executed.

Built-in Abstract Domain implementations

HistoryDomain [Source code]
An abstract domain that tracks the history of fixpoint iterations as a HistoryState.

Reachability [Source code]
An abstract domain that tracks the reachability of program points, exploiting an underlying abstract domain to (i) compute approximations of the program state, and (ii) deduce which branches are taken after traversing a guard.

SimpleAbstractDomain [Source code]
An abstract domain that combines a heap, a value, and a type domain into a single abstract domain of type SimpleAbstractState. The interaction between heap and value/type domains follows the one defined in this paper.

TracePartitioning [Source code]
The trace partitioning abstract domain that splits execution traces to increase precision of the analysis. Individual traces are identified by ExecutionTraces composed of tokens representing the conditions traversed during the analysis. Note that all TraceTokens represent intraprocedural control-flow constructs, as calls are abstracted away before reaching this domain. Traces are never merged: instead, we limit the size of the traces we can track, and we leave the choice of when and where to compact traces to other analysis components. Specifically, an ExecutionTrace will contain at most max_conditions Branching tokens, and will track at most max_loop_iterations iterations for each loop (through LoopIteration tokens) before summarizing the next ones with a LoopSummary token. Both values are editable and customizable before the analysis starts.

If you adopt the Simple Abstract Domain framework to build your own abstract domain, LiSA also provides alternatives for each component.

Built-in Heap Domain implementations

NoOpHeap [Source code]
A no-op heap domain that uses SingleHeapLattice as lattice structure. This is useful in analyses where heap information is not relevant or when a placeholder is needed. Note that this domain never produces substitutions, and rewrite operations will always return the input expression wrapped in an ExpressionSet.

MonolithicHeap [Source code]
A monolithic heap implementation that abstracts all heap locations to a unique identifier.

TypeBasedHeap [Source code]
A type-based heap implementation that abstracts heap locations depending on their types, i.e., all the heap locations with the same type are abstracted into a single unique identifier.

FieldSensitivePointBasedHeap [Source code]
A field-sensitive program point-based AllocationSiteBasedAnalysis. The implementation follows X. Rival and K. Yi, "Introduction to Static Analysis: An Abstract Interpretation Perspective", Section 8.3.4

PointBasedHeap [Source code]
A field-insensitive program point-based AllocationSiteBasedAnalysis. The implementation follows X. Rival and K. Yi, "Introduction to Static Analysis An Abstract Interpretation Perspective", Section 8.3.4
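
To give a feel for how the heap domains above differ in granularity, the following sketch maps a few concrete allocations to abstract identifiers in three ways. It is a simplified illustration that does not use LiSA's classes: the Allocation class, the HeapAbstractionSketch class, and the identifier strings are all hypothetical.

```java
import java.util.List;
import java.util.Set;
import java.util.TreeSet;
import java.util.function.Function;

final class HeapAbstractionSketch {
    // Hypothetical description of an allocation: where it happens and what it creates.
    static final class Allocation {
        final String site; // e.g. "Main.java:3"
        final String type; // e.g. "List"

        Allocation(String site, String type) {
            this.site = site;
            this.type = type;
        }
    }

    // Monolithic: every heap location collapses into one identifier.
    static final Function<Allocation, String> MONOLITHIC = a -> "heap";
    // Type-based: one identifier per allocated type.
    static final Function<Allocation, String> TYPE_BASED = a -> "heap[" + a.type + "]";
    // Point-based: one identifier per allocation site.
    static final Function<Allocation, String> POINT_BASED = a -> "heap@" + a.site;

    // Collects the distinct abstract identifiers produced by an abstraction.
    static Set<String> abstractIds(List<Allocation> allocs, Function<Allocation, String> abstraction) {
        Set<String> ids = new TreeSet<>();
        for (Allocation a : allocs)
            ids.add(abstraction.apply(a));
        return ids;
    }

    public static void main(String[] args) {
        List<Allocation> allocs = List.of(
                new Allocation("Main.java:3", "List"),
                new Allocation("Main.java:7", "List"),
                new Allocation("Main.java:9", "Map"));
        System.out.println(abstractIds(allocs, MONOLITHIC));
        System.out.println(abstractIds(allocs, TYPE_BASED));
        System.out.println(abstractIds(allocs, POINT_BASED));
    }
}
```

With three allocations of two types, the sketch yields one, two, and three identifiers respectively: more identifiers generally mean more precision at a higher cost.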

Built-in Value Domain implementations

ConstantValuePropagation [Source code]
A non-relational value domain tracking ConstantValues of variables for numeric, character, string and boolean values.

NoOpValues [Source code]
A no-op value domain that uses SingleValueLattice as lattice structure. This is useful in analyses where value information is not relevant or when a placeholder is needed.

WholeValueAnalysis [Source code]
The constraint-based whole-value analysis between a non-relational Boolean abstract domain, a non-relational numeric abstract domain, and a non-relational string abstract domain. This domain tracks environments of whole-value elements, which are values of one of the types produced by the client domains.

SmashedSum [Source code]
The smashed-sum abstract domain between BooleanPowerset, a non-relational numeric abstract domain, and a non-relational string abstract domain. This domain tracks environments of smashed values, which are values of one of the types produced by the client domains.

AvailableExpressions [Source code]
An implementation of the available expressions dataflow analysis, that focuses only on the expressions that are stored into some variable.

ConstantPropagation [Source code]
An implementation of the overflow-insensitive constant propagation dataflow analysis, that focuses only on integers.

Liveness [Source code]
An implementation of the liveness dataflow analysis, that determines which values might be used later on in the program.

ReachingDefinitions [Source code]
An implementation of the reaching definition dataflow analysis.

NonInterference [Source code]
Implementation of the non interference analysis, using annotations to detect low confidentiality variables/fields/functions (LOW_CONF_ANNOTATION) and high integrity variables/fields/functions (HIGH_INT_ANNOTATION).

ThreeLevelsTaint [Source code]
A BaseTaint implementation with three levels of taintedness: clean, tainted and top. As such, this class distinguishes values that are always clean, always tainted, or tainted in at least one execution path.

TwoLevelsTaint [Source code]
A BaseTaint implementation with only two levels of taintedness: clean and tainted. As such, this class distinguishes values that are always clean from values that are tainted in at least one execution path.

NonRedundantPowerset [Source code]
A ValueDomain that computes NonRedundantSetDomainLattice elements as the powerset of the elements of a given underlying lattice.

NonRelationalNonRedundantPowerset [Source code]
A NonRelationalValueDomain that computes NonRedundantSetLattice elements as the powerset of the elements of a given underlying lattice.

BooleanPowerset [Source code]
A NonRelationalValueDomain that tracks sets of boolean values in the environments it produces. Sets are represented as Satisfiability values.

IntegerConstantPropagation [Source code]
The overflow-insensitive basic integer constant propagation analysis, tracking if a certain integer value has constant value or not, implemented as a BaseNonRelationalValueDomain. The lattice structure used by this domain is IntegerConstant.

Interval [Source code]
The overflow-insensitive interval abstract domain, approximating integer values as the minimum integer interval containing them. It is implemented as a BaseNonRelationalValueDomain. The lattice structure of this domain is IntInterval.

NonRedundantIntervals [Source code]
An analysis computing finite non redundant powersets of IntIntervals, approximating integer values as a non redundant set of intervals.

Parity [Source code]
The overflow-insensitive Parity abstract domain, tracking if a numeric value is even or odd, implemented as a BaseNonRelationalValueDomain.

Pentagon [Source code]
Implementation of the pentagons analysis of this paper.

Sign [Source code]
The basic overflow-insensitive Sign abstract domain, tracking zero, strictly positive and strictly negative integer values, implemented as a BaseNonRelationalValueDomain.

Stability [Source code]
Implementation of the stability abstract domain (Stability paper). This domain computes per-variable numerical trends to infer stability, covariance and contravariance relations on program variables, exploiting an auxiliary domain of choice. This is implemented as an open product where the stability domain gathers information from the auxiliary one through boolean queries. Implementation-wise, this class is built as a product between a given ValueDomain aux and a ValueEnvironment trends of Trend instances, representing per-variable trends. Queries are carried out by the SemanticDomain.satisfies(DomainLattice, SymbolicExpression, ProgramPoint) operator invoked on aux.

UpperBounds [Source code]
Relational implementation of the upper bounds analysis of this paper.

BoundedStringSet [Source code]
A domain computing bounded set of strings, where the maximum number of elements is defined by max_size. If the number of elements exceeds this limit, the set is considered to be top. The domain is defined in this paper.

Bricks [Source code]
The bricks string abstract domain.

CharInclusion [Source code]
The character inclusion abstract domain.

Prefix [Source code]
The prefix string abstract domain.

StringConstantPropagation [Source code]
The string constant propagation abstract domain, tracking if a certain string value has constant value or not. Top and bottom cases for least upper bounds, widening and less or equals operations are handled by BaseLattice in BaseLattice.lub(L), BaseLattice.widening(L) and BaseLattice.lessOrEqual(L), respectively.

SubstringDomain [Source code]
The substring relational abstract domain, tracking relation between string expressions. This domain follows the one defined in this paper.

SubstringDomainWithConstants [Source code]
The substring relational abstract domain (see SubstringDomain) enriched with string constant propagation. This domain tracks the Cartesian product between Substrings and StringConstant. This domain follows the one defined in this paper.

Suffix [Source code]
The suffix string abstract domain.

FSA [Source code]
A class that represents the Finite State Automaton domain for strings, exploiting a SimpleAutomaton. Caution: the FSA domain is buggy and requires lots of resources, to the point where it might be hard to debug even on relatively small samples. Use with caution.

Tarsis [Source code]
A class that represents the Tarsis domain for strings, exploiting a RegexAutomaton.
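
As a small illustration of how two of the value domains listed above differ, the sketch below contrasts the least upper bound of a two-level and a three-level taint lattice, following the descriptions of TwoLevelsTaint and ThreeLevelsTaint. It is not LiSA's implementation: the enums and the lub methods are hypothetical, and bottom elements are left out for brevity.

```java
final class TaintLevelsSketch {
    // Two levels: "tainted" means tainted in at least one execution path.
    enum Two { CLEAN, TAINTED }

    // Three levels: "tainted" means always tainted, "top" means tainted in some path.
    enum Three { CLEAN, TAINTED, TOP }

    // Joining clean and tainted in the two-level lattice yields tainted.
    static Two lub(Two a, Two b) {
        return (a == Two.TAINTED || b == Two.TAINTED) ? Two.TAINTED : Two.CLEAN;
    }

    // Joining different elements in the three-level lattice yields top.
    static Three lub(Three a, Three b) {
        return a == b ? a : Three.TOP;
    }

    public static void main(String[] args) {
        // A value that is clean on one branch and tainted on the other:
        System.out.println(lub(Two.CLEAN, Two.TAINTED));     // TAINTED
        System.out.println(lub(Three.CLEAN, Three.TAINTED)); // TOP
        // A value that is tainted on both branches stays "always tainted":
        System.out.println(lub(Three.TAINTED, Three.TAINTED)); // TAINTED
    }
}
```

The three-level variant can thus tell "tainted on every path" apart from "tainted on some path", which the two-level variant merges.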

Built-in Type Domain implementations

NoOpTypes [Source code]
A no-op type domain that uses SingleTypeLattice as lattice structure. This is useful in analyses where type information is not relevant or when a placeholder is needed. Note that this domain cannot produce typing information: getRuntimeTypesOf(SingleTypeLattice, SymbolicExpression, ProgramPoint, SemanticOracle) always returns all possible types, and getDynamicTypeOf(SingleTypeLattice, SymbolicExpression, ProgramPoint, SemanticOracle) always returns Untyped.INSTANCE.

InferredTypes [Source code]
A NonRelationalTypeDomain holding a set of Types, representing the inferred runtime types of an Expression.

StaticTypes [Source code]
A NonRelationalTypeDomain that tracks the static type of variables, and that computes expression types using their static type. Typing information is thus deemed to be the set of all subtypes of the tracked type.

Interprocedural Analysis and Call Graph

Option Name: interproceduralAnalysis

Option Type: InterproceduralAnalysis<?, ?>

Default Value: null

Example usage:

conf.interproceduralAnalysis = new ModularWorstCaseAnalysis<>()

Option Name: callGraph

Option Type: CallGraph

Default Value: null

Example usage:

conf.callGraph = new CHACallGraph()

Option Name: openCallPolicy

Option Type: OpenCallPolicy

Default Value: TopExecutionPolicy.INSTANCE

Example usage:

conf.openCallPolicy = new CustomPolicy()

The Interprocedural Analysis and the Call Graph regulate how the program-wide analysis is executed, and how calls are resolved and computed. The value of the interproceduralAnalysis option determines the interprocedural analysis to use, while the value of the callGraph option determines the call graph. If no value is set for interproceduralAnalysis, no semantic analysis will be executed. The value of callGraph is effectively ignored if the selected interprocedural analysis does not require a call graph, as determined by that analysis’ needsCallGraph method. If instead the selected interprocedural analysis requires a call graph and no value is set for callGraph, an error will be raised at startup.

The Open Call Policy is used to determine the results of calls that have no targets in the program to analyze.
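
The rules relating interproceduralAnalysis and callGraph can be summarized by the following sketch. It only mirrors the decision logic described above and is not LiSA's startup code: the Interprocedural interface, the Outcome enum, the validate method, and the exception type are hypothetical stand-ins.

```java
final class StartupValidationSketch {
    // Hypothetical stand-in for an interprocedural analysis.
    interface Interprocedural {
        boolean needsCallGraph();
    }

    enum Outcome { NO_SEMANTIC_ANALYSIS, RUN_WITHOUT_CALL_GRAPH, RUN_WITH_CALL_GRAPH }

    static Outcome validate(Interprocedural interprocedural, Object callGraph) {
        if (interprocedural == null)
            return Outcome.NO_SEMANTIC_ANALYSIS; // nothing to run
        if (!interprocedural.needsCallGraph())
            return Outcome.RUN_WITHOUT_CALL_GRAPH; // callGraph is ignored, even if set
        if (callGraph == null)
            throw new IllegalStateException("the selected interprocedural analysis requires a call graph");
        return Outcome.RUN_WITH_CALL_GRAPH;
    }

    public static void main(String[] args) {
        System.out.println(validate(null, null));
        System.out.println(validate(() -> false, new Object()));
        System.out.println(validate(() -> true, new Object()));
        try {
            validate(() -> true, null);
        } catch (IllegalStateException e) {
            System.out.println("error at startup: " + e.getMessage());
        }
    }
}
```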

Built-in Interprocedural Analysis implementations

BackwardModularWorstCaseAnalysis [Source code]
A worst case modular analysis where all CFG calls are treated as open calls.

ModularWorstCaseAnalysis [Source code]
A worst case modular analysis where all CFG calls are treated as open calls.

ContextBasedAnalysis [Source code]
A context-sensitive interprocedural analysis. The context sensitivity is tuned by the number of most recent calls on the call stack to keep track of. This happens concretely in KDepthToken. Recursions are approximated by applying the iterates of the recursion starting from bottom and using the same widening threshold as CFG fixpoints.

BaseCasesFinder [Source code]
A recursion solver that applies a single iteration of the recursion starting from bottom and using top as entry state for the recursion. This is useful for understanding what is the return value in the base cases of the recursion: as the call that closes the recursion loop returns bottom, only the returns from the base cases will affect the result.

RecursionSolver [Source code]
A recursion solver that applies the iterates of the recursion starting from bottom. This solver operates by restarting the recursion from Recursion.getInvocation() a number of times, until the results of all the members stabilize.

InliningAnalysis [Source code]
An inlining-based interprocedural analysis. This means that each call receives its own result, with no "compacting" based on context or other technique: each call receives its own result that is uniquely determined by the call's entry state. Recursions are not supported: either they converge to a result, or the analysis (i) diverges if no maximum call stack depth is set through the constructor, or (ii) terminates with an exception when the maximum call stack depth has been reached.
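
The idea of tuning context sensitivity by the tail of the call stack, as described for ContextBasedAnalysis above, can be sketched as follows. This is a simplified illustration and not the code of KDepthToken: the class name, its methods, and the call-site strings are hypothetical, and k is assumed to be non-negative here.

```java
import java.util.ArrayList;
import java.util.List;

final class KDepthContextSketch {
    private final int k;
    private final List<String> callSites; // oldest first, most recent last

    private KDepthContextSketch(int k, List<String> callSites) {
        this.k = k;
        this.callSites = callSites;
    }

    static KDepthContextSketch empty(int k) {
        if (k < 0)
            throw new IllegalArgumentException("k must be non-negative in this sketch");
        return new KDepthContextSketch(k, new ArrayList<>());
    }

    // Returns a new context where only the k most recent call sites are kept.
    KDepthContextSketch push(String callSite) {
        List<String> next = new ArrayList<>(callSites);
        next.add(callSite);
        while (next.size() > k)
            next.remove(0); // forget the oldest call site
        return new KDepthContextSketch(k, next);
    }

    List<String> callSites() {
        return new ArrayList<>(callSites);
    }

    public static void main(String[] args) {
        KDepthContextSketch viaMain = empty(2).push("main:3").push("a:10").push("b:20");
        KDepthContextSketch viaTest = empty(2).push("test:7").push("a:10").push("b:20");
        System.out.println(viaMain.callSites()); // [a:10, b:20]
        // Both call chains end with the same two call sites, so they share a context.
        System.out.println(viaMain.callSites().equals(viaTest.callSites())); // true
    }
}
```

A larger k separates more call chains (more precision, more results to compute), while a smaller k merges them.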

Built-in Call Graph implementations

CHACallGraph [Source code]
A call graph constructed following the Class Hierarchy Analysis as defined in: Frank Tip and Jens Palsberg. 2000. Scalable propagation-based call graph construction algorithms. In Proceedings of the 15th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications (OOPSLA '00). Association for Computing Machinery, New York, NY, USA, 281–293. DOI:https://doi.org/10.1145/353171.353190

RTACallGraph [Source code]
A call graph constructed following the Rapid Type Analysis as defined in: Frank Tip and Jens Palsberg. 2000. Scalable propagation-based call graph construction algorithms. In Proceedings of the 15th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications (OOPSLA '00). Association for Computing Machinery, New York, NY, USA, 281–293. DOI:https://doi.org/10.1145/353171.353190

Built-in Open Call Policy implementations

ReturnTopPolicy [Source code]
An OpenCallPolicy where the post state is exactly the entry state, with the only difference of having the call's meta variable assigned to top only if the call returns a value. This variable, which is also stored as computed expression, represents the unknown result of the call, if any.

TopExecutionPolicy [Source code]
An OpenCallPolicy where the whole execution state becomes top and all information is lost. The return value, if any, is stored in the call's meta variable. No errors are assumed to be thrown.

WorstCasePolicy [Source code]
A worst-case OpenCallPolicy, where the whole analysis state becomes top and all information is lost. The return value, if any, is stored in the call's meta variable. All possible errors are assumed to be thrown.

Adding Syntactic and Semantic Checks

Option Name: syntacticChecks

Option Type: Collection<SyntacticCheck>

Default Value: new HashSet<>()

Example usage:

conf.syntacticChecks.add(new VariableNamesCheck())

Option Name: semanticChecks

Option Type: Collection<SemanticCheck<?, ?>>

Default Value: new HashSet<>()

Example usage:

conf.semanticChecks.add(new NullDereferenceCheck())

Syntactic Checks and Semantic Checks are visitors of the program under analysis. Syntactic checks only require syntactic information, and thus can be executed before the analysis, while semantic checks require semantic information, and thus can only be executed after the analysis. The collections of syntactic and semantic checks to execute can be customized through the syntacticChecks and semanticChecks options, respectively. The checks in these collections will be executed on the program under analysis, and the results of the checks will be included in the final report. Note that the order of execution of the checks is not guaranteed, and should not be relied upon. The collection of syntactic checks to execute should only be added to, and not replaced with other (possibly immutable) collections, as LiSA might add new checks depending on the values of other options. The same applies to the collection of semantic checks.

As of today, LiSA does not include any syntactic or semantic check implementations, as they are highly situational.
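
The advice to add to the check collections rather than replace them can be illustrated with plain Java collections. The sketch below does not use LiSA: the Config class, the string-based checks, and the frameworkAddsCheck method are hypothetical stand-ins for the configuration object and for LiSA registering a check of its own.

```java
import java.util.Collection;
import java.util.HashSet;
import java.util.Set;

final class CheckCollectionSketch {
    // Hypothetical stand-in for a configuration holding a mutable collection of checks.
    static final class Config {
        Collection<String> syntacticChecks = new HashSet<>();
    }

    // Stand-in for the framework registering an extra check depending on other options.
    static boolean frameworkAddsCheck(Config conf) {
        try {
            conf.syntacticChecks.add("internal-check");
            return true;
        } catch (UnsupportedOperationException e) {
            return false; // the collection was replaced with an immutable one
        }
    }

    public static void main(String[] args) {
        Config added = new Config();
        added.syntacticChecks.add("my-check"); // recommended: add to the existing collection

        Config replaced = new Config();
        replaced.syntacticChecks = Set.of("my-check"); // discouraged: immutable replacement

        System.out.println(frameworkAddsCheck(added));    // true
        System.out.println(frameworkAddsCheck(replaced)); // false
    }
}
```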

Thresholds for Widenings and GLBs

Option Name: wideningThreshold

Option Type: int

Default Value: 5

Example usage:

conf.wideningThreshold = 10

Option Name: recursionWideningThreshold

Option Type: int

Default Value: 5

Example usage:

conf.recursionWideningThreshold = 10

Option Name: glbThreshold

Option Type: int

Default Value: 5

Example usage:

conf.glbThreshold = 10

Fixpoint algorithms that use widenings and greatest lower bounds (glbs) can be customized by setting the thresholds for the application of the respective operators. wideningThreshold determines after how many iterations of the fixpoint algorithm on a given node the widening operator should be applied instead of the least upper bound operator (lub). recursionWideningThreshold determines after how many iterations of the fixpoint algorithm on a recursive call chain the widening operator should be applied instead of the least upper bound operator (lub). glbThreshold determines how many descending iterations of the fixpoint algorithm can be performed on a given node using the greatest lower bound operator (glb) before the descending iteration is stopped.

Setting these thresholds to 0 or a negative number causes the respective operator to be always applied (for widenings) or to never be applied (for glbs).
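
To make the effect of wideningThreshold concrete, the following sketch iterates the loop head of a counting loop (i = 0; while (...) i = i + 1;) over intervals, using lub until the threshold is reached and widening afterwards. It is a toy model under stated assumptions, not LiSA's fixpoint: the Interval class, the use of Long.MAX_VALUE as infinity, and the exact iteration at which widening starts are choices made for this illustration.

```java
final class WideningThresholdSketch {
    static final long INF = Long.MAX_VALUE; // stand-in for +infinity (and -INF for -infinity)

    static final class Interval {
        final long low, high;

        Interval(long low, long high) {
            this.low = low;
            this.high = high;
        }

        Interval lub(Interval o) {
            return new Interval(Math.min(low, o.low), Math.max(high, o.high));
        }

        // Standard interval widening: bounds that are still moving jump to infinity.
        Interval widening(Interval o) {
            return new Interval(o.low < low ? -INF : low, o.high > high ? INF : high);
        }

        // Abstract effect of i = i + 1, keeping infinite bounds unchanged.
        Interval plusOne() {
            return new Interval(low == -INF ? -INF : low + 1, high == INF ? INF : high + 1);
        }

        boolean sameAs(Interval o) {
            return low == o.low && high == o.high;
        }
    }

    // Returns how many iterations the loop head needs before its state stabilizes.
    static int iterationsToStabilize(int wideningThreshold) {
        Interval entry = new Interval(0, 0); // state after i = 0
        Interval head = entry;
        int iterations = 0;
        while (true) {
            Interval incoming = entry.lub(head.plusOne()); // loop entry joined with the back edge
            boolean widen = wideningThreshold <= 0 || iterations >= wideningThreshold;
            Interval merged = widen ? head.widening(incoming) : head.lub(incoming);
            iterations++;
            if (merged.sameAs(head))
                return iterations;
            head = merged;
        }
    }

    public static void main(String[] args) {
        System.out.println(iterationsToStabilize(0));
        System.out.println(iterationsToStabilize(5));
        System.out.println(iterationsToStabilize(10));
    }
}
```

In this model a threshold of n stabilizes after n + 2 iterations: n lub steps, one widening step that sends the upper bound to infinity, and one final step that confirms stability. Higher thresholds therefore trade time for the chance of reaching a precise fixpoint before widening kicks in.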

Selecting the Fixpoint Algorithms

Option Name: forwardFixpoint

Option Type: ForwardCFGFixpoint<?, ?>

Default Value: new ForwardAscendingFixpoint<>()

Example usage:

conf.forwardFixpoint = new CustomForwardFixpoint<>()

Option Name: forwardDescendingFixpoint

Option Type: ForwardCFGFixpoint<?, ?>

Default Value: null

Example usage:

conf.forwardDescendingFixpoint = new CustomForwardDescendingFixpoint<>()

Option Name: backwardFixpoint

Option Type: BackwardCFGFixpoint<?, ?>

Default Value: new BackwardAscendingFixpoint<>()

Example usage:

conf.backwardFixpoint = new CustomBackwardFixpoint<>()

Option Name: backwardDescendingFixpoint

Option Type: BackwardCFGFixpoint<?, ?>

Default Value: null

Example usage:

conf.backwardDescendingFixpoint = new CustomBackwardDescendingFixpoint<>()

Option Name: fixpointWorkingSet

Option Type: WorkingSet<Statement>

Default Value: new OrderBasedWorkingSet()

Example usage:

conf.fixpointWorkingSet = new CustomWorkingSet<>()

All fixpoint algorithms that LiSA executes over control flow graphs can be customized. forwardFixpoint determines the fixpoint to compute forward fixpoints over CFGs, while backwardFixpoint determines the fixpoint to compute backward fixpoints over CFGs. The interprocedural analysis selected through the interproceduralAnalysis option determines whether forward and/or backward fixpoints are required, and thus whether the values of these options are relevant.

Optionally, descending fixpoints can be computed after the ascending ones, to refine the results. forwardDescendingFixpoint determines the fixpoint to compute descending forward fixpoints over CFGs, while backwardDescendingFixpoint determines the fixpoint to compute descending backward fixpoints over CFGs. If no value is set for forwardDescendingFixpoint or backwardDescendingFixpoint, no descending phase will be executed.

The order in which the nodes of the CFG are visited during fixpoint iterations is determined by the WorkingSet passed to the fixpointWorkingSet option.

In all options above, the instances passed to the fields are used as factories to create new fixpoint instances or new working sets.

Built-in Forward Fixpoint implementations

ForwardAscendingFixpoint [Source code]
A ForwardCFGFixpoint that traverses ascending chains using lubs and widenings.

ForwardDescendingGLBFixpoint [Source code]
A ForwardCFGFixpoint that traverses descending chains using glbs up to threshold.

ForwardDescendingNarrowingFixpoint [Source code]
A ForwardCFGFixpoint that traverses descending chains using narrowings.

OptimizedForwardAscendingFixpoint [Source code]
An OptimizedForwardFixpoint that traverses ascending chains using lubs and widenings.

OptimizedForwardDescendingGLBFixpoint [Source code]
An OptimizedForwardFixpoint that traverses descending chains using glbs up to threshold.

OptimizedForwardDescendingNarrowingFixpoint [Source code]
An OptimizedForwardFixpoint that traverses descending chains using narrowings.

Built-in Backward Fixpoint implementations

BackwardAscendingFixpoint [Source code]
A BackwardCFGFixpoint that traverses ascending chains using lubs and widenings.

BackwardDescendingGLBFixpoint [Source code]
A BackwardCFGFixpoint that traverses descending chains using glbs up to threshold.

BackwardDescendingNarrowingFixpoint [Source code]
A BackwardCFGFixpoint that traverses descending chains using narrowings.

OptimizedBackwardAscendingFixpoint [Source code]
An OptimizedBackwardFixpoint that traverses ascending chains using lubs and widenings.

OptimizedBackwardDescendingGLBFixpoint [Source code]
An OptimizedBackwardFixpoint that traverses descending chains using glbs up to threshold.

OptimizedBackwardDescendingNarrowingFixpoint [Source code]
An OptimizedBackwardFixpoint that traverses descending chains using narrowings.

Built-in Working Set implementations

ConcurrentFIFOWorkingSet [Source code]
A first-in, first-out working set. This implementation is thread-safe.

ConcurrentLIFOWorkingSet [Source code]
A last-in, first-out working set. This implementation is thread-safe.

DuplicateFreeFIFOWorkingSet [Source code]
A FIFO working set that guarantees that, at any time, the same element cannot appear more than once in it. It works by pushing elements only if they are not already part of the working set. This implementation is not thread-safe.

DuplicateFreeLIFOWorkingSet [Source code]
A LIFO working set that guarantees that, at any time, the same element cannot appear more than once in it. It works by pushing elements only if they are not already part of the working set. This implementation is not thread-safe.

FIFOWorkingSet [Source code]
A first-in, first-out working set. This implementation is not thread-safe.

LIFOWorkingSet [Source code]
A last-in, first-out working set. This implementation is not thread-safe.

OrderBasedWorkingSet [Source code]
A WorkingSet for Statements that sorts its contents according to their natural order. This is specifically designed for fixpoint algorithms of CFGs: since the natural order of Statements discriminates for their CodeLocation first, this allows instructions that are exit points of control-flow structures to be analyzed only when all branches of the preceding structure have been fully analyzed. This holds since, unless several GOTO-like instructions are present, contents of ifs and loops always appear earlier in the code w.r.t. the exit points. Note that this working set is backed by a set: it is thus impossible to have duplicates in it.

VisitOnceFIFOWorkingSet [Source code]
A FIFO working set that guarantees that each element will be added to this working set no more than once. It works by pushing elements only if they were not already added before (even if they have already been popped out). This implementation is not thread-safe.

VisitOnceLIFOWorkingSet [Source code]
A LIFO working set that guarantees that each element will be added to this working set no more than once. It works by pushing elements only if they were not already added before (even if they have already been popped out). This implementation is not thread-safe.
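
The differences between the FIFO/LIFO, duplicate-free, and visit-once working sets listed above can be sketched with standard Java collections. The class below is a hypothetical model written for this illustration, not LiSA's WorkingSet implementations.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

final class WorkingSetSketch {
    enum Order { FIFO, LIFO }
    enum Dedup { NONE, DUPLICATE_FREE, VISIT_ONCE }

    private final Deque<String> pending = new ArrayDeque<>();
    private final Set<String> everAdded = new HashSet<>();
    private final Order order;
    private final Dedup dedup;

    WorkingSetSketch(Order order, Dedup dedup) {
        this.order = order;
        this.dedup = dedup;
    }

    void push(String e) {
        if (dedup == Dedup.DUPLICATE_FREE && pending.contains(e))
            return; // already waiting to be processed
        if (dedup == Dedup.VISIT_ONCE && !everAdded.add(e))
            return; // was added before, even if it has been popped since
        pending.addLast(e);
    }

    String pop() {
        return order == Order.FIFO ? pending.removeFirst() : pending.removeLast();
    }

    // Pushes a, b, a, pops one element, pushes it again, then drains the set.
    static List<String> run(Order order, Dedup dedup) {
        WorkingSetSketch ws = new WorkingSetSketch(order, dedup);
        List<String> popped = new ArrayList<>();
        ws.push("a");
        ws.push("b");
        ws.push("a");
        String first = ws.pop();
        popped.add(first);
        ws.push(first);
        while (!ws.pending.isEmpty())
            popped.add(ws.pop());
        return popped;
    }

    public static void main(String[] args) {
        for (Order o : Order.values())
            for (Dedup d : Dedup.values())
                System.out.println(o + "/" + d + " -> " + run(o, d));
    }
}
```

For instance, the FIFO run pops a, b, a, a without deduplication, a, b, a when duplicate-free, and a, b when each element may be added only once.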

Option Name: useWideningPoints

Option Type: boolean

Default Value: true

Example usage:

conf.useWideningPoints = false

Option Name: hotspots

Option Type: Predicate<Statement>

Default Value: null

Example usage:

conf.hotspots = stmt -> stmt instanceof Assignment

Option Name: dumpForcesUnwinding

Option Type: boolean

Default Value: false

Example usage:

conf.dumpForcesUnwinding = true

LiSA can be optimized in several ways. A simple optimization is to use widenings and narrowings only on widening points (i.e., loop conditions), and to use lubs and glbs on all other nodes regardless of the threshold. This is typically more efficient, as widening and narrowing are more expensive than lub and glb, and the results of lubs and glbs are often more precise than those of widenings and narrowings. This behavior can be enabled by setting the useWideningPoints option to true. Note that widening points correspond to the conditions of loops, as identified by CFG.getCycleEntries().

A second optimization is to use optimized fixpoint algorithms (i.e., algorithms for which invocations of AnalysisFixpoint.isOptimized() on forwardFixpoint, forwardDescendingFixpoint, backwardFixpoint, or backwardDescendingFixpoint yield true; these correspond to the ones having Optimized in their name). Such algorithms exploit basic blocks, and store the fixpoint results only for widening points (i.e., loop conditions), return statements, and calls. This is doable since results for any other instruction can be reconstructed by executing a single fixpoint iteration local to the CFG that contains the instruction, which will stabilize in one iteration since the results of widening points are already a post-fixpoint. This reconstruction is called unwinding in LiSA. When such algorithms are used, the hotspots predicate can be set to determine additional statements for which the fixpoint results must be kept to avoid excessive unwinding. null is a special value corresponding to the predicate t -> false. Instead, dumpForcesUnwinding can be set to true to force unwinding of all non-hotspot statements when dumping results to output files, to ensure that results are available for all program instructions.
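
The selection of statements whose results are kept by an optimized fixpoint can be sketched as follows. This is only a model of the rule described above: statements are plain strings, and the alwaysStored and storedStatements methods are hypothetical helpers, not part of LiSA.

```java
import java.util.List;
import java.util.function.Predicate;
import java.util.stream.Collectors;

final class HotspotSketch {
    // Hypothetical recognizer for widening points, returns, and calls.
    static boolean alwaysStored(String stmt) {
        return stmt.startsWith("while") || stmt.startsWith("return") || stmt.startsWith("call");
    }

    // Results are kept for the statements above plus any user-defined hotspot.
    static List<String> storedStatements(List<String> cfg, Predicate<String> hotspots) {
        // null behaves like the predicate t -> false
        Predicate<String> effective = hotspots == null ? s -> false : hotspots;
        return cfg.stream()
                .filter(s -> alwaysStored(s) || effective.test(s))
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<String> cfg = List.of("x = 0", "while x < 10", "x = x + 1", "call f(x)", "return x");
        System.out.println(storedStatements(cfg, null));
        System.out.println(storedStatements(cfg, s -> s.equals("x = x + 1")));
    }
}
```

Results for statements missing from the returned list would have to be recomputed through unwinding whenever they are requested.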

Hiding Errors and Exceptions

Option Name: shouldSmashError

Option Type: Predicate<Type>

Default Value: null

Example usage:

conf.shouldSmashError = type -> type.getName().equals("java.lang.NullPointerException")

Some error types might pollute the analysis results, since they might not be relevant for the properties to prove or they are caused by an excessive imprecision of the analysis. While it is not possible to completely remove them (as the modifications they cause to the control flow must be taken into account), it is possible to smash them, that is, to not have a separate entry for each of their occurrences in the AnalysisState errors. Instead, all occurrences of smashed errors will share a unique ProgramState. The choice over what error types to smash is determined by the shouldSmashError predicate, that returns true for the types of errors to smash. null is a special value corresponding to the predicate t -> false.
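
The effect of smashing can be pictured with the sketch below, where errors are normally recorded per type and location, while smashed ones are merged into a single shared entry. This is a loose model and not LiSA's AnalysisState: the string keys, the SMASHED_KEY constant, and the string concatenation used as a stand-in for joining states are all assumptions.

```java
import java.util.Map;
import java.util.TreeMap;
import java.util.function.Predicate;

final class ErrorSmashingSketch {
    static final String SMASHED_KEY = "<smashed>";

    // Each raised error is described as {type, location, state}.
    static Map<String, String> record(String[][] raised, Predicate<String> shouldSmash) {
        // null behaves like the predicate t -> false
        Predicate<String> effective = shouldSmash == null ? t -> false : shouldSmash;
        Map<String, String> errors = new TreeMap<>();
        for (String[] error : raised) {
            String type = error[0], location = error[1], state = error[2];
            String key = effective.test(type) ? SMASHED_KEY : type + "@" + location;
            // Concatenation stands in for the join of two abstract states.
            errors.merge(key, state, (old, added) -> old + " | " + added);
        }
        return errors;
    }

    public static void main(String[] args) {
        String[][] raised = {
                { "NullPointerException", "L3", "s1" },
                { "NullPointerException", "L8", "s2" },
                { "IOException", "L5", "s3" } };
        System.out.println(record(raised, null));
        System.out.println(record(raised, t -> t.equals("NullPointerException")));
    }
}
```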

Event Listeners

Option Name: synchronousListeners

Option Type: List<EventListener>

Default Value: new LinkedList<>()

Example usage:

conf.synchronousListeners.add(new LoggingListener())

Option Name: asynchronousListeners

Option Type: List<EventListener>

Default Value: new LinkedList<>()

Example usage:

conf.asynchronousListeners.add(new TracingListener())

EventListeners can be registered to process events emitted during the analysis. Listeners can be either synchronous or asynchronous, and are registered through the synchronousListeners and asynchronousListeners options, respectively. Synchronous listeners will be executed in the same thread as the analysis itself, and thus will block the analysis until they complete. Asynchronous listeners will be executed in a separate thread, and thus will not block the analysis. Synchronous listeners are executed before asynchronous ones, and the order of execution of the listeners preserves the insertion order into the respective collection. The lists of listeners should only be added to, and not replaced with other (possibly immutable) lists, as LiSA might add new listeners depending on the values of other options.
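
The difference between the two kinds of listeners can be sketched with a single background thread for the asynchronous ones. The dispatcher below is a hypothetical model built on java.util.concurrent, with events as plain strings and listeners as Consumer instances; it is not how LiSA's event queue is implemented.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.function.Consumer;

final class ListenerDispatchSketch {
    final List<Consumer<String>> synchronousListeners = new LinkedList<>();
    final List<Consumer<String>> asynchronousListeners = new LinkedList<>();
    private final ExecutorService worker = Executors.newSingleThreadExecutor();

    void post(String event) {
        // Synchronous listeners run on the caller's thread, in insertion order.
        for (Consumer<String> listener : synchronousListeners)
            listener.accept(event);
        // Asynchronous listeners run later on the worker thread, in insertion order.
        worker.submit(() -> {
            for (Consumer<String> listener : asynchronousListeners)
                listener.accept(event);
        });
    }

    // Waits for the pending asynchronous notifications to be delivered.
    void shutdown() {
        worker.shutdown();
        try {
            worker.awaitTermination(5, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    public static void main(String[] args) {
        ListenerDispatchSketch bus = new ListenerDispatchSketch();
        List<String> asyncLog = Collections.synchronizedList(new ArrayList<>());
        bus.synchronousListeners.add(e -> System.out.println("sync saw " + e));
        bus.asynchronousListeners.add(e -> asyncLog.add(e));
        bus.post("start");
        bus.post("end");
        bus.shutdown();
        System.out.println("async saw " + asyncLog);
    }
}
```

A slow synchronous listener delays post itself, and thus the analysis, whereas a slow asynchronous one only delays the worker thread.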

Built-in Event Listener implementations

BottomTopListener [Source code]
An event listener that traces bottom and top elements generated during the analysis starting from states that are not bottom or top.

CallResolutionListener [Source code]
An event listener that issues notices on call resolution events.

FlameGraphListener [Source code]
An event listener that traces StartEvents and EndEvents to build a flame graph-style html page.

TracingListener [Source code]
An event listener that traces StartEvents and EndEvents to a trace file, constructing a timeline of how the analysis performed.

Built-in Event implementations

AnalysisAssignEnd [Source code]
An event signaling the end of an assignment of a value to a symbolic expression during the analysis.

AnalysisAssignStart [Source code]
An event signaling the start of an assignment of a value to a symbolic expression during the analysis.

AnalysisAssumeEnd [Source code]
An event signaling the end of an assumption of a symbolic expression during the analysis.

AnalysisAssumeStart [Source code]
An event signaling the start of an assumption of a symbolic expression during the analysis.

AnalysisErrorsToExecEnd [Source code]
An event signaling the end of the transfer of error states to the execution state during the analysis.

AnalysisErrorsToExecStart [Source code]
An event signaling the start of the transfer of error states to the execution state during the analysis.

AnalysisExecToErrorEnd [Source code]
An event signaling the end of the transfer of the execution state to an error state during the analysis.

AnalysisExecToErrorStart [Source code]
An event signaling the start of the transfer of the execution state to an error state during the analysis.

AnalysisExecToHaltEnd [Source code]
An event signaling the end of the transfer of the execution state to the halting state during the analysis.

AnalysisExecToHaltStart [Source code]
An event signaling the start of the transfer of the execution state to the halting state during the analysis.

AnalysisOnCallReturnEnd [Source code]
An event signaling the end of the context transfer from a callee back to the caller during the analysis.

AnalysisOnCallReturnStart [Source code]
An event signaling the start of the context transfer from a callee back to the caller during the analysis.

AnalysisRemoveCaughtEnd [Source code]
An event signaling the end of the removal of caught errors during the analysis.

AnalysisRemoveCaughtStart [Source code]
An event signaling the start of the removal of caught errors during the analysis.

AnalysisSatisfiesEnd [Source code]
An event signaling the end of a satisfiability test of a symbolic expression during the analysis.

AnalysisSatisfiesStart [Source code]
An event signaling the start of a satisfiability test of a symbolic expression during the analysis.

AnalysisSmallStepEnd [Source code]
An event signaling the end of a semantics computation of a symbolic expression during the analysis.

AnalysisSmallStepStart [Source code]
An event signaling the start of a semantics computation of a symbolic expression during the analysis.

AnalysisTransferThrowersEnd [Source code]
An event signaling the end of the transfer of throwers during the analysis.

AnalysisTransferThrowersStart [Source code]
An event signaling the start of the transfer of throwers during the analysis.

DomainAssignEnd [Source code]
An event signaling the end of an assignment of a value to a symbolic expression by a domain taking part in the analysis.

DomainAssignStart [Source code]
An event signaling the start of an assignment of a value to a symbolic expression by a domain taking part in the analysis.

DomainAssumeEnd [Source code]
An event signaling the end of an assumption of a symbolic expression by a domain taking part in the analysis.

DomainAssumeStart [Source code]
An event signaling the start of an assumption of a symbolic expression by a domain taking part in the analysis.

DomainSatisfiesEnd [Source code]
An event signaling the end of a satisfiability test of a symbolic expression by a domain taking part in the analysis.

DomainSatisfiesStart [Source code]
An event signaling the start of a satisfiability test of a symbolic expression by a domain taking part in the analysis.

DomainSmallStepEnd [Source code]
An event signaling the end of a semantics computation of a symbolic expression by a domain taking part in the analysis.

DomainSmallStepStart [Source code]
An event signaling the start of a semantics computation of a symbolic expression by a domain taking part in the analysis.

HeapRewriteEnd [Source code]
An event signaling the end of the rewrite of a symbolic expression by the heap domain.

HeapRewriteStart [Source code]
An event signaling the start of the rewrite of a symbolic expression by the heap domain.

SADSubsEnd [Source code]
An event signaling the end of the substitutions application by a SimpleAbstractDomain.

SADSubsStart [Source code]
An event signaling the start of the substitutions application by a SimpleAbstractDomain.

NRDEvalEnd [Source code]
An event signaling the end of the evaluation of a symbolic expression by a non-relational domain taking part in the analysis.

NRDEvalStart [Source code]
An event signaling the start of the evaluation of a symbolic expression by a non-relational domain taking part in the analysis.

CallResolved [Source code]
An event signaling that a call has been resolved.

CFGFixpointEnd [Source code]
An event signaling the end of the fixpoint computation for a given CFG in the interprocedural analysis.

CFGFixpointStart [Source code]
An event signaling the start of the fixpoint computation for a given CFG in the interprocedural analysis.

CFGFixpointStored [Source code]
An event signaling that the result of a cfg fixpoint has been stored in the results cache. The stored version might be different from the actual result, since some lattice operations might have been applied with some pre-existing result.

ComputedCallResult [Source code]
An event signaling that the analysis computed the given result for a given call.

ComputedCallState [Source code]
An event signaling that the analysis has computed the state for a given call.

FixpointEnd [Source code]
An event signaling the end of the program-wide fixpoint of the interprocedural analysis.

FixpointIterationEnd [Source code]
An event signaling the end of a fixpoint iteration during an interprocedural analysis.

FixpointIterationStart [Source code]
An event signaling the start of a fixpoint iteration during an interprocedural analysis.

FixpointStart [Source code]
An event signaling the start of the program-wide fixpoint of the interprocedural analysis.

PrecomputedCallResult [Source code]
An event signaling that the analysis found an existing compatible result for a given call.

RecursionEnd [Source code]
An event signaling the end of the recursion solving during an interprocedural analysis.

RecursionStart [Source code]
An event signaling the start of the recursion solving during an interprocedural analysis.

EdgeTraverseEnd [Source code]
An event signaling the end of the traversal for a given Edge during a fixpoint computation.

EdgeTraverseStart [Source code]
An event signaling the start of the traversal for a given Edge during a fixpoint computation.

JoinPerformed [Source code]
An event signaling that the join operation between subsequent abstractions for the same Statement has been performed during a fixpoint computation.

LeqPerformed [Source code]
An event signaling that the comparison operation between subsequent abstractions for the same Statement has been performed during a fixpoint computation.

PreStateComputed [Source code]
An event signaling that the pre-state for a statement has been computed during a fixpoint computation. Depending on the analysis direction, the pre-state might actually be the post-state: we do not distinguish between the two in terms of events for ease of handling.

StatementSemanticsEnd [Source code]
An event signaling the end of the semantics computation for a given Statement during a fixpoint computation.

StatementSemanticsStart [Source code]
An event signaling the start of the semantics computation for a given Statement during a fixpoint computation.

Producing Outputs

Option Name: workdir

Option Type: String

Default Value: Paths.get(".").toAbsolutePath().normalize().toString()

Example usage:

conf.workdir = "/tmp/lisa-analysis"

Option Name: outputs

Option Type: Collection<LiSAOutput>

Default Value: new HashSet<>()

Example usage:

conf.outputs.add(new JSONReportDumper())

All Outputs produced by LiSA are generated inside the working directory specified by the workdir option. By default, the working directory is the directory where the JVM executing LiSA was launched, but it can be customized by setting the workdir option to a different path. To generate a new output, it is sufficient to add it to the collection of outputs specified by the outputs option. Note that the order of generation of the outputs is not guaranteed, and should not be relied upon. The collection of outputs to produce should only be added to, and not replaced with other (possibly immutable) collections, as LiSA might add new outputs depending on the values of other options.
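The following sketch shows why the collection should be added to rather than replaced. OutputsConfigSketch is a hypothetical stand-in for the configuration object, outputs are modeled as plain strings, and frameworkCanAdd simulates the framework adding an output of its own at a later time; none of this is LiSA's actual code.

```java
import java.nio.file.Paths;
import java.util.Collection;
import java.util.HashSet;
import java.util.Set;

// Hypothetical stand-in for a configuration object; not LiSA's actual class.
class OutputsConfigSketch {

    // These mirror the documented default values of workdir and outputs.
    String workdir = Paths.get(".").toAbsolutePath().normalize().toString();
    Collection<String> outputs = new HashSet<>();

    // Simulates the framework adding an output of its own later on.
    static boolean frameworkCanAdd(OutputsConfigSketch conf) {
        try {
            conf.outputs.add("framework-output");
            return true;
        } catch (UnsupportedOperationException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        OutputsConfigSketch safe = new OutputsConfigSketch();
        safe.outputs.add("user-output"); // adding keeps the collection mutable
        System.out.println(frameworkCanAdd(safe)); // prints "true"

        OutputsConfigSketch unsafe = new OutputsConfigSketch();
        unsafe.outputs = Set.of("user-output"); // replacing with an immutable set
        System.out.println(frameworkCanAdd(unsafe)); // prints "false"
    }
}
```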

Built-in Output implementations

DotCallGraph [Source code]
An output that dumps the CallGraph produced by the analysis, if any, in dot format.

DotInputs [Source code]
An output that dumps each input cfg as a dot file, with no information on the analysis results.

DotResults [Source code]
An output that dumps each input cfg as a dot file, including the results produced by the analysis.

HtmlCallGraph [Source code]
An output that dumps the CallGraph produced by the analysis, if any, in html format.

HtmlInputs [Source code]
An output that dumps each input cfg as an html file, optionally including subnodes, with no information on the analysis results.

HtmlResults [Source code]
An output that dumps each input cfg as an html file, optionally including subnodes, together with the results produced by the analysis.

JSONCallGraph [Source code]
An output that dumps the CallGraph produced by the analysis, if any, in json format.

JSONInputs [Source code]
An output that dumps each input cfg as a json file, with no information on the analysis results.

JSONReportDumper [Source code]
An output that dumps the analysis report in JSON format to "report.json".

JSONResults [Source code]
An output that dumps each input cfg as a json file, including the results produced by the analysis.

Logging

Logging is not configured through the LiSAConfiguration object. LiSA produces all logging through log4j2, and will thus follow the framework’s own configuration. There are a number of ways to configure log4j2, but the simplest one is to create a log4j2.xml file in the working directory, with the desired configuration. For example, the following configuration will log all messages of level DEBUG or higher to the console:

<?xml version="1.0" encoding="UTF-8"?>
<Configuration status="WARN" name="DefaultLoggingConf">
  <Appenders>
    <Console name="console">
      <PatternLayout pattern="%d [%5level] %m %ex%n"/>
    </Console>
  </Appenders>

  <Loggers>
    <Logger name="it.unive.lisa" level="DEBUG" />
    <Logger name="org.reflections" level="WARN" />

    <Root level="DEBUG">
      <AppenderRef ref="console" level="DEBUG"/>
    </Root>
  </Loggers>
</Configuration>

Please check log4j2’s documentation for more details on how to configure logging.

Tip:
If no logging is configured, LiSA will set up a default configuration that logs to the console only.

Default and Test Configuration

LiSA also offers a class named DefaultConfiguration, which provides default values for the interprocedural analysis and the call graph. It also offers utility methods for building an abstract domain following the Simple Abstract Domain framework.

A common use case when developing static analyzers is to have end-to-end tests that start from an input file and the necessary configuration, execute a full analysis as a black box, and compare the results obtained with some pre-existing results. LiSA provides a unique infrastructure for this use case to simplify testing. A TestConfiguration is a LiSAConfiguration extended with the following fields:

testDir defines the relative path to the root folder where test files are located;
testSubDir defines an optional path relative to testDir to use as workdir for the analysis, useful to keep output files separated for similar tests;
programFile holds the name of the source file to analyze, relative to testDir;
forceUpdate specifies that, should any difference be found between the results of the analysis and the pre-existing results, the pre-existing results should be updated with the new results instead of raising an error;
compareWithOptimization specifies that, should no difference be found between the results of the analysis and the pre-existing results, the analysis should be executed again with optimizations enabled, and the results of the optimized analysis should be compared with the pre-existing results as well, to check that optimizations do not change the results of the analysis;
resultComparer holds a reference to the ResultComparer instance to use to compare the results of the analysis with the pre-existing results, that can be customized to ignore irrelevant differences between the results or additional analyzer-specific settings.
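How the path-related fields combine can be sketched as follows. TestPathsSketch and its methods are hypothetical helpers, and root stands for whatever root folder the test files are resolved against; the sketch only illustrates that a null testSubDir is treated as absent and that programFile is relative to testDir.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper; the names below are not part of LiSA.
class TestPathsSketch {

    // Resolves root/testDir/testSubDir, treating a null testSubDir as absent.
    static Path testFolder(Path root, String testDir, String testSubDir) {
        Path dir = root.resolve(testDir);
        return testSubDir == null ? dir : dir.resolve(testSubDir);
    }

    // programFile is relative to testDir, regardless of testSubDir.
    static Path program(Path root, String testDir, String programFile) {
        return root.resolve(testDir).resolve(programFile);
    }

    public static void main(String[] args) {
        Path root = Paths.get("tests"); // hypothetical root folder
        System.out.println(testFolder(root, "taint", null));
        System.out.println(testFolder(root, "taint", "simple"));
        System.out.println(program(root, "taint", "prog.imp"));
    }
}
```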

A TestConfiguration can be used with an instance of AnalysisTestExecutor, an abstract class defined in LiSA to provide the standard workflow for executing end-to-end tests. The class has a constructor that takes the path to the expected results folder, where the pre-existing results are located, and a path to the actual results folder, where the files produced by the analysis will be generated. The analysis is started by invoking one of the perform overloads, each accepting a TestConfiguration and optionally an already parsed Program. If the program is not provided, the abstract readProgram method will be invoked to parse the file located at expected-dir/testDir/programFile. Then, the execution proceeds as follows (in the following, if testSubDir is null, testDir/testSubDir should be read as testDir):

the folder actual-dir/testDir/testSubDir is cleared of all files, and it is set as workdir for the analysis;
an instance of JSONReportDumper is added to the outputs to produce;
a LiSA instance is created with the TestConfiguration as configuration, and it is run on the parsed program;
if no expected-dir/testDir/testSubDir/report.json file exists and forceUpdate is not set, the execution terminates;
if no expected-dir/testDir/testSubDir/report.json file exists and forceUpdate is set, the contents of the workdir will be copied to expected-dir/testDir/testSubDir, and the execution terminates;
if expected-dir/testDir/testSubDir/report.json exists and forceUpdate is not set, the resultComparer is used to compare it with actual-dir/testDir/testSubDir/report.json, raising an exception if there are any differences;
if expected-dir/testDir/testSubDir/report.json exists and forceUpdate is set, the resultComparer is used to compare it with actual-dir/testDir/testSubDir/report.json, and all files where at least one difference is found are copied from the workdir to expected-dir/testDir/testSubDir, replacing the pre-existing files;
if compareWithOptimization is not set, or if it is set but the fixpoints used were already optimized, the execution terminates;
if compareWithOptimization is set and the fixpoints used were not already optimized, the whole process is repeated with the same configuration but with optimized fixpoints, and the results of the optimized analysis are compared with the pre-existing results as well, raising an exception if there are any differences (in this run, forceUpdate is ignored).
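The comparison steps above can be summarized as a small decision function. This is only a model of the workflow as described: TestOutcomeSketch, Action, and the method names are hypothetical, and the actual executor performs file operations and raises exceptions instead of returning values.

```java
// Hypothetical model of the decision steps; not LiSA's actual code.
class TestOutcomeSketch {

    enum Action {
        TERMINATE,              // nothing to compare against, nothing to update
        COPY_ALL_TO_EXPECTED,   // first run with updates enabled
        PASS,                   // reports match
        FAIL,                   // reports differ, an exception is raised
        UPDATE_DIFFERING_FILES  // reports differ, expected files are replaced
    }

    // Decides what happens after the analysis has produced its report.
    static Action decide(boolean expectedReportExists, boolean forceUpdate,
            boolean differencesFound) {
        if (!expectedReportExists)
            return forceUpdate ? Action.COPY_ALL_TO_EXPECTED : Action.TERMINATE;
        if (!differencesFound)
            return Action.PASS;
        return forceUpdate ? Action.UPDATE_DIFFERING_FILES : Action.FAIL;
    }

    // A second, optimized run happens only if requested and not already done.
    static boolean rerunWithOptimizedFixpoints(boolean compareWithOptimization,
            boolean alreadyOptimized) {
        return compareWithOptimization && !alreadyOptimized;
    }

    public static void main(String[] args) {
        System.out.println(decide(false, false, false));
        System.out.println(decide(true, false, true));
        System.out.println(decide(true, true, true));
    }
}
```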

As an alternative to forceUpdate, the lisa.cron.update system property can be set to true to achieve the same effect.
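A minimal sketch of how the field and the system property could be combined is shown below. UpdateFlagSketch and shouldUpdate are hypothetical names, and the way LiSA reads the property internally is an assumption; only the property name comes from the text above.

```java
// Hypothetical helper; how LiSA reads the property internally is an assumption.
class UpdateFlagSketch {

    static final String PROPERTY = "lisa.cron.update";

    // Updates are enabled if either the field or the system property asks for them.
    static boolean shouldUpdate(boolean forceUpdate) {
        // Boolean.getBoolean is true only when the property equals "true" (ignoring case).
        return forceUpdate || Boolean.getBoolean(PROPERTY);
    }

    public static void main(String[] args) {
        System.out.println(shouldUpdate(false));
    }
}
```

System properties are usually passed on the JVM command line, for example with -Dlisa.cron.update=true, which makes it possible to update the expected results without touching the test code.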

Frontends

Frontends for several languages have been developed over the years. Recall that frontends are not part of LiSA, but rather they are fully-fledged static analyzers that use LiSA as a library to execute the analysis. They can be used as-is, can be extended, or can be used as examples to build new frontends for other languages. For more details on how to build a frontend, please check the frontend documentation.

GoLiSA

GoLiSA is a frontend for a subset of the Go programming language. It has been developed with the objective of performing analyses targeting security properties of blockchain software and smart contracts, focusing on the frameworks Hyperledger Fabric, Cosmos SDK, Ethereum Client, and Tendermint Core. The properties targeted include harmful usage of non-deterministic APIs and constructs, dangerous cross-contract invocations, and read-write inconsistencies.