Practical Analysis of Stripped Binary Code
Laune C. Harris and Barton P. Miller
Computer Sciences Department
University of Wisconsin
1210 W. Dayton St.
Madison, WI 53706-1685 USA
{lharris,bart}@cs.wisc.edu
1. Introduction
Executable binary code is the authoritative source of
information about program content and behavior. The
compile, link, and optimize steps can cause a program's
detailed execution behavior to differ substantially from its
source code. Binary code analysis is used to provide infor-
mation about a program's content and structure, and is
therefore a foundation of many applications, including
binary modification[3,12,22,31], binary translation[5,29],
binary matching[30], performance profiling[13,16,18],
debugging, extraction of parameters for performance mod-
eling, computer security[7,8] and forensics[23,26]. Ide-
ally, binary analysis should produce information about the
content of the program's code (instructions, basic blocks,
functions, and modules), structure (control and data flow),
and data structures (global and stack variables). The qual-
ity and availability of this information affects applications
that rely on binary analysis.
This paper addresses the problem of using static binary
analysis to extract structural information from stripped
binary code. Stripped binaries are executables that lack
information about the locations, sizes, and layout of func-
tions and objects. This information is typically contained
in compiler-generated symbol tables. Analysts, however,
are often confronted with stripped binaries. Commercial
software is usually stripped to deter reverse engineering
and unlicensed use. Malicious code is stripped to resist
analysis. System libraries and utilities are often stripped to
reduce disk space requirements. Occasionally, even avail-
able symbol tables need to be ignored because they con-
tain incorrect or misleading information.
This structural information consists of inter- and
intra-procedural control flow together with function start
addresses, function ranges, basic blocks, and entry
and exit points. Our approach to extracting structural
information builds on control flow extraction and function
identification techniques that have been employed in pre-
vious work. In particular our work builds on techniques
used by LEEL [31] and RAD [22], which both use breadth-
first static call graph and control flow graph traversal to
discover and classify code. Both RAD and LEEL begin at
a program's entry point and traverse the static call graph to
find function entry points. RAD additionally uses pattern
matching against standard function preambles to discover
functions in sections of the code space that are not reach-
able by static call instructions in previously seen code. For
functions without static references or recognizable pream-
bles, RAD uses an optimistic identification strategy.
This paper makes the following contributions:
• Augments existing binary parsing methods with an
extended function model that more realistically
describes the structure of modern code and additional
assurance checks to boost confidence in the analysis.
• Presents tests and evaluation results on a large set of
real programs.
• Reports our experiences dealing with peculiarities of
production code.
One of the first tasks in designing a binary parser that
identifies functions is defining an appropriate function
model. Previous function models represent code that com-
plies with traditional code conventions; for example, func-
tions must have single entry points and contiguous code
ranges. Our code analyzer uses a multiple entry control
flow graph model that treats all code connected by intra-
procedural control flow as part of the same function. This
model was chosen for its ability to accurately represent
unconventional structures that are becoming increasingly
common in modern code. Our model can also, without
alteration, represent code both before and after modifica-
tion such as instrumentation.
Our analysis techniques were implemented in Dyninst
[3], a run-time binary modification tool used in a wide
variety of research and commercial environments. Since
Dyninst is multi-platform (operates on multiple operating
systems and architectures) we created a generic frame-
work for stripped code analysis. Our framework has a
modular design with replaceable and interchangeable
components making it independent of the operating sys-
tem, file format, and machine architecture.
The evaluation of our implementation is split into three
categories:
• Comparison of our parsing of stripped binaries with
the information provided by the compiler's symbol
table.
• Comparison of our parsing of stripped binaries with
other tools.
• Assessment of how well we can instrument a stripped
binary based on our parse results.
Our comparisons against source code reveal that, in
general, the functions that we recover correspond well to
the source. However, in some cases compiler optimiza-
tions, such as dead code removal and inlining, can cause
significant differences between the binary analysis results
and the source.
Compiler generated symbol tables provide a (some-
times reasonable) approximate account of the number of
functions in a binary and their locations. Initial results
from automated assessments on roughly 500 test programs
indicate that we recover, on average, better than 95% of
the functions in stripped executables.
We compared our analysis to IDAPro [10], a popular
commercial disassembler. Our parser usually recovers
more function information than IDAPro, and in several
cases, substantially more.
Instrumentation tests use our analysis results to insert
entry and exit instrumentation into the functions in our test
programs. The assessment rules were simple: abnormal
termination indicates failure and normal termination indi-
cates success. We were able to successfully instrument
better than 87% of the stripped test programs and better
than 99% of the unstripped ones. Note that most of the
failures are due to difficulties in instrumenting the code
rather than limitations of the analysis.
Our test experiences, along with feedback from Dyn-
inst users, gave us insights into the technical issues
involved in implementing robust stripped code analysis to
meet the needs of a general purpose binary editor. We
report some of these insights.
2. Related Work
This paper and the related work we discuss focus on
reconstruction of structural information using static binary
analysis. There are several categories of tools that perform
static binary analysis to obtain structural information
about binary code, including disassemblers, binary rewrit-
ers, decompilers, and binary translators. Since the tools in
each category tend to have similar requirements or employ
similar techniques, we partition our discussion of related
work according to application category.
While static analysis is commonly used, some tools use
an alternative method called dynamic code analysis, where
structural information such as inter-procedural control flow
and the call graph is determined at run time. Dynamo [1],
DynamoRIO [2], DIOTA [15], self-propelled instrumenta-
tion [18], and Pin [24] are program optimization tools that
use dynamic code analysis. Other tools, IDTrace [21], for
example, use both static and dynamic analysis. The
dynamic phase is used to correct errors resulting from
static analysis and to analyze code not detectable stati-
cally.
Disassemblers
Disassemblers convert binary code to symbolic assem-
bly language representations. A primary function of disas-
semblers is to visually present these assembly language
representations of binary code to human analysts. To aid
the analyst by organizing information, some disassemblers
present control flow information or attempt to partition
programs into functions. IDA-Pro [10], arguably the best
commercial disassembly tool, does both. IDA-Pro uses a
depth first call graph traversal to determine function start
addresses and intra-procedural control flow analysis to
determine function ranges. IDA-Pro, however, does not
provide support for non-contiguous functions and does not
identify functions that are only reached via indirect calls
(both constructs are common in production code). Another
approach used by some tools is to partition a
binary into functions by recognizing function prologues
[20]. This method has the advantage of being simple and
fast. However, it often creates a coarse grained or inaccu-
rate function structure if functions have non-standard pro-
logues or if there is data interspersed in the code.
Producing correct disassembly is often challenging due to
variable-length instruction sets, data mixed with code,
indirect control flow, and deliberate code obfuscation.
Schwarz et al. combined linear sweep disassembly and
recursive traversal to create a hybrid disassembly algo-
rithm[27]. Their work is based on the idea that if both lin-
ear sweep and recursive traversal disassembly agree then a
disassembly is likely to be correct. Kruegel et al. developed a
disassembly method for use on obfuscated code [11] in
response to a static code obfuscator developed by Linn and
Debray [14]. Their method uses CFG checks to verify dis-
assembly correctness; e.g., branches to the middle of an
instruction indicate an error. These disassembly checks
used by both methods form the core of a set of safety
checks we use for our speculative function discovery.
Binary Modification
Binary modifiers are tools that rewrite code either stati-
cally or at run time. Major uses of binary modification
include post compilation optimization, instrumentation,
and profiling. Most binary modification tools rely on the
presence of symbol table and debugging information.
Some tools, however, are able to reconstruct some struc-
tural information and therefore operate on stripped binary
code. EEL operates on SPARC binaries[12]. The two main
disadvantages of the EEL approach are that it is not well
suited to variable length instruction set architectures and
that it does not identify functions that are reachable only
through an indirect control transfer. The first of these dis-
advantages is addressed by LEEL [31], an EEL-based
binary editing tool designed for Linux on Intel's IA32
architecture. Beginning at the program's entry point,
LEEL walks the static call flow graph to build the set of
call targets that correspond to function starting points.
Recursive disassembly at these points finds the code
blocks in each function and establishes the function's size.
For gaps in the code space that are caused by the presence
of functions only reachable through indirect control flow
or by data bytes, LEEL provides two options. The first
option is conservative: do not analyze. The second option
is more aggressive: assume that the first byte of a gap is
the starting address of a function. These two options
present undesirable extremes. The first, while safer for
binary modification, provides low code coverage in bina-
ries where a high proportion of functions are reached only
through indirect calls and in binaries where indirect calls
form bridges to large sections of the call graph. The
approach taken by RAD [22] is similar to LEEL's. RAD is
a binary editor that defends against stack based buffer
overflow attacks by instrumenting function entry and exit
points. RAD achieves better code coverage than LEEL's
conservative option by matching byte sequences in gaps to
known function prologues. RAD's approach will not
locate functions that have no static references and have no
known prologues. Both RAD and LEEL use a conven-
tional view of a function: i.e., a self-contained sequence of
code with a single entry point, at least one exit point, and
laid out in a contiguous area of memory. This view, while
generally applicable, is not adequate for all applications.
In binary code, functions sometimes have multiple entry
points and are spread across non-contiguous areas of the
executable. Our work is designed to extend the approaches
used by LEEL and RAD to improve code coverage and to
properly handle unconventional function structures.
LEEL and RAD are able to operate on stand-alone exe-
cutable code: i.e., no debugging information is needed.
Some other tools rely on the presence of symbol tables,
relocation tables, and other debugging information to per-
form similar analyses (code discovery, CFG creation).
Examples include Etch [25] and PLTO [28], both of which
require relocation information.
3. Design
The compilation, optimization and linking steps often
create binaries that are structurally different from the cor-
responding source code. Therefore one of the first tasks in
designing a binary parser that identifies functions is defin-
ing an appropriate function model. Section 3.1 presents
the model that we use. The remaining parts of this section
describe our implementation.
3.1. Function Models
The function model determines the degree to which the
analysis output reflects the structure of the binary. For
our implementation, we chose a multiple-entry control
flow graph model. Control flow graph models treat func-
tions as sets of basic blocks that form a control flow
graph. CFG models are able to represent functions with
non-contiguous code; basic blocks can be located any-
where in a program's address space. Figure 1 shows three
control flow models: single entry, shared code, and multi-
ple entry. The single entry model is the simplest of the
three CFG models presented above. Each function in this
model has one entry block which dominates all other
blocks. The shared code model is a variant of the single
entry model that allows basic blocks to belong to multiple
functions. The multiple entry model treats all code con-
nected by intra-procedural control flow as part of the same
function.
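
To make the model concrete, the following sketch (ours, not Dyninst's actual data structures; all type and member names are illustrative) represents a function as a set of basic blocks with any number of entry blocks, so non-contiguous code falls out naturally:

#include <cstdint>
#include <map>
#include <set>

// A basic block carries its address range and its intra-procedural
// successor edges. Blocks may lie anywhere in the address space.
struct BasicBlock {
    uint64_t start;                    // address of first instruction
    uint64_t end;                      // address past the last instruction
    std::set<BasicBlock*> successors;  // intra-procedural edges only
};

// A function is the set of blocks connected by intra-procedural
// control flow, with one or more entry blocks.
struct Function {
    std::set<BasicBlock*> entries;     // multiple entry points allowed
    std::set<BasicBlock*> blocks;      // need not be contiguous
};

// A program-wide block map lets two functions reference the same
// block (the shared-code variant) without duplicating it.
using BlockMap = std::map<uint64_t, BasicBlock*>;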
Alternative models include the Prologue/Prologue and
Prologue/Exit models where functions are the regions
between known prologues or exits, and the Symbol Table
model which defines a function as a named start address
and size pair. These models were rejected since they do
not naturally describe binary-level functions that have non-
contiguous code ranges, multiple entry points, or shared
code blocks. In addition, the prologue models are vulnera-
ble to the presence of data in the code space and to unrec-
ognizable or missing prologues.
3.2. Implementation
Our parser was designed to be modular, with replace-
able and interchangeable components. These components
fall into three categories: file format reader, instruction
decoder, and code parser. In this section, we describe the
components along with our generic assembly language
interface that ties them together.
The Dyninst system also includes components for
dynamic code generation and process control, but these are
outside the scope of this paper.
File format reader
Our file format readers extract information from file
headers that describe the layout of the executable and con-
tain the addresses and sizes of interesting points and sec-
tions, including the locations of text, data, symbol tables,
and the program's entry point.

[Figure 1: Control Flow Models. (a) Single entry; (b) Shared code; (c) Multiple entry]

In addition, the file format readers use heuristics to
determine the address of the function main. Most programs,
after executing initial start-up
code, transfer control to a function commonly called main.
It is often difficult, however, to use static control flow anal-
ysis to locate main. Locating main using heuristics, while
not strictly necessary, gives two benefits. First, labeling the
function main is useful for tools that expect to find it. Sec-
ond, the program's call graph is rooted at main. Finding
this root early tends to reduce the parser's reliance on
speculative function discovery techniques.
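
As one concrete example of such a heuristic, on ELF/Linux binaries built against glibc the start-up code passes the address of main as the first argument to __libc_start_main. The sketch below is our illustration, not necessarily the exact rule Dyninst applies; the Insn record and findMain helper are hypothetical:

#include <cstdint>
#include <optional>

// Hypothetical decoded-instruction record; the real parser would see
// instructions through the generic assembly language interface.
struct Insn {
    uint64_t addr;
    bool isCall = false;
    std::optional<uint64_t> callTarget;       // static call target, if any
    std::optional<uint64_t> pushedImmediate;  // for "push imm32" etc.
};

// Scan the start-up instructions; the last immediate pushed before
// the call to __libc_start_main is taken to be the address of main.
// 'libcStartMainAddr' would come from PLT analysis (an assumption here).
template <typename InsnRange>
std::optional<uint64_t> findMain(const InsnRange& startup,
                                 uint64_t libcStartMainAddr) {
    std::optional<uint64_t> lastArg;
    for (const Insn& i : startup) {
        if (i.pushedImmediate) lastArg = i.pushedImmediate;
        if (i.isCall && i.callTarget == libcStartMainAddr) return lastArg;
    }
    return std::nullopt;  // fall back to other heuristics
}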
We provide format readers for ELF [6], COFF [19],
XCOFF [32], and PE [19].
Instruction decoder
The instruction decoder plays two roles. One is as a disas-
sembler; it interprets byte streams as machine instruc-
tions. The other role is to extract semantic information
from the instructions and implement the architecture
dependent routines that are accessed through the generic
assembly language interface.
We currently provide stripped code parsers for the Intel
IA32, AMD 64, and IBM Power architectures.
Generic Assembly Language interface
The generic assembly language interface exports a uni-
form assembly language to the parser independent of the
underlying architecture. The interface's object model allows
description of generic operations. Each instruction is
treated as an object that exports the following methods and
operators: isCondBranch, isUncondBranch, isIndirBranch,
isCall, isReturn, getBranchTarget, getMultiBranchTargets,
isPrologue, isIllegal, --, ++, and size. All parsing operations
that reference machine instructions use this interface.
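
A minimal C++ rendering of this interface might look like the following; the class shape and method signatures are our guesses based on the names listed above, not Dyninst's actual declarations:

#include <cstddef>
#include <cstdint>
#include <vector>

// Sketch of an architecture-independent instruction interface exposing
// the operations named in the text. Each architecture's decoder would
// supply a concrete subclass.
class Instruction {
public:
    virtual ~Instruction() = default;
    virtual bool isCondBranch() const = 0;
    virtual bool isUncondBranch() const = 0;
    virtual bool isIndirBranch() const = 0;
    virtual bool isCall() const = 0;
    virtual bool isReturn() const = 0;
    virtual bool isPrologue() const = 0;
    virtual bool isIllegal() const = 0;
    virtual uint64_t getBranchTarget() const = 0;
    virtual std::vector<uint64_t> getMultiBranchTargets() const = 0;
    virtual size_t size() const = 0;  // instruction length in bytes
};

// An iterator-style cursor over the instruction stream would supply the
// ++ and -- operators mentioned above, advancing by whole (possibly
// variable-length) instructions.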
Parser
Control flow discovery and function identification are
done by the parser, which accesses the program's instruc-
tion stream via the generic assembly language interface.
The generic assembly language interface enables us to use
a single parsing algorithm for multiple platforms. In the
best case, correct symbol tables and debugging informa-
tion will be available, and we use symbol start addresses as
analysis points. In the worst case, however, the program's
entry point (obtained by the file format reader) guarantees
a reliable starting point for analysis.
This initial set of starting addresses is used as input for
our parsing algorithm. Breadth-first call graph traversal
and recursive disassembly find all code statically reach-
able from the initial set of addresses.
Valid call target addresses are marked as functions and then
disassembled to build control flow graphs. Additional call
targets discovered during CFG creation are added to the
list of call targets for analysis. CFG conflicts and illegal
instructions cause rejection of a function. When the algo-
rithm terminates, unanalyzed gaps may remain
in the text space due to the presence of data,
alignment padding, or functions that are never statically
referenced in previously visited code. We use two specula-
tive gap completion phases to analyze these gaps.
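
The traversal can be summarized as a breadth-first worklist over call targets. This condensed sketch is ours; buildCFG, hasConflictOrIllegalInsn, and recordFunction are hypothetical helpers standing in for the recursive disassembly and validity checks described above:

#include <cstdint>
#include <queue>
#include <set>

// Assumed helpers, not real Dyninst entry points.
struct CFG;
CFG* buildCFG(uint64_t entry, std::set<uint64_t>* newCallTargets);
bool hasConflictOrIllegalInsn(const CFG* cfg);
void recordFunction(uint64_t entry, CFG* cfg);

// Breadth-first call graph traversal: seed with symbol addresses when
// a symbol table is available, otherwise with the program's entry
// point. Call targets discovered while building each CFG are fed back
// into the worklist until no new targets appear.
void parse(const std::set<uint64_t>& seeds) {
    std::queue<uint64_t> worklist;
    std::set<uint64_t> seen(seeds.begin(), seeds.end());
    for (uint64_t a : seeds) worklist.push(a);

    while (!worklist.empty()) {
        uint64_t entry = worklist.front();
        worklist.pop();
        std::set<uint64_t> discovered;
        CFG* cfg = buildCFG(entry, &discovered);
        if (hasConflictOrIllegalInsn(cfg)) continue;  // reject function
        recordFunction(entry, cfg);
        for (uint64_t t : discovered)
            if (seen.insert(t).second) worklist.push(t);
    }
    // Gaps left in the text space are handled by the two speculative
    // completion phases described next.
}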
The first phase searches each gap for known function
prologues. At each potential code address, the parser cre-
ates an instruction and calls isPrologue. An architecture-
specific method in the decoder checks for instruction
sequences commonly used to implement prologues on
each platform. For example, a common prologue sequence
on IA32 is:
push ebp        ; save caller's frame pointer
mov ebp, esp    ; establish new frame pointer
If isPrologue returns true, the corresponding address is
assumed to be the start of a function, and we invoke the
core algorithm to build the sub section of the program's
call graph rooted at this function.
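
A minimal isPrologue check for this IA32 sequence could match its standard byte encodings: 0x55 for push ebp, followed by either 0x8B 0xEC or 0x89 0xE5 (two encodings of mov ebp, esp). The sketch below is ours; a real detector matches a wider set of prologue patterns:

#include <cstddef>
#include <cstdint>
#include <vector>

// Match "push ebp; mov ebp, esp" at offset i using the standard
// encodings noted above.
static bool looksLikePrologue(const std::vector<uint8_t>& text, size_t i) {
    if (i + 3 > text.size() || text[i] != 0x55) return false;
    return (text[i + 1] == 0x8B && text[i + 2] == 0xEC) ||
           (text[i + 1] == 0x89 && text[i + 2] == 0xE5);
}

// Scan every byte offset in a gap; each hit becomes a candidate
// function entry handed to the core traversal for verification.
std::vector<size_t> scanGap(const std::vector<uint8_t>& text,
                            size_t gapStart, size_t gapEnd) {
    std::vector<size_t> candidates;
    for (size_t i = gapStart; i < gapEnd; ++i)
        if (looksLikePrologue(text, i)) candidates.push_back(i);
    return candidates;
}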
On completion of the first speculative discovery
phase, gaps may still be present in the text
space. These gaps may be due to functions that are called
indirectly and have no known prologue. We disassemble
these gaps using Kruegel et al.'s speculative completion
method [11]. Their method is to build control flow graphs start-
ing at each potential code address in a gap. CFG conflicts
prune the set of control flow graphs. For example, if
branches in a CFG target the apparent middle of another
instruction then that CFG is rejected. In addition, we add
the requirement that during this phase CFGs must contain
at least one non-terminal control transfer instruction. This
requirement reduces the number of false positives and
boosts confidence in the functions discovered.
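
Putting the two checks together, the acceptance test for a speculatively built CFG might look like the following sketch, in which the SpecCFG shape and field names are assumptions of ours:

#include <cstdint>
#include <set>

// Assumed summary of a speculatively built CFG.
struct SpecCFG {
    std::set<uint64_t> insnStarts;     // addresses of decoded instructions
    std::set<uint64_t> branchTargets;  // intra-procedural branch targets
    bool sawIllegalInsn;
    int nonTerminalTransfers;          // control transfers other than
                                       // returns and other exit points
};

// Accept a candidate only if (1) no illegal instruction was decoded,
// (2) no branch lands in the middle of another instruction (targets
// outside the region would be checked against a global code map in a
// full implementation), and (3) at least one non-terminal control
// transfer is present, which filters single-block false positives.
bool acceptCandidate(const SpecCFG& cfg) {
    if (cfg.sawIllegalInsn) return false;
    for (uint64_t t : cfg.branchTargets)
        if (!cfg.insnStarts.count(t)) return false;  // mid-instruction
    return cfg.nonTerminalTransfers >= 1;
}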
Two issues need to be addressed to effectively follow
function local control transfers on production code. The
first is the analysis of indirect branches; i.e., branches that
have targets determined at run time. Indirect branches are
commonly used to implement switch statements. Cifu-
entes and Van Emmerik describe a machine- and compiler-
independent method of using program slicing and expres-
sion substitution to statically recover the targets of indirect
branches that use jump tables[4]. In practice, however,
simple machine and compiler dependent methods have
proved effective at recovering jump table values. Our
approach is to backtrack along the control paths leading to
the indirect branch to discover the instructions that set up
the jump table access. These instructions give the base
location and number of entries in a jump table.
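
For a common gcc IA32 switch idiom (cmp eax, N; ja default; jmp [table + eax*4]), the backtracking step can be sketched as below; the DecodedInsn record is hypothetical, and real code must handle many more operand forms:

#include <cstdint>
#include <optional>
#include <vector>

// Assumed decoded-instruction record for this sketch.
struct DecodedInsn {
    bool isIndirectJump = false;
    std::optional<uint64_t> memBase;    // base address in a memory
                                        // operand like [base + reg*4]
    bool isCompareWithImmediate = false;
    std::optional<uint64_t> immediate;  // bound from "cmp reg, imm"
};

struct JumpTable { uint64_t base; uint64_t entries; };

// Walk backwards along the path that reaches an indirect jump: the
// jump's memory operand gives the table base, and the nearest
// preceding "cmp reg, imm" gives the entry count (imm + 1 for the
// common "ja default" bounds-check idiom).
std::optional<JumpTable>
recoverJumpTable(const std::vector<DecodedInsn>& path) {
    if (path.empty() || !path.back().isIndirectJump || !path.back().memBase)
        return std::nullopt;
    JumpTable jt{*path.back().memBase, 0};
    for (auto it = path.rbegin() + 1; it != path.rend(); ++it) {
        if (it->isCompareWithImmediate && it->immediate) {
            jt.entries = *it->immediate + 1;
            return jt;
        }
    }
    return std::nullopt;  // no bounds check found; give up
}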
The second issue is the identification of exit points. An
exit point that is misidentified as an intra-procedural transfer
will cause overestimation of function sizes. In addition to
return instruction detection, the instruction decoder imple-
ments platform-specific tail-call detection. Calls to known
non-returning functions (for example, exit or
abort) are also considered exit points. If a transfer is not
identified as an exit, it is assumed to be intra-procedural.
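
The resulting decision procedure can be summarized as the following sketch; the tail-call test and the set of non-returning callees are simplified stand-ins for the platform-specific logic:

#include <cstdint>
#include <set>
#include <string>

enum class TransferKind { Return, TailCall, NonReturningCall, IntraProcedural };

// Simplified classifier. 'knownFunctionStarts' holds entry addresses
// found so far; a jump that lands on another function's entry is
// treated as a tail call. Names of non-returning callees (exit,
// abort, ...) would come from symbol or PLT information.
TransferKind classifyTransfer(bool isReturn, bool isCall, bool isJump,
                              uint64_t target,
                              const std::set<uint64_t>& knownFunctionStarts,
                              const std::set<std::string>& nonReturning,
                              const std::string& calleeName) {
    if (isReturn) return TransferKind::Return;
    if (isCall && nonReturning.count(calleeName))
        return TransferKind::NonReturningCall;  // e.g. exit(), abort()
    if (isJump && knownFunctionStarts.count(target))
        return TransferKind::TailCall;          // exit point, not fall-through
    return TransferKind::IntraProcedural;       // the default per the text
}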
Our implementation was tested with the following
compilers: Microsoft's Visual C++, GNU's gcc, IBM's
xlc, and Intel's icc.
4. Evaluation
Proper evaluation of a stripped binary analysis tool is a
challenging process. Human inspection and verification are
useful for small sets of trivial test programs. For larger sets
of programs or larger, more sophisticated programs, how-
ever, human inspection is inappropriate. For our evaluation
we used automated comparisons against compiler-gener-
ated symbol tables and automated instrumentation tests.
We also compared our function recovery rate to that of
IDAPro, a popular commercial disassembly tool.
Our evaluations were done primarily on Linux and
Microsoft Windows. For evaluations requiring large num-
bers of binaries, we used programs obtained from our
department's standard bin directory (a very large collec-
tion of programs).
Comparison vs. symbol tables
We filtered the output from objdump to display unique
function addresses for each of 519 programs. We then
compared the number of functions we recovered to that
reported by objdump. Our results on the test binaries show
a 95.6% average recovery rate.
Comparison vs. IDAPro
Table 1 shows a comparison of our function recovery
rate against IDAPro. The results show that in general our
stripped code analysis identifies more functions than
IDAPro. For the firefox binary, inspection suggests that
IDAPro is incorrectly identifying thousands of two-
instruction sequences as functions (they appear to be parts
of dispatch tables).
Instrumentation tests
Using Dyninst we inserted function entry and exit point
instrumentation into 519 test binaries. Success means the
instrumented binary runs to completion without failure or
hanging. Results from these tests are reported in Table 2.
5. Experiences
In this section, we describe our experiences with real
but unusual code patterns.
Pseudo call instructions
Not all call instructions are valid interprocedural trans-
fers. Calls that target the instruction immediately follow-
ing the call are often used by position-independent code to
obtain the program counter. The targets of these instruc-
tions must not be used as function entries.
The targets of some call instructions are filled in at load
time. This means that the target value obtained by static
analysis of the binary is incorrect. In some cases this value
is obvious (target address 0, for example). In others the
target address is a junk value. We detect the first case when
we encounter calls, and rely on CFG and disassembly
checks to guard against the second case.
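
Both patterns can be filtered with a simple check at each call site. This sketch is ours, with an added range test that approximates the CFG and disassembly checks mentioned above for junk targets:

#include <cstdint>

// Returns true if a call instruction at 'addr' of length 'len' should
// NOT produce a function entry: either it targets the immediately
// following instruction (the position-independent get-PC idiom), or
// its statically visible target is 0 or falls outside the text range
// (a crude approximation; the paper relies on CFG and disassembly
// checks to catch junk load-time targets).
bool isPseudoCall(uint64_t addr, uint64_t len, uint64_t target,
                  uint64_t textStart, uint64_t textEnd) {
    if (target == addr + len) return true;  // call to next instruction
    if (target == 0) return true;           // filled in at load time
    if (target < textStart || target >= textEnd)
        return true;                        // junk static value
    return false;
}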
Exception handling code: unreachable blocks
C++ exception handling creates code blocks that appear to be
unreachable. Currently, our most reliable solution to this
problem is to extract exception information from compiler
generated exception tables.
Missing "main"
Binaries without an explicit main function illustrate the
need for speculative function discovery. Analysis of the
kwrite binary (part of the KDE tools) revealed that the exe-
cutable contained no main function. main was located in a
dynamically loaded library and accessed through the Pro-
cedure Linkage Table. None of the functions in the
stripped kwrite binary were statically reachable from the
program's entry point.
          Stripped         Unstripped
Passed    454   87.5%      517   99.6%
Failed     65   12.5%        2    0.4%

Table 2: Success Rate of Instrumentation Tests
Binary      Platform   IDA              Dyninst
                       Number  Percent  Number
aim         windows        75   100.0%      75
alara       linux         431    65.9%     654
bash        linux       1,539    92.0%   1,655
bubba       linux          53    96.4%      55
calc        windows       168    99.4%     169
eon         linux         616    53.2%   1,157
firefox*    windows    34,064   116.0%  29,372
gimp        linux       3,889    91.8%   4,237
kwrite      linux           7    77.8%       9
notepad     windows        84    98.8%      85
paradyn     linux       4,506    35.7%  12,617
SecureCRT   windows     4,139    97.8%   4,233
vcross      linux          52    62.7%      83
X           linux       3,991    92.7%   4,307

Table 1: Functions found using Dyninst and IDAPro.
"Percent" measures the percentage of functions found by
IDAPro compared to the total found by Dyninst.
False positives
Speculative code discovery analyzes data bytes. Data
bytes that are interpreted as terminal control flow instruc-
tions often give the appearance of single-basic-block func-
tions. For example, data bytes might be interpreted as the
following instruction sequence:
mov reg, mem
mov reg, mem
ret
To eliminate the occurrence of these false positives, we
require that functions discovered during the speculative
discovery phase have two or more control flow instruc-
tions. This restriction, while improving reliability, reduces
coverage by excluding legitimate functions that fit this pat-
tern. We are currently evaluating alternatives to this strat-
egy. One potential direction is to use simple semantic
analysis to determine whether a single block function dis-
covered during speculative discovery should be accepted
or rejected.
References
[1] V. Bala, E. Duesterwald and S. Banerjia, "Dynamo: A
Transparent Dynamic Optimization System", ACM SIGPLAN
'00 Conference on Programming Language Design and
Implementation (PLDI), June 2000.
[2] D. Bruening, T. Garnett, and S. Amarasinghe, "An Infrastructure for
Adaptive Dynamic Optimization". First Annual International
Symposium on Code Generation and Optimization, March 2003.
[3] B. Buck and J.K. Hollingsworth, "An API for Runtime Code
Patching", International Journal of High Performance
Computing Applications 14, 4, pp. 317-329, Winter 2000.
[4] C. Cifuentes and M. Van Emmerik, "Recovery of Jump Case
Statements from Binary Code", 7th International Workshop on
Program Comprehension, Washington, DC, May 1999.
[5] C. Cifuentes, M. Van Emmerik, and N. Ramsey, "The Design of
a Resourceable and Retargetable Binary Translator", Sixth
Working Conference on Reverse Engineering, Atlanta, October
1999.
[6] Executable and linking format,
http://www.skyfree.org/linux/references/ELF_Format.pdf
[7] J.T. Giffin, S. Jha, and B.P. Miller, "Detecting Manipulated
Remote Call Streams", 11th USENIX Security Symposium , San
Francisco, California, August 2002
[8] J.T. Giffin, S. Jha, and B.P. Miller, "Efficient Context-Sensitive
Intrusion Detection", 11th Network and Distributed System
Security Symposium, San Diego, California, February 2004
[9] HI-PVM, http://www.parasys.co.uk/
[10] IDAPro, http://www.datarescue.com/idabase/overview.htm.
[11] C. Kruegel, W. Robertson, F. Valeur, and G. Vigna, "Static
Disassembly of Obfuscated Binaries", 13th USENIX Security
Symposium, August 2004
[12] J.R. Larus and E. Schnarr, "EEL: Machine-independent executable
editing", SIGPLAN '95 Conference on Programming Language
Design and Implementation (PLDI), June 1995.
[13] J.R. Larus and T. Ball, "Rewriting Executable Files to Measure
Program Behavior", Software-Practice and Experience 24, 2,
February 1994.
[14] C. Linn and S. Debray, "Obfuscation of executable code to
improve resistance to static disassembly", 10th ACM Conference
on Computer Communications and Security (CCS), October
2003.
[15] J. Maebe, M. Ronsse, K. De Bosschere, "DIOTA: Dynamic
Instrumentation, Optimization and Transformation of
Applications", WBT-2002: Workshop on Binary Translation,
Charlottesville, Virginia, September 2002.
[16] B.P. Miller, M.D. Callaghan, J.M. Cargille, J.K. Hollingsworth,
R.B. Irvin, K.L. Karavanic, K. Kunchithapadam and T. Newhall,
"The Paradyn Parallel Performance Measurement Tool", IEEE
Computer 28, 11, November 1995, pp. 37-46.
[17] A.V. Mirgorodskiy and B.P. Miller, "CrossWalk: A Tool for
Performance Profiling Across the User-Kernel Boundary",
International Conference on Parallel Computing (ParCo),
Dresden, Germany, September 2003.
[18] A.V. Mirgorodskiy and B.P. Miller, "Autonomous Analysis of
Interactive Systems with Self-Propelled Instrumentation",
Multimedia Computing and Networking Conference, San Jose,
California, January 2005.
[19] Microsoft Portable Executable and Common Object File Format
Specification,
http://www.microsoft.com/whdc/system/platform/firmware/PE
COFF.mspx.
[20] A. Orso, M.J. Harrold, and G. Vigna, "MASSA: Mobile Agents
Security through Static/Dynamic Analysis", First ICSE
Workshop on Software Engineering and Mobility (WSEM 2001),
Toronto, Canada, April 2001.
[21] J. Pierce and T. Mudge, "IDtrace - A Tracing Tool for i486
Simulation", University of Michigan Tech. Report CSE-TR-203-
94. 1994.
[22] M. Prasad and T. Chiueh, "A Binary Rewriting Defense Against
Stack-based Buffer Overflow Attacks", USENIX Annual
Technical Conference, June 2003.
[23] Project Fenris: http://lcamtuf.coredump.cx/fenris/whatis.shtml
[24] C. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S.
Wallace, V. J. Reddi, K. Hazelwood, "Pin: Building Customized
Program Analysis Tools with Dynamic Instrumentation",
Programming Language Design and Implementation (PLDI),
Chicago, Illinois, June 2005.
[25] T. Romer, G. Voelker, D. Lee, A. Wolman, W. Wong, H. Levy,
B. Chen, and B. Bershad. Instrumentation and Optimization of
Win32/Intel Executables Using Etch. USENIX Windows NT
Workshop, August 1997.
[26] K. Rozinov, "Reverse Code Engineering: An In-Depth Analysis
of the Bagle Virus", Bell Labs - Government Communication
Laboratory - Internet Research, August 2004.
[27] B. Schwarz, S. K. Debray, and G. R. Andrews, "Disassembly of
executable code revisited", IEEE Ninth Working Conference on
Reverse Engineering, Richmond, October 2002.
[28] B. Schwarz, S. Debray, and G. Andrews. "PLTO: A Link-Time
Optimizer for the Intel IA-32 Architecture". 2001 Workshop on
Binary Translation (WBT-2001)", Barcelona, Spain, Sept. 2001.
[29] R. L. Sites, A. Chernoff, M. B. Kirk, M. P. Marks, and S.G.
Robinson, "Binary Translation", Digital Tech. Journal 4 , 4, 1992
[30] Z. Wang, K. Pierce, S. McFarling, "BMAT- A Binary Matching
Tool", Feedback-Directed Optimization (FDO2), Haifa, Israel,
November 1999.
[31] L. Xun, "A linux executable editing library", Masters
Dissertation, National University of Singapore, 1999.
http://www.geocities.com/fasterlu/leel.htm
[32] XCOFF File Format,
http://www.unet.univie.ac.at/aix/files/aixfiles/XCOFF.htm