Generation of Executable Code
from Source Program Files

Source Program Files

"Source" program files contain text (characters) that describe the data elements and the processing required to perform some particular information processing task.

The form that this textual description takes depends upon the specific "language" being used. In general such "languages" can be divided into two major categories: "high-level" languages and "low-level" (or "assembler") languages. In either case, the textual description must be converted, from text (characters), into the binary patterns actually used by the computer's internal circuits to specify and control the activities required. A "compiler" is a program that translates "high-level" source program files into these binary patterns (often called "machine-level" code; a "compiler" translates single "high-level" statements into several (often, many) "machine-level" instructions. An "assembler" is a similar, but much simpler, translating program that translates each "low-level" instruction into a single "machine-level" instruction; each "machine-level" action is represented by a single, simple "low-level" textual code, called a "mnemonic" code. Mnemonic coding provides a reasonable way of controlling the computer at the machine level, although it would be a tedious method for developing very large programs or systems.

Understanding the "assembler" process is necessary as a starting point for understanding the translation process of higher level languages (using "compilers") into instructions the computer can process,

In addition to the use of mnemonic codes to represent machine-level actions, assembler (or mnemonic) coding typically uses symbolic labels to represent addresses (including data addresses) that at the machine-level are identified as (numeric) bit patterns. When a specific memory location is required for use in an instruction, a symbolic reference to the corresponding label is used. A label can only correspond to a single (low-level) memory address, however, there may be many symbolic references to this label.

Translation of "mnemonic code" using symbolic addresses into machine-level instruction requires that the mnemonic (source) code file be read twice (a "two-pass" assembly process):

once to determine the memory address to associate with each symbolic reference (each label), and
a second time to replace symbolic references with the associated addresses (and translate operation codes)

Steps in Assembler to Machine Language Translation

Determining symbolic-actual address pairs - read the code in a "first pass" determining the memory location for each instruction and data item; specifically, the memory addresses of any symbolic labels should be recorded
Translation - read through the code again (the "second pass"); for each instruction, replace any symbolic address with its memory location as determined during the first pass, then translate the mnemonic operation and its address into a machine level instruction (using a table of mnemonic-to-machine codes)

Modularization of Program Development

Even once mnemonic code (or high level language code) has been translated into machine level instructions, there is generally a lot of work that may need to be done before we have a working program.

Most programs are developed in a "modular" fashion and are composed of routines (or "subroutines") that have been assembled or compiled into (separate) independent "object" files. Some of these subroutine object files may have been developed as integral components of the specific program being developed, but others may be "general purpose" routines (perhaps even supplied with the "assembler" or "compiler") that may be stored in composite library file or object subroutines.

All these files need to be combined into one complete program. Addresses will not match where they were assumed to belong by the "assembler" (or "compiler") since there would be no way for these programs to tell the size of other routines that would need to appear in memory locations ahead of any specific subroutine. All such addresses, therefore, will need to be modified to compensate for where the routines have actually been loaded into real memory.

In order to more fully explore this process, we will first need to consider how the "Little Man" Computer could be expanded to permit handling of "subroutines".

Subroutine Management

Support for the use of subroutines requires additions/extensions to the basic LMC structure:

Maintaining a "return" address - A subroutine may be "called" from several different locations and must be able to return to the next instruction after whichever instruction "called" the routine; "jumping" to a routine provides no information as to from where the "jump" originated. Every CALL of a subroutine needs to record a (different) return address somewhere. The LMC doesn't provide the usual "stack" on which such information can be pushed.
Call and Return operations - The "Call" operation must somehow save the address of the instruction that physically occurs immediately after the "Call"; a special "Return" instruction is normally employed to retrieve this "saved address" and set it up as the next instruction address. The "return address" may be saved in:
- a register (the calculator's "accumulator" in the LMC)
- a fixed memory location
- a memory location at a fixed "offset" from the "called" subroutine
- a memory location that is part of a hardware maintained "stack"
Argument-Parameter passing - Passing argument values to a subroutine and retrieving them in the subroutine as parameters involves some of the same memory location problems as maintaining the subroutine "return address". Where do you put the arguments before calling a subroutine? Where do you put the return value(s) before returning from a subroutine? Generally, the same methods are available, but the actual method used depends less upon the hardware level of support and more upon the use of a "protocol" agreed to by both the "calling" and the "called" routines.

The original "Little Man" Computer posed some significant restrictions in implementing an "extension" that would permit management of subroutines. Specifically:

Only one decimal digit (0) was not used by the original "Little Man" as a machine-level "operation code". This meant that, while it was possible to add an new instruction to "call" a subroutine, there were no numeric code values left to add a "return" instruction.
Establishing a method for saving the "return address" (the location where control should resume after completion of the subroutine) is difficult. The "Little Man" Computer supplies only a single "working" register, the Accumulator, and no easy way to use the contents of this register as an address. Furthermore, if the Accumulator were used to hold the "return address" when a call were made to a subroutine, this would eliminate the possibility of using the Accumulator to pass values to or return values from the subroutine.

LMC Subroutine Implementation Method:

The method selected for the extended LMC (called the "sonOfLMC") is to use the unused numeric code 0xy as the CALL xy instruction in a method very similar to the use of 9xy as the JMP xy instruction.

The difference between the CALL xy and JMP xy is that the CALL xy first stores a "return address" trick in the memory location that immediately precedes the location being CALL'ed, i.e. at memory location xy - 1. That "trick" being stored is a JMP instruction that returns to the location physically after the CALL. In other words, the JMP being stored uses the address currently in the Counter when the CALL is being executed.

For CALL xy to work, each subroutine must be coded with a mailbox that is left unused, located immediately before the instruction that is being CALL'd in the subroutine. This unused mailbox will receive the JMP trick put there by the CALL instruction.

When a subroutine has completed its task and wishes to return to the "calling" routine, it must JMP to this trick location (that is to the mailbox containing the JMP put there by the CALL that entered the subroutine). The JMP to the JMP will cause control to return to the statement physically following the original CALL.

Parameter passing and value return for subroutines

The method for passing parameter values to a subroutine is left unspecified by the structure of this extended version of the LMC. Parameter passing is a protocol to be decided upon between the two routines involved, but, in general, information sharing (passing argument values to the subroutine and returning result values to the calling routine) is normally achieved through the use of "global" labelled memory locations.

The main routine will typically copy values into global variables in the subroutine before calling it, and the subroutine will store return values in global variables before returning.

This "normal" method assumes that, if separately assembled (or compiled modules (routines) are used, there will be some method for identifying shared, labelled memory locations.

Separate Module Assembly (to Object Files)

As has been already noted, modules (or "subroutines") are generally assembled (or compiled) from "source" files into separate "object" files. This results in three major problems:

Lost Public Labels - As a source file is translated into machine code by the assembler program, all the label address information is lost. Machine code has no label information. Object files must therefore keep a separate table of Public Labels for use by the linker in identifying important address locations inside the module. Not all labels in the source code may be public. Local variable labels do not need to be preserved in the Public Label table. The label that is the start of a subroutine must be put in the Public Label table so that other programs can call the subroutine.
Calls to External Routines - Calls to routines not included with the source can not have their actual memory addresses filled in to the machine code, since those addresses are not known at the time the source translation is done. The object file must keep a list of these "unresolved external references" for later resolution by the linker program when the linker knows where in memory these external routines have been placed. The object file must include the name of the called external routine and the location within the calling routine where the external routine is called, so that the linker knows what memory locations to fix when the actual memory addresses are known.
Relocatable Address References - Instructions or data that were translated to machine code based on the assumption that the routine would be loaded starting at a specific address, usually zero, need to be modified if the routine is loaded at a different address. Not all machine code needs to be modified, only instructions or data that refer to addresses inside the routine need to be adjusted. The object file must contain a table of all locations containing addresses that must be relocated to reflect the actual load address used for the routine.

Object File Format

As a result of the above three problems, object files typically contain three tables to accompany the machine code. This means an object file contains four parts: machine code plus three tables:

a Public Label table containing a list of all Public Labels defined within the module, and their corresponding mailbox (memory) addresses. "Public" labels are labels that should be made available to other modules. All other labels are private, local labels that are not visible to other modules.
an External Reference table containing a list of all mailbox addresses that make references to public labels defined in other modules, plus the name of the actual label being referenced.
a Relocation Address table containing a list of all the addresses containing instructions that include address references to addresses within the module (based on the false assumption that the module would be loaded starting at address 00). All relocatable memory address references will need to be adjusted to reflect the true memory address values once the true starting address of the module has been established by the linker.

All the addresses in the above three tables are themselves subject to relocation if the subroutine is loaded at a non-zero address.

LMC Relocation and Linking Example

Suppose we want a simple program that outputs pairs of numbers: the first number of each pair output will be a count value starting at 000 and stopping before reaching a MAX of 010; the second number of each pair will be double the value of the first number. In order to allow the user time to read the numbers output, the program will pause after each output for a certain amount of time (performing some time wasting loop). (Note that the program stops before reaching the MAX value; the MAX value is not output.)

We will code this as 3 separate source files: a MAIN program and two subroutines Pause() and Dble(). Here is the pseudocode for the three modules:

        // MAIN program
        int count = 1
        int max = 10
        do {
           output count
           call Pause()
           call Dble(count)
           count += 1
        } while ( count < max );
        stop

        // PAUSE subroutine
        subroutine Pause() {
           int count = 0
           do {
              count -= 1
           } while ( count != 0 );
        }

        // DBLE subroutine
        subroutine Dble(arg) {
           output arg+arg  
           call Pause()
        }

Here are the three modules, translated in to LMC assembly language and then assembled into three object files:

Mnemonic Code

Object File - machine code and tables

; main program
Repeat  LDA Count  ; output count
        OUT
        CALL Pause
        LDA  Count
        STO  Arg   ; store subroutine argument
        CALL Dble
        LDA  Count ; count += 1
        ADD  One
        STO  Count
        SUB  Max   ; while ( count < max )
        SKP
        JMP  Repeat 
        HLT
Count   DAT  000
One     DAT  001
Max     DAT  010
Pause   EXT     ; Pause is an External
Arg     EXT     ; Arg is an External
Dble    EXT     ; Dble is an External

Public Label	Location
(none, or possibly "MAIN")

External Reference	Location
Pause	02
Arg	04
Dble	05

Relocatable Address Locations
00, 03, 06, 07, 08, 09, 11

Mnemonic Code

Object File - machine code and tables

; Pause subroutine
        DAT        ; CALL return address
Pause   LDA  Count ; start of subroutine
        SUB  One
        STO  Count
        SKZ        ; while ( count != 0 )
        JMP  Pause
        JMP  Pause-1
Count   DAT  000
One     DAT  001
        PUB  Pause ; Pause is a Public Label

Public Label	Location
Pause	01

External Reference	Location
(none)

Relocatable Address Locations
01, 02, 03, 05, 06

Mnemonic Code

Object File - machine code and tables

; Dble subroutine
Arg     DAT       ; input argument to Dble
        DAT       ; CALL return address
Dble    LDA  Arg  ; start of subroutine
        ADD  Arg  ; double the input value
        OUT
        CALL Pause
        JMP  Dble-1  
        PUB  Arg  ; Arg  is a Public Label
        PUB  Dble ; Dble is a Public Label
Pause   EXT       ; Pause is an External

Public Label	Location
Arg	00
Dble	02

External Reference	Location
Pause	05

Relocatable Address Locations
02, 03, 06

As is always the case with separately created modules, the main program and each subroutine have been coded as if none of the other code existed:

Each of the three modules starts at location zero. They can't all be loaded into memory at location zero; they have to be loaded into memory one after the other, so some of the modules (and their three tables) will have to be "relocated" by the linker to work at the new memory location.
Not all labels in a module should be made public. Some labels are only for local variables or local loops. Labels that the programmer wants to be made visible to other modules have to be made Public with the PUB keyword.
Both the main program and the Pause subroutine use memory locations with the labels "Count" and "One". Because these labels are kept private (local) to each module (not made Public), there is no confusion or conflict. These are different memory locations even though the names are the same; the programmers just happen to have used the same name in each piece of code. Since the label names are local to each piece of code and are never made Public, the linker ignores them. It would cause a linker error to have two identical public labels with different memory address locations.
There is no way for a separately created piece of code to know where in memory an External function or subroutine will be loaded. The keyword EXT is used to tell the assembler that this address is External and that it is up to the linker to resolve it when all the modules are loaded into memory.

If we collect all the Public Labels from all modules, we have Public Labels for Pause, Arg, and Dble. If we collect all the External References from all modules, we have External References for Pause, Arg, and Dble. Every External Reference can be matched with a Public Label, which means all the modules are present.

Link-Edit Processing or "Linking"

Combining multiple object files together to form a single executable program (often stored as a file) is the job of a program called a link-editor, or "Linker". The link process can be viewed as a sequence of five steps:

Copy the numeric codes from all object files into sequentially contiguous locations, keeping track of the address used for each routine's new memory start address. Typically the first object file is loaded starting at address zero and the other object files go at higher memory locations.
The tables in each object file were built assuming the object file was loaded at location zero. Since we have just loaded many subroutines at non-zero start addresses, we have to fix the addresses in the tables for each routine to reflect the new location of the subroutine in memory. Adjust the memory location specifications in the Public Label, External Reference, and Relocatable Address Location tables in each object file by adding the new memory load address of the subroutine to each location specification in each table.
Having loaded all the subroutines into memory, we now know exactly where in memory each public symbol resides and we can go back and fix up the external reference information in all the loaded routines. Adjust all external reference memory locations by adding the actual public address of the externally referenced public symbol to the address portion of instruction at that location.
The code in each object file was built assuming that the object file was loaded at location zero. Since we have just loaded many subroutines at non-zero start addresses, we have to fix each relocatable instruction in each subroutine to account for the move to a new location. Using the relocatable address locations table in each object file, adjust all relocatable address locations in each subroutine by adding the new memory address of the corresponding routine to the address operand at each relocatable location.
The program is now "linked" and ready to execute as a unit. Save the modified numeric code to an executable file.

We will expand each of the above five steps by running through this process with the previously generated object files:

The Linker copies the numeric instruction codes from each of the object files into contiguous (mailbox) memory locations. The order that the modules are placed in memory is not usually under the control of the programmer - the linker chooses. We choose to load Main followed by Pause followed by Dble (with a small line showing the boundary between each):

The main program loaded at zero; Pause loaded starting at memory address 16; Dble loaded starting at memory address 25. The linker builds a table indicating the actual location where each subroutine was loaded in memory:

Routine	Assembled to load at:	Actually Loaded At:
main routine	00	00
Pause subroutine	00	16
Dble subroutine	00	25

The linker will use these new memory locations to "relocate" addresses inside each subroutine, below.

We have put subroutines into memory at non-zero starting addresses, and we now have to fix the addresses in the object file tables to reflect the new memory location of each subroutine. We have three tables to fix in each of the object files: the Public Label table, the External Reference table, and the Relocatable Address Locations table.

The linker examines all the Public Label tables in all the object files and ensures that every public label is unique; a public/global label cannot refer to two different memory locations. In our example, the linker records three unique public labels - Pause, Dble, and Arg - with no conflicts. (The other labels that were in the original source files were not made public and so are not visible to the linker.)

Since the Public Label tables were built assuming that every object file loaded at location zero, the linker has to fix the addresses in the public tables to account for the new memory locations of each subroutine. The linker adds the new load address of the corresponding subroutine to the address of each public label in that subroutine's object file:

Public Label	Modified Location (based on load address)
Pause (in the Pause object file loaded at 16)	01 17 (=01+16)
Arg (in the Dble object file loaded at 25)	00 25 (=00+25)
Dble (in the Dble object file loaded at 25)	02 27 (=02+25)

This modified Public Label table now contains the correct memory location of every public label. The linker will use these locations to satisfy references to external symbols, below.

The linker examines all the external reference tables from all the object files and ensures that every external reference has a match in some Public Label table. If a match isn't found, then some object file is referring to some label for which there is no corresponding public label and the program cannot be linked. (Perhaps someone forgot to include a subroutine in the link process?)

In our example, the linker sees references to three external labels - Pause, Dble, and Arg - and all three have entries in some Public Label table.

Since the external reference tables were built assuming that every object file loaded at location zero, the linker has to fix the addresses in the tables to account for the actual memory locations of each subroutine. The linker adds the load address of the corresponding subroutine to the address of each external reference in that subroutine's object file:
- The external reference table from the main program contains three entries, but none of these entries need adjustment because the main routine was loaded at address zero, and adding zero to each of the addresses doesn't change anything.
- The external reference table from the Pause object file is empty. If there were any references, we would add the new memory location where Pause was loaded (memory location 16) to every entry in the table.
- We have loaded the Dble subroutine at memory location 25, which means that all the locations in the Dble external reference table must be modified by the addition of the Dble subroutine's load address (25) before they can be used. The external reference table from the Dble object file says that Dble CALLs the Pause subroutine - an external reference - at location 05. The linker adds 25 to that value in the External Reference table:
  
  External Reference (Dble) Modified Location (+25)
  
  Pause 05 30 (=05+25)

External Reference (Dble)	Modified Location (+25)
Pause	05 30 (=05+25)

The linker examines all the relocatable address locations tables in all the object files. Since the relocatable address locations tables were built assuming that every object file loaded at location zero, the linker has to fix the tables to account for the actual memory locations of each subroutine. The linker adds the load address of the corresponding subroutine to the address of each relocatable location in that subroutine's object file:

The main routine was loaded at location zero, so even though it has a relocatable address locations table, the linker would simply add zero to every address, resulting in no change to the table.

For the Pause subroutine (loaded starting at 16) Modified Relocatable Address Locations (+16)
01 17, 02 18, 03 19, 05 21, 06 22

For the Dble subroutine (loaded starting at 25) Modified Relocatable Address Locations (+25)
02 27, 03 28, 06 31

Now, the relocatable address locations have been adjusted to compensate for the actual load address of the subroutine.

Having loaded all the subroutines into memory, we now know exactly where in memory each public symbol resides and we can go back and fix up the unresolved external reference information in all the loaded routines:

The linker consults the external reference table from the main object file:

External Reference (main)	Location
Pause	02
Arg	04
Dble	05

Using the above table and the modified Public Label table, the instructions in (mailbox) addresses 02, 04, and 05 are changed by adding to the current contents the value of the actual public location addresses from the (modified) Public Label table:

CALL Pause
STO  Arg
CALL Dble

02: 000 017 (17 is Pause in the Public Label table)
04: 200 225 (25 is Arg in the Public Label table)
05: 000 027 (27 is Dble in the Public Label table)

Since the Pause object file contains no external references, no modifications are required inside Pause to fill in the addresses for unresolved external references.
The (modified) external reference table from the Dble object file is used to identify which locations within the Dble subroutine refer to external symbols whose location weren't known when Dble was assembled into object code. The (modified) external reference table for Dble tells the linker which memory locations inside Dble need to be fixed to refer to which external symbols:

External Reference (Dble) Modified Location (+25)

Pause 05 30 (=05+25)

The (modified) external reference table says that location 30 refers to Pause, so the linker looks up the real location of Pause in the (modified) Public Label table (17) and adds that value to Dble memory location 30:
CALL Pause
30: 000 017 (17 is Pause in the Public Label table)

External Reference (Dble)	Modified Location (+25)
Pause	05 30 (=05+25)

All external references have now been updated to refer to their correct memory locations.

Finally, the instructions at the locations identified by the (modified) relocatable address locations table inside each object file must be adjusted by the actual location where that specific subroutine was loaded. The contents of memory at the addresses specified in the (modified) relocation address locations table must be modified to compensate for the actual load address of the subroutine:

Since the main program's object file was assembled to load at location zero and was actually loaded at location zero, no address modifications are required for main. (Adding zero to every relocatable address doesn't change it.)

The Pause subroutine was coded to load at location zero but was actually loaded at location 16. The linker adds 16 to every relocatable address in the modified relocatable address locations table for Pause:

For the range of mailboxes of the Pause subroutine
Relocatable address locations: 17, 18, 19, 21, 22
Original values from the Pause object file	Relocated instructions based on module load at 16
01: 107 02: 408 03: 207 05: 901 06: 900	17: 123 (=07+16) 18: 424 (=08+16) 19: 223 (=07+16) 21: 917 (=01+16) 22: 916 (=00+16)

The Dble subroutine was coded to load at location zero but was actually loaded at location 25. The linker adds 25 to every relocatable address in the modified relocatable address locations table for Dble:

For the range of mailboxes of the Dble subroutine
Relocatable address locations: 27, 28, 31
Original values from the Dble object file	Relocated instructions based on module load at 25
02: 100 03: 300 06: 901	27: 100 125 (=00+25) 28: 300 325 (=00+25) 31: 901 926 (=01+25)

This completes the relocation of all the instructions in all the subroutines.

The linker has resolved all external references and relocated all relocatable code. At this point all address operands should be correct and the combined block of code properly executable. The code is output to an executable file, ready for loading and execution by the system.

Link Process Summary

Compiled and/or assembled routines in object file format are combined to form an executable program by a program called a link-editor or "Linker".
Three Object File Reference Tables - To link properly, each object file must contain three tables:
1. a table of Public Labels
2. a table of External References
3. a table of Relocatable Address Locations
The Load Process and Load Module Reference Table - Starting with a "main line" routine, the Linker copies every routine's machine code into memory, one module after the other. The routine's name and actual address where it is loaded in memory are maintained in a linker Load Module Reference table. All the addresses in the three tables in the object files are relocated to take into account the new module load addresses.
Resolving External References - The Linker goes through the object file external reference and Public Label tables and matches all external references to public labels. Unmatched/unresolved external references generate link errors.
Relocating all the Code - The Linker uses the relocatable address location tables to relocate all the code for subroutines loaded in non-zero memory locations.
Executable Memory Image or Executable File - Once the Linker has completed "linking" all the object files into a single, coherent executable "image", there are basically two options: the linked "image" can be executed immediately in memory or, as is more common, the image can be written out to disk as an "executable" file, to be loaded and run by the Operating System at a later time.

Load and Execute Operating System Functions

Assuming linked images are saved in files as "executable" programs, the Operating System must perform certain functions in order for the program to actually "be run".
Loading Executable File - the first (obvious) requirement is that the Operating System must perform instructions that copy the executable file into memory.
Further Address Relocation - unless the Operating System is designed so that programs are guaranteed to always be loaded into memory at some fixed address, the Operating System will need to establish a "base" register for automatic address relocation (this requires additional hardware support) or it must go through the entire program and relocate all address references (a second time) to enable the program to run at the new memory address where it is being loaded.
Giving Control to the Loaded Program - as a final act, the Operating System must reset the Program Instruction Counter to point to the first instruction of the program just loaded. At this point the Operating System effectively hands partial control of the computer to the program being executed.

Generation of Executable Codefrom Source Program Files

Source Program Files

Steps in Assembler to Machine Language Translation

Modularization of Program Development

Subroutine Management

LMC Subroutine Implementation Method:

Parameter passing and value return for subroutines

Separate Module Assembly (to Object Files)

Object File Format

LMC Relocation and Linking Example

Link-Edit Processing or "Linking"

Link Process Summary

Load and Execute Operating System Functions

Generation of Executable Code
from Source Program Files