



Greetings! This is Worldwide Developer Conference week, and DTS is busy working at the debugging lab. Thanks to a great manager, I am able to stay home half time, check out my new kid, and at the same time write sample code and another article for FrameWorks.
In this column, I'm taking a closer look at segmentation: MacApp segmentation strategies, virtual memory possibilities, and other issues related to segmentation. Some parts will be elementary for Veterans of the MacApp Psychic Wars, but I hope the science fiction references in some of the headings will keep such readers awake. Onward!
The Segmentation Makers sidebar provides guidelines on organizing your methods into segments.
A locked handle is also unpurgeable, so you don't need to worry about purging once you have locked the object in memory. MacApp's global function DBUnloadSeg makes handles, or CODE segments, unlocked-which makes the resource available for purging as well.
Methods are the actual routines that are stored in the CODE resources; data is stored either on the stack, in the heap or in the A5-world, depending. In many cases, calling a method whose segment is not currently stored in memory causes a segment load to occur that might have to move heap blocks in order to locate a place to put the new segment. This is one reason why calling a new method can suddenly make dereferencing bugs pop up.
It's UnloadAllSegments that displays "I really don't think that you want to unload a segment into which you are going to return!" in the debug window. This happens when MacApp has determined that you are about to unload a segment containing a method that you will need to return to later. (In other words, your stack currently references a method contained in the segment that is about to be unloaded.)
If you do a "find references to UnloadAllSegments" in MacBrowse, you'll find that this function is called from many places; from the initialization phase of the application, and here and there from within various loops.
You can do additional resource manipulation from the debugger with the Toggle, or 'X' flag. Inside this level, type 'S' so that each time a segment load occurs, MacApp will break into the debugger and print the routine name that triggered the segment load.
The 'U' flag turns off the automatic segment unloading done by the UnloadAllSegments routine. This is handy for finding out if your program's crashes have been due to mysterious jumps into unexpected routines: if a pointer to another method was suddenly being made invalid as that method's segment was unloaded, the stage is set for a healthy crash. Your code may stop crashing when you use the 'U' flag to turn automatic segment unloading off; if so, that's a good hint to look for problems of this kind.
The 'R' flag checks to see if the total size of the currently loaded resource exceeds the maximum. You can also set the maximum to a new value.
Link [options...] objectfile… ≥ progress.file -sn=oldSegName1=newSegName1 -sn=oldSegName2=newSegName2
This is useful when using MPW 3.2 with MacApp 2.0 or 2.0.1 to remap segments back to the Main segment. The story is, some of the standard functions in libraries in MPW 3.2 have been split from the Main segment. This causes serious heap fragmentation in your MacApp application-for example, when you try to call SetHandleSize(). To avoid this, make the following modifications in the Basic Definitions file:
SegmentMappings = ð SegmentMappings = ð #-- insert here -sn PASLIB=Main ð -sn STDCLIB=Main ð #-- end of insertion
This causes the errant routines in the Pascal and Standard C libraries to be remapped back into the Main segment. Also, change the lines in the MacApp.r file as shown in the MacApp.r changes sidebar.
Another solution is to use the linker to mark code resources from the libraries that were once in main as locked. These segments will then be loaded into memory and placed with the main segment, avoiding fragmentation problems. To do this, modify the user variable OtherLinkOptions in the Basic Definitions file:
OtherLinkOptions = ð -ra PASLIB=resLocked ð -ra STDCLIB=resLocked
You can also use this technique of locking code resources into memory in your MAMake files (OtherLinkOptions=)-but be careful with these experiments. Finally, you can use the linker to merge old segments into new segments with the -sg option:
-sg newSeg[=old[,old]…] # merge old segments into newSeg
The MPW Lib tool also contains options for changing segment names and merging segments into a segment, which is useful for cases where you only have access to the object code library.
The res! resource defines those segments that are always resident in the heap (segments are made permanently resident via a global function called SetResidentSegment). Note that even if you define a segment in the res! resource, because it's a handle it will still float around in memory.
One use for making segments permanently resident is for time-critical functions that are grouped together in a special segment; thus, loading the segment doesn't require overhead if the method is suddenly needed. For example, this could be used to reduce overhead for time-critical communication methods. Here's an example of a res! resource defined in the resource file:
resource 'res!' (kMyMacApp, purgeable) {
{ "AWriteConn";
"AReadConn";
"APoll";
#if qInspector && !qDebug
"GDebugConn";
#endif
#if qPerform
"GPerformanceComms";
#endif
};
};
With System 7, the Macintosh operating system now has virtual memory. However, there is still need for the programmer to specify segments in the code.
When the page fault occurs, the memory manager (with the help of the MMU) first frees up physical memory so it can load the needed page by selecting unused page frames and writing them to the backing store. Then it reads the page data for the needed frame. Thus, pages that aren't needed are usually residing on the hard disk. This event, usually called page-fault handling, requires special hardware in most cases.
If the VM does not find such a frame, it looks for a page that has been modified but not replaced recently. If this doesn't work, the VM tries to find the first frame that doesn't contain a page held in physical memory.
This algorithm is simple and fast. It doesn't need to know about application states, and it's space efficient. For more on this algorithm, read the article in the November 1989 issue of Byte, "Mac VM Revealed" by Phil Goldman.
The best solution is a combination of both virtual memory and segmentation. VM allows the user to run larger programs than would otherwise be possible, and if the developer organizes segments intelligently, excess paging is avoided.
There is still a need for some smart segmentation analysis tool which could produce segmentation directives by analyzing each function in order to figure out how to produce a segment organization such that methods are grouped together for maximum efficiency. VACUUM JUMP TABLES There is a known relation between jump table sizes and segmentation. For normal procedures and functions, a jump table entry is not needed if all calls to the routine are from the same segment. But if there are calls to other segments from the routine, jump table entries are needed. Examine the segmentation of your code; you might find places where a change in segmentation would eliminate jump table entries. The linkmap output (using the MABuild -LinkMap option) shows what each segment contains. With some effort you may shrink big jump tables and improve the performance of your whole application.
Some people worry that many Get and Set methods will increase the jump table entries considerably, but you can avoid this by using clever segmentation strategies or by using C++'s inline functions. Anyway, if your classes are infested with millions of Get and Set methods, perhaps it is time to examine the object. Is it really a structure in disguise?
Caching of results inside the class decreases the need for Get and Set calls. Plus, the major parts of an object can be placed inside one single segment for another performance improvement.You can use dumpobj to dump the object file and find information about each segment.
The Segment Loader has to fill the jump table with the right addresses when the segments are loaded in. When the segment is unloaded, the jump table has to be reset with information about the missing segment. MacApp has to make sure that memory is always available for data and unloaded segments. All this takes time, so clever segmentation does improve performance. For example, if important functions are in the same segment, you eliminate other segment loading events, and when MacApp calls UnloadAllSegments, a place is created for the next suite of segments needed.
The 68020 introduced 32-bit PC-relative branching (BSR.L statements), but that didn't help the Classic and other 68000-based Macintosh computers. Instead, MPW 3.2 makes use of branch islands. This simple, elegant concept is based on the implementation of PC-relative code-to-code references. The linker splits a large code segment up into smaller 32k areas by inserting branch islands. These branch islands serve as intermediate points that are within range of PC-relative jumps, thus making it possible to make a call across a segment that would otherwise result in a larger-than-32k jump.
Another new feature is "32-bit everything," which transparently removes the infamous limitations on code segment sizes, jump table sizes and the size of the global data areas. The drawback is a larger code size footprint and some slowdown due to increased load time for the larger code segments. But hey, look what you get!
32-bit everything is activated by using -model far options while compiling and linking. The Release Notes for MPW 3.2 will explain the implementation completely; basically, the trick is that the compilers generate instructions with 32-bit addresses (instead of the normal 16-bit offsets), and that these 32-bit addresses are relocated at load time by the segment load address or by the contents of A5, as appropriate.
Finally, one can generate larger than 32K jump tables using the -wrap option. This uses unused space in the global data area for additional jump table entries when it starts to get crowded inside the 32K segment. Programmers doing large MacApp programs will love this! However, at best this utility doubles the jump table size, and if your global data area is already filled with data, you're out of luck.
If you want to use these new 32-bit everything features from MPW 3.2 with MacApp, you'll need a couple of new MacApp library files. These are available on ETO #3, as well as most of the 32-bit everything support. ETO#4 will contain the final MPW 3.2 with tools and libraries to support these new features.



