Last Updated: 2025-03-10 Mon 14:38

CMSC216 Project 2: Bit Ops, Debugging, Data Structures

CODE/TEST DISTRIBUTION: p2-code.zip

VIDEO OVERVIEW: https://youtu.be/KpBDGYXgFSE

CHANGELOG:

Mon Mar 10 02:37:31 PM EDT 2025
As reported in Post 411, clarified that scale_from_ports() should return 0 in the "normal" case that the parameter scale fields are set.
Fri Mar 7 04:48:40 PM EST 2025

Post 394 reported a typo in the Project spec for Problem 3's Hashset: when an already-present element is added to the hashset, the response message in main() should be

  HS> add Jerry     # items are unique, no changes made on detecting duplicates
  Data already present, no changes made
  HS> add Beth
  Data already present, no changes made

Previously the example indicate 'Elem' rather than 'Data' was expected.

That post also reports missing documentation for hashset_save() which should print an error message like

  ERROR: could not open file '<FILENAME>'

if the call to fopen() the file to save fails.

Documentation in the project specification / code outline has been adjusted to clarify the intended behavior in both situations.

Wed Mar 5 10:00:08 AM EST 2025
An overview video of Project 2 has been posted here: https://youtu.be/KpBDGYXgFSE. The video surveys each of the problems in the project and demonstrates how to get code set up to be compatible with the test cases. Students wanting support to get a good start should consider following the video now or viewing it in the future as they start the different problems in the project.

1 Introduction

This project addresses more advanced C programming topics each in its own problem.

  1. Bit-level operations are common in C and systems programming. This assignment features a problem in which shifting and bitwise AND/OR-ing are required to complete the requirements.
  2. Debugging is also a critical skill enabled by the debugger. The second problem in the assignment makes use of the GNU Debugger, gdb, to work through a puzzle program requiring specific inputs to pass its "phases".
  3. Data structures pervade computing and getting some practice with them in C will improve one's skill at pointers and memory usage while also giving a great appreciation for garbage collected languages. A basic "set" application which tracks a collection of unique values is implemented, backed by a hash table.

Difficulty Notice

Past students have found this content to be more challenging than Project 1.

If you were pressed for time to finish Project 1, start this project as early as possible. Most students have found the first two problems (bit manipulations and debugging) only mildly challenging but run out of time on the larger third problem.

You have been warned.

2 Download Code and Setup

Download the code pack linked at the top of the page. Unzip this which will create a project folder. Create new files in this folder. Ultimately you will re-zip this folder to submit it.

File State Notes
scale_update.c CREATE Create this file and write required function in it to complete Problem 1
scale.h Provided Problem 1 header file
scale_main.c Provided Problem 1 main() function for scale hardware simulation
scale_sim.c Provided Problem 1 suppor functions to assist with simulating scale device
     
test_scale_update.c Testing Problem 1 function tests for scale_upate.c
test_scale.org Testing Problem 1 testing data file
testy Testing Problem 1 test running script
     
scale_examples.sh Provided Problem 1 script to produce a variety of scale examples
puzzlebox.c Provided Problem 2 Debugging problem
input.txt EDIT Problem 2 Input for puzzlebox, fill this in
hashset.h Provided Problem 3 header file
hashset_funcs.c CREATE Problem 3 functions to write
hashset_main.c CREATE Problem 3 main function to write
     
data/hashset_demo.script Data Problem 3 sample input scripts to main program
data/alphabet.script Data  
data/rm.hashset Data Problem 3 sample hash set save files
data/alphabet.hashset Data  
data/alphabet_expand.hashset Data  
     
test_hashmap.org Testing Problem 3 tests
Makefile Provided Build file to compile all programs
testy Testing Test running script

Makefile

As in the first assignment, a Makefile is provided as part of this project. The essential targets pertinent to each problem are described in those sections. The Makefile is equipped with a brief help listing:

>> make help
Typical usage is:
  > make                          # build all programs
  > make clean                    # remove all compiled items
  > make zip                      # create a zip file for submission
  > make prob1                    # built targets associated with problem 1
  > make test                     # run all tests
  > make test-prob2               # run test for problem 2
  > make test-prob2 testnum=5     # run problem 2 test #5 only
  > make update                   # download and install any updates to project files

Automated Tests

As in previous assignments, automated tests are provided and associated with problems. Each problem describes how to run tests associated with it. Generally this is done as before:

>> make test-prob1
gcc -Wall -Wno-comment -Werror -g  -c scale_main.c
gcc -Wall -Wno-comment -Werror -g  -c scale_update.c
...
./testy -o md test_scale.org 
===========================================================================
== test_scale.org : Problem 1 test_scale_update and scale_main tests
== Running 40 / 40 tests
 1) scale_from_ports_01      : ok
 2) scale_from_ports_02      : ok
 3) scale_from_ports_03      : ok
...

>> make test-prob2         # run "tests" associated with problem 2
... 
>> ./puzzlebox input.txt   # same as above: run puzzlebox to test answers

3 Problem 1: Digital Scale Simulation

3.1 Overview

You are tasked with writing code which will be run by a microcontroller in a digital scale. The hardware has the following relevant features.

  • An internal sensor detects weight that is on the scale. The value reported by the device is stored in a special hardware location that is presented as a global variable to C programs.
  • A digital display controlled by a hardware port. Setting a certain global variable will change the display to show information to a user of the device.
  • User code that you will need to write to update the display based on the value reported by the internal sensors
  • A simulator program with Makefile to test your code

Each feature is discussed in subsequent sections.

3.2 Scale Features and Tare

The scale has the following features.

  • Displays in either Ounces (OZ) or Pounds (LB); there are 16 ounces per pound
  • Weight sensor detects changes of up 0.1 ounces
  • Internal sensor that works for ranges of 0 to 99.9 ounces (and equivalent pounds)
  • Has a "Tare" feature described below

Tare Feature

Many tabletop digital scales have a handy "TARE" functionality (Arabic for "that which is removed"). It is used to measure the weight of something in a container conveniently as follows.

  1. The empty container is placed on the scale
  2. The "TARE" button is pressed which stores the weight of the container in scale memory
  3. The container may be removed and filled with the actual material to be weighed; during that time, the scale will read a Negative weight
  4. The filled container is placed on the scale again and the weight of only the material is shown (internally the previous container weight is subtracted from the measured weight and displayed)

The diagram below shows steps 1, 2, and 4.

tare-feature.png

Figure 1: Example of using the Tare feature of the scale to weigh the contents of a container. The empty bowl is first placed on the scale and the "Tare" button is pressed before weighing the filled bowl.

3.3 Scale Hardware Ports

The Scale has hardware that is exposed through several global variables in C programs. These are as follows.

Variable / Port Purpose
short SCALE_SENSOR_PORT The weight currently on the scale in tenths of ounces
short SCALE_TARE_PORT A previously saved sensor value, usually subtracted to account for a "container" weight
unsigned char SCALE_STAUS_PORT Indicates whether to display ounces or pounds, whether buttons are pressed
int SCALE_DISPLAY_PORT Each bit controls part of the LCD display allowing weights to be shown

Notes on each of the ports are included in the header file and are shown below. You do not need to define any of these variables as they already exist in the simulator / hardware. Some variables are read-only while others can be read and written. The objective is to write several functions which extract information from these variables and creates a display of the device state for a user. Details of how to go about this are described in subsequent functions.

extern short SCALE_SENSOR_PORT;
// Set by the hardware to indicate the load placed on the scale. The
// sensor is in units of 0.1 oz so that a sensor value of 238 would be
// 23.8 ounces. If negative or greater than 999, a hardware error has
// occurred. This port should only be read from.

extern short SCALE_TARE_PORT;
// Internal hardware that stores a previous sensor value for use later
// such as to subtract off the weight of a container on the scale to
// determine the weight of its contents. If this value is negative or
// greater than 999, a hardware error has occurred.  This port may be
// read or written to.

extern unsigned char SCALE_STATUS_PORT;
// A series of bits indicating various aspects of the scale's state as
// detected by the hardware. While there are 8 bits, only 2 of them
// are used by the software. The bits are as follows:
// 
// Bit 0: Unused
// Bit 1: Unused
// Bit 2: Unit Select, 0 for ounces (oz), 1 for pounds (lb)
// Bit 3: Unused
// Bit 4: Unused
// Bit 5: Tare Button, 1 for pressed, 0 for not
// Bit 6: Unused
// Bit 7: Unused
//
// This port should only be read from.

extern int SCALE_DISPLAY_PORT;
// Controls the scale display. Readable and writable. Routines
// wishing to change the display should alter the bits of this
// variable.

3.4 Scale Structs and Defines

During processing, the functions that are required need to pass several pieces of data around that describe the state of the device. These are summarized in a struct that is defined in the header file and shown below. The documentation associated with the struct and #define'd constants are used in functions to manipulate the machine state.

////////////////////////////////////////////////////////////////////////////////
// scale data and structs

#define UNIT_KG 1               // show kilograms
#define UNIT_LB 2               // show pounds (2.2 pounds per kg)

#define MODE_SHOW      1        // show measured weight as sensor_val - tare_val 
#define MODE_TARE      2        // store the sensor_val into tare_val, Tare Indicator on
#define MODE_ERROR     4        // error of some kind

// store state of the scale in a struct for easier access
typedef struct{
  short weight;            // calculated weight for use elsewhere
  char mode;               // one of the MODE_XX values indicating what to do/show
  char indicators;         // Bit field with 1's for different indicators
  // Bit 0: Ounce Indicator On/Off
  // Bit 1: Pound Indicator On/Off
  // Bit 2: Tare Indicator On/Off, on when stored Tare value is non-zero
} scale_t;

3.5 scale_update.c Outline

Periodically the microcontroller managing the scale will call scale_update() which should access the sensors/ports and set the display using several functions are used to subdivide this task. The functions to write for the problem are outlined below. Each includes documentation about what is to be done in the function and constraints on the implementation. Details about how to format the digital display described in later sections.

// scale_update.c: functions that read scale hardware, convert its
// encode its state in a struct, and adjusts the display to show the
// user information.

#include "scale.h"

int scale_from_ports(scale_t *scale);
// Uses the values in SCALE_SENSOR_PORT, SCALE_TARE_PORT, and
// SCALE_STATUS_PORT to set the fields of the `scale` struct to
// appropriate values. 
//
// Does bound checking so that if the SCALE_SENSOR_PORT or
// SCALE_TARE_PORT is out of range (see their docs) sets the fields of
// scale to be 0's, sets the scale->mode to be MODE_ERROR, and
// returns 1. 
//
// If SCALE_STATUS_PORT indicates the Tare button is being pressed
// (see that variable's docs), sets fields of `scale` to be 0, sets
// the mode to be MODE_TARE, and returns 2.
//
// Otherwise sets the fields of scale as follows.
//
// - mode :: set to MODE_SHOW to show the display weight as normal
//
// - weight :: set to (SCALE_SENSOR_PORT - SCALE_TARE_PORT).  Converts
//   to pounds if SCALE_STATUS_PORT indicates that is the desired
//   unit. Conversion is done using bit shifting / masking and rounds
//   pounds up if the ounces are halfway to the next pound.
//
// - indicators :: sets a bit to turn on either the Ounce or Pound
//   Indicator light. If the SCALE_TARE_PORT is not zero, sets a bit
//   to turn on the Tare indicator light. See documentation for the
//   scale_t struct for which bits control which indicator lights.
//
// Returns 0 when `scale` has its field set as above.
// 
// CONSTRAINTS: Uses only integer operations. No floating point
// operations are used as the target machine does not have a FPU. Does
// not use any math functions such as abs().
//
// CONSTRAINTS: Does not use integer division; uses bit shifting and
// masking to compute division by powers of two and perform rounding.
// 
// CONSTRAINT: Limit the complexity of code as much as possible. Do
// not use conditionals nested deeper than 2 levels. Seek to make the
// code as short, and simple as possible. Code longer than 70 lines
// may be penalized for complexity.

int scale_display_special(scale_t scale, int *display);
// Sets the bits pointed at by `display to indicate one of two
// "special" cases.
//
// If scale.mode is MODE_ERROR, sets the display to "ERR" to indicate
// that the weight is out of range or some other hardware error has
// occurred.
//
// If scale.mode is MODE_TARE, sets the display to "STOR" to indicate
// that the Tare button is being pressed and the current weight will be
// stored in the internal saved weight space.
//
// In both the above cases, only changes the bits pointed at by
// `display` and returns 0. Does not make any other changes to the
// machine state.
//
// If scale.mode is a different value that one of the above two cases,
// this function has been called in error and returns 1.
//
// CONSTRAINT: This function should not access or change any global
// variables. 
//
// CONSTRAINT: Limit the complexity of code as much as possible. Do
// not use conditionals nested deeper than 2 levels. Seek to make the
// code as short, and simple as possible. Code longer than 35 lines
// may be penalized for complexity.

int scale_display_weight(scale_t scale, int *display);
// Called when scale.mode is MODE_SHOW. If it is not, this function
// returns 1 immediately and makes no other changes.
//
// For scale.mode of MODE_SHOW, sets `display` bits to indicate the
// weight in scale.weight. Uses integer division to determine digits
// to be shown in `display` and whether a negative sign should be
// present. Sets bits of `display` according to the bits in
// scale.indicators to show ounces / pounds as the unit and indicate a
// non-zero tare. Returns 0 on completion.
//
// CONSTRAINT: This function should not access or change any global
// PORT variables. It may use a global array of bit patterns if deemed
// useful.
//
// CONSTRAINT: Limit the complexity of code as much as possible. Do
// not use conditionals nested deeper than 2 levels. Seek to make the
// code as short, and simple as possible. Code longer than 65 lines
// may be penalized for complexity.

int scale_update();
// Updates the state of scale using previously defined functions. Uses
// stack space for a scale_t struct which is set from the ports and
// then used to alter the display. If the MODE_TARE is indicated, then
// copies SCALE_SENSOR_PORT to SCALE_TARE_PORT. Returns 0 if the
// display is set properly and 1 if an error occurs while setting the
// display. 
//
// CONSTRAINT: Does not allocate any heap memory as malloc() is NOT
// available on the target microcontroller.  Uses stack and global
// memory only.
//
// CONSTRAINT: Limit the complexity of code as much as possible. Do
// not use conditionals nested deeper than 2 levels. Seek to make the
// code as short, and simple as possible. Code longer than 20 lines
// may be penalized for complexity.

3.6 Diagram of Display

The overall display is formed with several adjacent digits along with an other indicator lights display. The diagram below shows several examples of these along with the vale for the 32-bit DISPLAY_PORT that would create the display.

digital-scale-top-down.png

Figure 2: Examples of how the bits of the DISPLAY_PORT control which parts of the display are shown. Each digit follows the same pattern of bit to bar correspondence as the right-most with bits. That pattern appears in a later diagram. The highest order bits (28-31) control indicator lights as shown. Note that several conflicting indicators could be shown at the same time but the code setting the display should not do this normally.

Notice the following.

  • SCALE_DISPLAY_PORT is accessed as a 32-bit integer
  • 31 bits are used to control the full display
  • Bits 0-6 control the tenths place
  • Bits 7-13 control the ones place
  • Bits 14-20 control the tens place
  • Bits 21-27 control the hundreds place though this is only used for a negative sign
  • Bit 28 turns on the OZ indicator
  • Bit 29 turns on the LB indicator
  • Bit 30 turns on the -T (non-zero Tare) indicator
  • Bit 31 is not used

3.7 Digit Patterns

Each part of the LCD display is controlled with individual bits. The Digits follow a regular pattern where each bit darkens certain bars in the display allowing numbers and some letters to be shown. The pattern is repented in the diagram below and should likely be encoded somewhere in the program.

digital-digits-top-down.png

Figure 3: Bit to digit display bar correspondence. The 0th bit controls the top horizontal bar, the 1th bit controls the horizontal bar counter-clockwise from it, the 2th bit the middle bar, and so on working top to bottom, left to right. When a bit is 1 (set), the bar will be darkened while when the bit is 0 (clear) the bar will not be displayed (shown as empty). The combinations of bits shown are the only ones that arise when showing digits for times.

Notice the following.

  • Bits that are set (equal to 1) will turn on (darken) one bar of the display digit
  • Bits that are clear (equal to 0) will turn off one bar of the digit
  • 7 bits are required to control the display of one digit
  • The bits are arranged with the low order bit (bit 0) at the top and progress top-to-bottom and left-to-right
    • Bit 0 top
    • Bit 1 upper left
    • Bit 2 middle
    • Bit 3 upper right
    • Bit 4 lower left
    • Bit 5 bottom
    • Bit 6 lower right
  • The programmer can set bits to any pattern which will be displayed but only patterns shown in the figure correspond to digits of interest.

To create a mult-digit display, one can set bits, then shift those bits left to place the digit further to the left in the display. All digits follow the above pattern which allows for easy combination.

3.8 Implementation Hints

Bit Shifts VS Division

Review the correspondence between right shift operations and division and note that any bits shifted off would be the remainder of a division. This pattern allows one to "mask" low order bits and check them for being above / below a certain quantity to preform rounding. This is a required element of the project so demonstrate your understanding using the prescribed method, not division/remainder operations, not by adding to the original quantity.

Digit Bit Patterns

While creating the bit display pattern in scale_display_weight(), a good trick to use when create a series of bit patterns that correspond to the various digits. For example, according to the diagrams above, the bit pattern for 9 is 0b1101111. If a 9 should appear on the clock somewhere, this bit pattern should be shifted and combined with the existing bits in display so that a 9 will show.

It may be worthwhile to create bit patterns for other items like Letters or the Negative sign. This can be done in several ways but a common one is via defining "symbols" or global variables.

// "symbol" defining a bit pattern
#define SOME_PATTERN 0b1101001

// global variable defining a bit pattern
int OTHER_PATTERN = 0b0010110;

// global array of patterns
int my_patterns[2] = {
  0b0101010,
  0b1111110,
};

Here is detailed explanation of one approach to the laying out a multi-digit pattern.

  • Create an array of bit masks for each of the digits 0-9. The 0th element of the array contains a bit pattern like 0b1111011 which represents the bits that should be set for a 0 digit, the 1th element of this array has a pattern for one, and so one. There should be ten entries in this array in indices 0-9.
  • For a given number like 839, use modulo to determine the integer value for the ones, tens, and hundreds digits in the number. Call these variables something like ones and tens etc.. Each variable should be in the range 0-9.
  • Start with a state variable of 0 (all 0 bits).
  • Use ones to index into your array of masks to determine the bits that should be set for it. Combine the state variable with ones bit pattern.
  • Combining bits here is a logical operation of setting all bits that are one in the pattern to 1 in the state variable.
  • Use tens to index into your array of masks for the right pattern for that digit. The bits corresponding to the tens place is shifted to the left by 7 bits so shift the pattern to the left and combine it with the state variable.
  • Repeat this process for the remaining digits required (shifted by 14 and 21 bits left for the other two digits).
  • Account in some way for negativity: the negative sign must be placed if the value is less than 0
  • Keep in mind that the display has a few special cases: no leading 0 should be shown in the tens place if the quantity is small
  • Set the high bits (28-30) using shifts according to any indicators that are present.
  • The state variable should now be populated.

3.9 Scale Simulator and Main

While we do not have actual hardware with the features mentioned, a simulator for the system is in the provided files scale_main.c and scale_sim.c. You do not need to modify or understand code in either file to complete the HW though it will certainly expand you C skills to spend some time examining them. Several functions are created along with the global variables that represent parts of the hardware.

You can build simulator executable via make. The main() function in accepts a command line argument which specifies the internal state of the machine. It then calls the update function and shows the resulting display.

Below are several sample runs of the simulator which show the results of running on several example inputs.

########## EXAMPLE  1 ##########
>> ./scale_main
usage: ./scale_main {sensor_val} {tare_val} {oz|lb} {tare_no|tare_yes}
  sensor_val: integer


########## EXAMPLE  2 ##########
>> ./scale_main 0 0 oz tare_no
SCALE_SENSOR_PORT: 0
SCALE_TARE_PORT:   0
SCALE_STAUS_PORT:  0000 0000
index:                4    0
result = scale_from_ports(&scale);
result: 0
scale = {
  weight     = 0
  mode       = 1
  indicators = 00000 001
}
result = scale_update();
result: 0
SCALE_SENSOR_PORT: 0
SCALE_TARE_PORT:   0
SCALE_DISPLAY_PORT:
bits:  0 001 0000000 0000000 1111011 1111011
index:    28      21      14       7       0
Scale Display:
           ~~   ~~   
          |  | |  |  
                     
          |  | |  | OZ
           ~~ o ~~   


########## EXAMPLE  3 ##########
>> ./scale_main 5 0 oz tare_no
SCALE_SENSOR_PORT: 5
SCALE_TARE_PORT:   0
SCALE_STAUS_PORT:  0000 0000
index:                4    0
result = scale_from_ports(&scale);
result: 0
scale = {
  weight     = 5
  mode       = 1
  indicators = 00000 001
}
result = scale_update();
result: 0
SCALE_SENSOR_PORT: 5
SCALE_TARE_PORT:   0
SCALE_DISPLAY_PORT:
bits:  0 001 0000000 0000000 1111011 1100111
index:    28      21      14       7       0
Scale Display:
           ~~   ~~   
          |  | |     
                ~~   
          |  |    | OZ
           ~~ o ~~   


########## EXAMPLE  4 ##########
>> ./scale_main 839 0 oz tare_no
SCALE_SENSOR_PORT: 839
SCALE_TARE_PORT:   0
SCALE_STAUS_PORT:  0000 0000
index:                4    0
result = scale_from_ports(&scale);
result: 0
scale = {
  weight     = 839
  mode       = 1
  indicators = 00000 001
}
result = scale_update();
result: 0
SCALE_SENSOR_PORT: 839
SCALE_TARE_PORT:   0
SCALE_DISPLAY_PORT:
bits:  0 001 0000000 1111111 1101101 1101111
index:    28      21      14       7       0
Scale Display:
      ~~   ~~   ~~   
     |  |    | |  |  
      ~~   ~~   ~~   
     |  |    |    | OZ
      ~~   ~~ o ~~   


########## EXAMPLE  5 ##########
>> ./scale_main 839 125 oz tare_no
SCALE_SENSOR_PORT: 839
SCALE_TARE_PORT:   125
SCALE_STAUS_PORT:  0000 0000
index:                4    0
result = scale_from_ports(&scale);
result: 0
scale = {
  weight     = 714
  mode       = 1
  indicators = 00000 101
}
result = scale_update();
result: 0
SCALE_SENSOR_PORT: 839
SCALE_TARE_PORT:   125
SCALE_DISPLAY_PORT:
bits:  0 101 0000000 1001001 1001000 1001110
index:    28      21      14       7       0
Scale Display:
      ~~             
        |    | |  | -T
                ~~   
        |    |    | OZ
              o      


########## EXAMPLE  6 ##########
>> ./scale_main 0 125 oz tare_no
SCALE_SENSOR_PORT: 0
SCALE_TARE_PORT:   125
SCALE_STAUS_PORT:  0000 0000
index:                4    0
result = scale_from_ports(&scale);
result: 0
scale = {
  weight     = -125
  mode       = 1
  indicators = 00000 101
}
result = scale_update();
result: 0
SCALE_SENSOR_PORT: 0
SCALE_TARE_PORT:   125
SCALE_DISPLAY_PORT:
bits:  0 101 0000100 1001000 0111101 1100111
index:    28      21      14       7       0
Scale Display:
           ~~   ~~   
        |    | |    -T
 ~~        ~~   ~~   
        | |       | OZ
           ~~ o ~~   


########## EXAMPLE  7 ##########
>> ./scale_main 839 125 lb tare_no
SCALE_SENSOR_PORT: 839
SCALE_TARE_PORT:   125
SCALE_STAUS_PORT:  0000 0100
index:                4    0
result = scale_from_ports(&scale);
result: 0
scale = {
  weight     = 45
  mode       = 1
  indicators = 00000 110
}
result = scale_update();
result: 0
SCALE_SENSOR_PORT: 839
SCALE_TARE_PORT:   125
SCALE_DISPLAY_PORT:
bits:  0 110 0000000 0000000 1001110 1100111
index:    28      21      14       7       0
Scale Display:
                ~~   
          |  | |    -T
           ~~   ~~   
             |    |  
              o ~~  LB


########## EXAMPLE  8 ##########
>> ./scale_main 839 125 lb tare_yes
SCALE_SENSOR_PORT: 839
SCALE_TARE_PORT:   125
SCALE_STAUS_PORT:  0010 0100
index:                4    0
result = scale_from_ports(&scale);
result: 2
scale = {
  weight     = 0
  mode       = 2
  indicators = 00000 000
}
result = scale_update();
result: 0
SCALE_SENSOR_PORT: 839
SCALE_TARE_PORT:   839
SCALE_DISPLAY_PORT:
bits:  0 000 1100111 1001001 1111011 1011111
index:    28      21      14       7       0
Scale Display:
 ~~   ~~   ~~   ~~   
|       | |  | |  |  
 ~~             ~~   
   |    | |  | |  |  
 ~~        ~~ o      


########## EXAMPLE  9 ##########
>> ./scale_main -15 0 oz tare_no
SCALE_SENSOR_PORT: -15
SCALE_TARE_PORT:   0
SCALE_STAUS_PORT:  0000 0000
index:                4    0
result = scale_from_ports(&scale);
result: 1
scale = {
  weight     = 0
  mode       = 4
  indicators = 00000 000
}
result = scale_update();
result: 0
SCALE_SENSOR_PORT: -15
SCALE_TARE_PORT:   0
SCALE_DISPLAY_PORT:
bits:  0 000 0110111 1011111 1011111 0000000
index:    28      21      14       7       0
Scale Display:
 ~~   ~~   ~~        
|    |  | |  |       
 ~~   ~~   ~~        
|    |  | |  |       
 ~~           o      


########## EXAMPLE 10 ##########
>> ./scale_main 0 -48 lb tare_no
SCALE_SENSOR_PORT: 0
SCALE_TARE_PORT:   -48
SCALE_STAUS_PORT:  0000 0100
index:                4    0
result = scale_from_ports(&scale);
result: 1
scale = {
  weight     = 0
  mode       = 4
  indicators = 00000 000
}
result = scale_update();
result: 0
SCALE_SENSOR_PORT: 0
SCALE_TARE_PORT:   -48
SCALE_DISPLAY_PORT:
bits:  0 000 0110111 1011111 1011111 0000000
index:    28      21      14       7       0
Scale Display:
 ~~   ~~   ~~        
|    |  | |  |       
 ~~   ~~   ~~        
|    |  | |  |       
 ~~           o      


########## EXAMPLE 11 ##########
>> ./scale_main 0 -48 oz tare_yes
SCALE_SENSOR_PORT: 0
SCALE_TARE_PORT:   -48
SCALE_STAUS_PORT:  0010 0000
index:                4    0
result = scale_from_ports(&scale);
result: 1
scale = {
  weight     = 0
  mode       = 4
  indicators = 00000 000
}
result = scale_update();
result: 0
SCALE_SENSOR_PORT: 0
SCALE_TARE_PORT:   -48
SCALE_DISPLAY_PORT:
bits:  0 000 0110111 1011111 1011111 0000000
index:    28      21      14       7       0
Scale Display:
 ~~   ~~   ~~        
|    |  | |  |       
 ~~   ~~   ~~        
|    |  | |  |       
 ~~           o      


3.10 Problem 1 Grading Criteria   grading 30

Weight Criteria
  AUTOMATED TESTS
20 make test-prob1 which uses programs test_scale_update and scale_main
  Provides 40 tests for functions in scale_update.c
  0.5 point per test passed
  MANUAL INSPECTION of scale_update.c
2 Code Style: Functions adhere to CMSC 216 C Coding Style Guide which dictates
  reasonable indentation, commenting, consistency of curly usage, etc.
   
3 scale_from_ports()
  Clear effort to do error checking of out of bounds values.
  Clear flow to how each field of scale_t is calculated.
  Correctly setting fields of scale_t via pointer dereference or arrow operator.
  Makes use of shifting / masking / logical operations to convert to pounds and round up if needed
  Adherence to constraints: no floats, no float math ops, no deeply nested conditionals, code length
   
3 scale_display_special() / scale_display_weight()
  Clear error checking of for the appropriate MODE in both functions
  Use of bitwise operators to shift bits appropriately
  Use of bitwise operators to combine shifted digit bits
  Clear calculation digits to be displayed
  Use of bit masks stored in arrays with indices corresponding to digits to be displayed
  Clear derference/set of the integer pointed to by the display parameter
  Refrains from accessing / altering global PORTS
  Adherence to constraints: no floats, no math float ops, no deeply nested conditionals, code length
   
2 scale_update()
  Does not re-define / re-declare these variables, only checks / alters their values
  Use of previous two functions
  Adjusts SCALE_TARE_PORT when MODE_TARE is detected
  No use of malloc()

4 Problem 2: Debugging the Puzzlebox

4.1 Overview

The file puzzlebox.c contains source code that reads inputs from a file named on the command line. If the inputs are correct, points are awarded. If inputs are incorrect, error messages are printed.

The puzzlebox is arranged into a series of phases each of which has some points associated with it.

  • Not all phases must be completed to get full credit but the phases must done in order.
  • Each phase reads inputs from the file provided on the command line and performs calculations on them to see if they are "correct" according to various criteria
  • The very first input is your internet ID like kauf0095 (first part of your UMN email address). This input is used to add randomness to the puzzle so that your answers will be different from most other students. You must you use your own internet ID.

The purpose of this problem is get familiar with using a debugger. This is a powerful tool that pauses programs, allows internal values to be printed and code to be stepped through line by line. It is nearly essential to use as the code in puzzlebox is intentionally convoluted in places. Being able to pause execution and print values at various points make it much easier to solve the puzzles.

4.2 input.txt Input File

Name your input file input.txt and put your internet ID in it along with some numbers like 1 2 3. Then compile and run the puzzlebox program on it.

>> make puzzlebox                             # compile puzzlebox
gcc -Wall -g -c puzzlebox.c
gcc -Wall -g -o puzzlebox puzzlebox.o

>> cat input.txt                              # show contents of input.txt
kauf0095 1 2 3

>> puzzlebox input.txt
========================================
PROBLEM 2: Puzzlebox
UserID 'KAUF0095' accepted: hash value = 1936486779
PHASE 1: A puzzle you say? Challenge accepted!

Ah ah ah, you didn't say the magic word...
Failure: Double debugger burger, order up!

RESULTS: 0 / 30 points

This is automated with the Makefile target make test-prob2:

>> make test-prob2                       # compile/run puzzlebox with input.txt
gcc -Wall -g -c puzzlebox.c
gcc -Wall -g -o puzzlebox puzzlebox.o
puzzlebox input.txt
========================================
PROBLEM 2: Puzzlebox
UserID 'KAUF0095' accepted: hash value = 1936486779
PHASE 1: A puzzle you say? Challenge accepted!

Ah ah ah, you didn't say the magic word...
Failure: Double debugger burger, order up!

RESULTS: 0 / 30 points

These initial forays are not positive (0 / 30 points) but the real meat of the problem is in examining the source code and determining inputs for input.txt.

4.3 gdb The GNU Debugger

You will definitely need to use a debugger to solve the puzzlebox and gdb is the quintessential debugger associated with our compiler gcc. It is installed by default on all lab machines and is an easy install on must Linux machines.

For a quick overview of GDB, here are some resources

4.4 Typical Cycle

A typical cycle of working on puzzlebox will be the following.

  • Start the debugger with puzzlebox

      >> gdb -tui ./puzzlebox
    
  • Set the arguments array to input.txt

    set args input.txt
    
  • Set a breakpoint at a phase like phase03

    break phase03
    
  • Run the program

    run
    
  • Do some stepping / nexting

    step
    next
    
  • Print some variables

    print normal_val
    print/x val_in_hex
    print/t val_in_binary
    
  • Make some changes to input.txt in a different window
  • Re-run the program to see if things have changed favorably

    kill
    run
    

4.5 Kinds of Puzzles

The puzzles presented in the different phases make use of a variety of C program techniques which we have or will discuss including.

  • Bit-wise operations and their use in place of arithmetic
  • String and character array manipulations
  • Interpreting bits as one of several kinds of things (integer, float, character) through pointers and unions
  • More extensive C control structures like goto and labels

4.6 Tests for puzzlebox.c   grading 30

puzzlebox.c itself reports how many points one can earn at the end of its execution.

Currently there are 40 points available but 30 points is considered full credit.

If any additional points are earned, they will be counted as Makeup Credit for Projects to make up for credit lost on past or future projects. Your total score on All Projects cannot exceed 100% so any points beyond will simply be dropped.

Run the following command to 'test' puzzlebox:

make test-prob2

5 Problem 3: Hash Tables in C

5.1 Overview

This problem implements a rudimentary hash table application in C along with a program that uses it. The architecture is similar to a lab problem involving linked lists so reviewing that code can provide some insight.

Basic Hash Tables are covered in most introductory data structure classes such as CSCI 1913 and 1933. Should you need to review them, the following resources may prove useful.

Our particular variant will be a Hash Set which is a collection of elements (strings) which are each unique. The hash table is used to quickly add new elements and check whether an element is contained in the set. The hash table will use the separate chaining technique to resolve hash code conflicts: each hash "bucket" has a singly linked list of elements in that bucket.

Additionally, the hash set will maintain a record of the order elements are added by overlaying a linked list on elements as they are added. This allows for efficient traversal of the hash table elements for printing and gives a queue-like feel to parts of the table.

5.2 hashset_main Demonstration

The intent of this problem is to develop a small application which behaves as the following demo indicates.

>> make hashset_main                   # Compile hash table application
gcc -Wall -Wno-comment -Werror -g  -c hashset_main.c
gcc -Wall -Wno-comment -Werror -g  -c hashset_funcs.c
gcc -Wall -Wno-comment -Werror -g  -o hashset_main hashset_main.o hashset_funcs.o

>> ./hashset_main                      # start application, shows help message with commands
Hashset Application
Commands:
  hashcode <elem>  : prints out the numeric hash code for the given key (does not change the hash set)
  contains <elem>  : prints the value associated with the given element or NOT PRESENT
  add <elem>       : inserts the given element into the hash set, reports existing element
  print            : prints all elements in the hash set in the order they were addded
  structure        : prints detailed structure of the hash set
  clear            : reinitializes hash set to be empty with default size
  save <file>      : writes the contents of the hash set to the given file
  load <file>      : clears the current hash set and loads the one in the given file
  next_prime <int> : if <int> is prime, prints it, otherwise finds the next prime and prints it
  expand           : expands memory size of hash set to reduce its load factor
  quit             : exit the program

HS> print                              # prints items in order, empty initially

HS> structure                          # show hash table structure, empty intitially
elem_count: 0
table_size: 5
order_first: NULL
order_last : NULL
load_factor: 0.0000
[ 0] : 
[ 1] : 
[ 2] : 
[ 3] : 
[ 4] : 

HS> add Rick                           # add items table
HS> add Morty                             

HS> print                              # show items in order added
  1 Rick
  2 Morty

HS> structure                          # show internal structure of the hash table
elem_count: 2
table_size: 5
order_first: Rick
order_last : Morty
load_factor: 0.4000
[ 0] :                                 # some buckets are empty
[ 1] : 
[ 2] : 
[ 3] : {2546943 Rick >>Morty}          # nodes show hash code, string stored, next string
[ 4] : {74531189 Morty >>NULL}         # last item added, points to null

HS> add Summer                         # add more items
HS> add Jerry
HS> add Beth
HS> add Tinyrick

HS> print                              # print items in order added
  1 Rick
  2 Morty
  3 Summer
  4 Jerry
  5 Beth
  6 Tinyrick

HS> structure                          # show structure
elem_count: 6                          # NOTE: recently added items added to front of bucket lists
table_size: 5
order_first: Rick
order_last : Tinyrick
load_factor: 1.2000
[ 0] : 
[ 1] : {-1964728321 Tinyrick >>NULL} 
[ 2] : {2066967 Beth >>Tinyrick} 
[ 3] : {-1807340593 Summer >>Jerry} {2546943 Rick >>Morty} 
[ 4] : {71462654 Jerry >>Beth} {74531189 Morty >>Summer} 

HS> contains Jerry                     # check if items contained / not in hashset
FOUND: Jerry
HS> contains Rick
FOUND: Rick
HS> contains Unity
NOT PRESENT
HS> contains President
NOT PRESENT

HS> add Jerry                          # items are unique, no changes made on detecting duplicates
Data already present, no changes made
HS> add Beth
Data already present, no changes made

HS> save rm.hashset                    # save the hash table file rm.hashset, can be loaded later
HS> clear                              # clear the hash table
HS> print                              # show cleared hash table is empty now
HS> structure
elem_count: 0
table_size: 5
order_first: NULL
order_last : NULL
load_factor: 0.0000
[ 0] : 
[ 1] : 
[ 2] : 
[ 3] : 
[ 4] : 

HS> add Birdperson                     # add new items to the empty hash table
HS> add Squanchy
HS> structure                          # show contents
elem_count: 2
table_size: 5
order_first: Birdperson
order_last : Squanchy
load_factor: 0.4000
[ 0] : 
[ 1] : 
[ 2] : {-724361712 Squanchy >>NULL} 
[ 3] : {2082041198 Birdperson >>Squanchy} 
[ 4] : 

HS> load rm.hashset                    # clear current hash table and load from file
HS> structure                          # show contents identical to when 'save' was used
elem_count: 6
table_size: 5
order_first: Rick
order_last : Tinyrick
load_factor: 1.2000
[ 0] : 
[ 1] : {-1964728321 Tinyrick >>NULL} 
[ 2] : {2066967 Beth >>Tinyrick} 
[ 3] : {-1807340593 Summer >>Jerry} {2546943 Rick >>Morty} 
[ 4] : {71462654 Jerry >>Beth} {74531189 Morty >>Summer} 

HS> expand                             # expand the size of the hash table's internal array
HS> print                              # no effect on items/ordering in table
  1 Rick
  2 Morty
  3 Summer
  4 Jerry
  5 Beth
  6 Tinyrick

HS> structure                          # show internal sturcture: larger array now
elem_count: 6

table_size: 11                         # array size is next_prime(2*5+1) = 11
order_first: Rick
order_last : Tinyrick
load_factor: 0.5455
[ 0] : 
[ 1] : {2066967 Beth >>Tinyrick} 
[ 2] : 
[ 3] : {-1807340593 Summer >>Jerry} {2546943 Rick >>Morty} 
[ 4] : 
[ 5] : 
[ 6] : {-1964728321 Tinyrick >>NULL} 
[ 7] : {74531189 Morty >>Summer} 
[ 8] : 
[ 9] : 
[10] : {71462654 Jerry >>Beth} 

HS> add Birdperson                     # add additional items
HS> add Squanchy
HS> add Picklerick

HS> structure                          # show structure again
elem_count: 9
table_size: 11
order_first: Rick
order_last : Picklerick
load_factor: 0.8182
[ 0] : {761532761 Picklerick >>NULL} 
[ 1] : {2066967 Beth >>Tinyrick} 
[ 2] : 
[ 3] : {-1807340593 Summer >>Jerry} {2546943 Rick >>Morty} 
[ 4] : 
[ 5] : 
[ 6] : {2082041198 Birdperson >>Squanchy} {-1964728321 Tinyrick >>Birdperson} 
[ 7] : {74531189 Morty >>Summer} 
[ 8] : {-724361712 Squanchy >>Picklerick} 
[ 9] : 
[10] : {71462654 Jerry >>Beth} 

HS> expand                             # expand again
HS> structure                          # show internal structure
elem_count: 9
table_size: 23                         # size is next_prime(2*11 + 1) = 23
order_first: Rick
order_last : Picklerick
load_factor: 0.3913
[ 0] : 
[ 1] : {761532761 Picklerick >>NULL} 
[ 2] : 
[ 3] : {2066967 Beth >>Tinyrick} 
[ 4] : 
[ 5] : 
[ 6] : 
[ 7] : 
[ 8] : {2082041198 Birdperson >>Squanchy} 
[ 9] : 
[10] : 
[11] : {-724361712 Squanchy >>Picklerick} {-1964728321 Tinyrick >>Birdperson} {74531189 Morty >>Summer} 
[12] : 
[13] : 
[14] : 
[15] : {2546943 Rick >>Morty} 
[16] : 
[17] : 
[18] : {-1807340593 Summer >>Jerry} 
[19] : 
[20] : 
[21] : {71462654 Jerry >>Beth} 
[22] : 

HS> quit                               # exit the application
>>                                     # back to normal shell

5.3 Diagram of Hash Set Data structures

hashset2.png

Figure 4: Shows the structure of a hash set and the various C data structures (structs) which are involved. Details are discussed in subsequent sections.

5.4 Hash Table Data Types

We will favor separately chained hash sets which have the following features.

  • A hash set stores elements (strings) allowing fast lookup based on the hash code of the element. When a programming environment provides a "set" data structure, hash tables are frequently the backing data structure.
  • The hash set has a 'table' which is a fixed size array. Each array element is a pointer to a linked list node or NULL.
  • Elements in the hash table are stored in linked list nodes. Elements that hash to the same table location are stored in the list at that location
  • A hashcode() function is provided which generates an integer from a string and is the first step to determining where key/value is located.
  • The hash set also tracks the ordering that elements are added via additional pointers. The hash set tracks the first and last elements added allowing queue-like adds. Each node is part of a list for the table index it is stored at AND part of the overall list of all elements added in order.

Examine the header file hashset.h to see the two C structs which will be used to represent hash tables.

// Type for linked list nodes in hash set
typedef struct hashnode {
  struct hashnode *table_next;  // pointer to next node at table index of this node, NULL if last node
  struct hashnode *order_next;  // pointer to next node in insert order, NULL if last data added
  char data[128];               // string for the data stored in this node
} hsnode_t;

// Type of hash table
typedef struct {
  int data_count;               // number of data items (elements) in the table
  int table_size;               // how big is the table array
  hsnode_t **table;             // array of "buckets" which contain nodes
  hsnode_t *order_first;        // pointer to the first node that was added
  hsnode_t *order_last;         // pointer to last node that was added
} hashset_t;

5.5 Provided hash_code() Function

The code for hash_code() is provided below. All hash tables require a means of converting elements into a non-unique number. For consistency, we will use the below code which employs several interesting C tricks that will be discussed as the course progresses. Some of the advantages/disadvantages of the hash function are mentioned in the comments.

// PROVIDED: Compute a simple hash code for the given character
// string. Uses a "polyomial code" which uses all characters of the
// string to form the hash code. This is the same approach that many
// standard libraries like Java's String.hashcode() function
// use. ADVANTAGE: Gives good distribution for all strings with even
// single character variations having different hash
// codes. DISADVANTAGE: computing the hash code is linear in time on
// the length of the string.
int hashcode(char key[]){
  int hc = 0;
  for(int i=0; key[i]!='\0'; i++){
    hc = hc*31 + key[i];
  }
  return hc;
}

5.6 hashset_funcs.c Outline

Standard operations are supported by the hash set.

  • Adding an element
  • Determining if an element is contained or not in the set
  • Clearing the entire hash table
  • Printing all data in the hash table according to their add order
  • Printing the internal layout of the hash table
  • Expanding the memory used by the hash table to improve lookup performance.

These are broken down into the following C functions which you will need to implement in hashset_funcs.c except for the first hashcode() function which is provided.

// hashset_funcs.c: utility functions for operating on hash sets. Most
// functions are used in the hashset_main.c which provides an
// application to work with the functions.

#include "hashset.h"

// PROVIDED: Compute a simple hash code for the given character
// string. Uses a "polyomial code" which uses all characters of the
// string to form the hash code. This is the same approach that many
// standard libraries like Java's String.hashcode() function
// use. ADVANTAGE: Gives good distribution for all strings with even
// single character variations having different hash
// codes. DISADVANTAGE: computing the hash code is linear in time on
// the length of the string.
int hashcode(char key[]){
  int hc = 0;
  for(int i=0; key[i]!='\0'; i++){
    hc = hc*31 + key[i];
  }
  return hc;
}

void hashset_init(hashset_t *hs, int table_size);
// Initialize the hash set 'hs' to have given size and data_count
// 0. Ensures that the 'table' field is initialized to an array of
// size 'table_size' and is filled with NULLs. Also ensures that the
// first/last pointers are initialized to NULL

int hashset_contains(hashset_t *hs, char data[]);
// Returns 1 if the parameter `data` is in the hash set and 0
// otherwise. Uses hashcode() and field `table_size` to determine
// which index in table to search.  Iterates through the list at that
// table index using strcmp() to check for `data`. NOTE: The
// `hashcode()` function may return positive or negative
// values. Negative values are negated to make them positive. The
// "bucket" (index in hs->table) for `data` is determined by with
// 'hashcode(key) modulo table_size'.

int hashset_add(hashset_t *hs, char data[]);
// If the data is already present in the hash set, makes no changes
// to the hash set and returns 0. hashset_contains() may be used for
// this. Otherwise determines the bucket to add `data` at via the same
// process as in hashset_contains() and adds it to the FRONT of the
// list at that table index. Adjusts the `hs->order_last` pointer to
// append the new data to the ordered list of data. If this is the
// first data added, also adjsuts the `hs->first` pointer. Updates the
// `data_count` field and returns 1 to indicate a successful addition.
//
// NOTE: Adding data at the front of each bucket list allows much
// simplified logic that does not need any looping/iteration.

void hashset_free_fields(hashset_t *hs);
// De-allocates nodes/table for `hs`. Iterates through the ordered
// list of the hash set starting at the `order_first` field and
// de-allocates all nodes in the list. Also free's the `table`
// field. Sets all relevant fields to 0 or NULL as appropriate to
// indicate that the hash set has no more usable space. Does NOT
// attempt to de-allocate the `hs` itself as it may not be
// heap-allocated (e.g. in the stack or a global).

void hashset_show_structure(hashset_t *hs);
// Displays detailed structure of the hash set. Shows stats for the
// hash set as below including the load factor (data count divided
// by table_size) to 4 digits of accuracy.  Then shows each table
// array index ("bucket") on its own line with the linked list of
// data in the bucket on the same line. 
// 
// EXAMPLE:
// data_count: 4
// table_size: 5
// order_first: Rick
// order_last : Tinyrick
// load_factor: 0.8000
// [ 0] : {7738144525137111380 Tinyrick >>NULL} 
// [ 1] : 
// [ 2] : 
// [ 3] : {125779953153363 Summer >>Tinyrick} {1801677138 Rick >>Morty} 
// [ 4] : {521644699469 Morty >>Summer} 
//
// NOTES:
// - Uses format specifier "[%2d] : " to print the table indices
// - Nodes in buckets have the following format:
//   {1415930697 IceT >>Goldenfold}
//    |          |       |        
//    |          |       +-> order_next->data OR NULL if last node
//    |          +->`data` string     
//    +-> hashcode("IceT"), print using format "%ld" for 64-bit longs
// 

void hashset_write_data_ordered(hashset_t *hs, FILE *out);
// Outputs all data in the hash set according to the order they
// were added. Starts at the `order_first` field and iterates through
// the list defined there. Each data is printed on its own line
// preceded by its add position with 1 for the first data, 2 for the
// second, etc. Prints output to `FILE *out` which should be an open
// handle. NOTE: the output can be printed to the terminal screen by
// passing in the `stdout` file handle for `out`.

void hashset_save(hashset_t *hs, char *filename);
// Writes the given hash set to the given `filename` so that it can be
// loaded later.  Opens the file and writes its 'table_size' and
// 'data_count' to the file. Then uses the hashset_write_data_ordered()
// function to output all data in the hash set into the file.
// EXAMPLE FILE:
// 
// 5 6
//   1 Rick
//   2 Morty
//   3 Summer
//   4 Jerry
//   5 Beth
//   6 Tinyrick
// 
// First two numbers are the 'table_size' and 'data_count' field and
// remaining text is the output of hashset_write_data_ordered();
// e.g. insertion position and data.
//
// If the `filename` cannot be openened and fopen() returns NULL,
// prints the message
//
//  ERROR: could not open file '<FILENAME>'
//
// and returns immediately.

int hashset_load(hashset_t *hs, char *filename);
// Loads a hash set file created with hashset_save(). If the file
// cannot be opened, prints the message
// 
// ERROR: could not open file 'somefile.hs'
//
// and returns 0 without changing anything. Otherwise clears out the
// current hash set `hs`, initializes a new one based on the size
// present in the file, and adds all data from the file into the new
// hash set. Ignores the indices at the start of each line and uses
// hashset_add() to insert data in the order they appear in the
// file. Returns 1 on successful loading (FIXED: previously indicated
// a different return value on success) . This function does no error
// checking of the contents of the file so if they are corrupted, it
// may cause an application to crash or loop infinitely.

int next_prime(int num);
// If 'num' is a prime number, returns 'num'. Otherwise, returns the
// first prime that is larger than 'num'. Uses a simple algorithm to
// calculate primeness: check if any number between 2 and (num/2)
// divide num. If none do, it is prime. If not, tries next odd number
// above num. Loops this approach until a prime number is located and
// returns this. Used to ensure that hash table_size stays prime which
// theoretically distributes data better among the array indices
// of the table.

void hashset_expand(hashset_t *hs);
// Allocates a new, larger area of memory for the `table` field and
// re-adds all current data to it. The size of the new table is
// next_prime(2*table_size+1) which keeps the size prime.  After
// allocating the new table, all table entries are initialized to NULL
// then the old table is iterated through and all data are added to
// the new table according to their hash code. The memory for the old
// table is de-allocated and the new table assigned to the hash set
// fields "table" and "table_size".  This function increases
// "table_size" while keeping "data_count" the same thereby reducing
// the load of the hash table. Ensures that the memory associated with
// the old table is free()'d. Makes NO special effort to preserve old
// nodes: re-adds everything into the new table and then frees the old
// one along with its nodes. Uses functions such as hashset_init(),
// hashset_add(), hashset_free_fields() to accomplish the transfer.

5.7 hashset_main.c: main function / application

In hashset_main.c implement an interactive program which allows users to type commands to manipulate the hash set.

The provided Makefile should compile the hashset_main program as follows.

>> make hashset_main                   # Compile hash table application
gcc -Wall -Wno-comment -Werror -g  -c hashset_main.c
gcc -Wall -Wno-comment -Werror -g  -c hashset_funcs.c
gcc -Wall -Wno-comment -Werror -g  -o hashset_main hashset_main.o hashset_funcs.o

>> ./hashset_main                      # start application, shows help message with commands
Hashset Application
Commands:
  hashcode <data>  : prints out the numeric hash code for the given key (does not change the hash set)
  contains <data>  : prints FOUND if data is in the set NOT PRESENT otherwise
  add <data>       : inserts the given data into the hash set, reports existing data
  print            : prints all data in the hash set in the order they were addded
  structure        : prints detailed structure of the hash set
  clear            : reinitializes hash set to be empty with default size
  save <file>      : writes the contents of the hash set to the given file
  load <file>      : clears the current hash set and loads the one in the given file
  next_prime <int> : if <int> is prime, prints it, otherwise finds the next prime and prints it
  expand           : expands memory size of hash set to reduce its load factor
  quit             : exit the program
HS> 

The following sections provide some implementation details.

Read commands, Execute

The basic flow of hashset_main.c follows the same pattern that code for a lab exercise demonstrates. A good way to get started on the main application is to copy over the lab solution and begin modifying it.

  • Create a hashset_t variable, likely on the stack as a local variable in main()
  • Start a loop that terminates when the user exits or there is no more input
  • Each time the user enters a string, read it and check for one of the built-in commands
  • On identifying the command, potentially read another string if needed (commands like add and contains)
  • Call an appropriate hashset_XXX() function to handle the command

Supported Commands

To indicate to users of the program the supported commands, use the following code to print out the initial option list.

  printf("Hashset Application\n");
  printf("Commands:\n");
  printf("  hashcode <data>  : prints out the numeric hash code for the given key (does not change the hash set)\n");
  printf("  contains <data>  : prints FOUND if data is in the set NOT PRESENT otherwise\n");
  printf("  add <data>       : inserts the given data into the hash set, reports existing data\n");
  printf("  print            : prints all data in the hash set in the order they were addded\n");
  printf("  structure        : prints detailed structure of the hash set\n");
  printf("  clear            : reinitializes hash set to be empty with default size\n");
  printf("  save <file>      : writes the contents of the hash set to the given file\n");
  printf("  load <file>      : clears the current hash set and loads the one in the given file\n");
  printf("  next_prime <int> : if <int> is prime, prints it, otherwise finds the next prime and prints it\n");
  printf("  expand           : expands memory size of hash set to reduce its load factor\n");
  printf("  quit             : exit the program\n");

Clearing Hash Table

When issuing the clear command, the contents of the current hash set should be eliminated and the hash set reinitialized to the default size. This is most easily done via.

  1. Using the hashset_free_fields() function
  2. Using hashset_init() with the HASHSET_DEFAULT_TABLE_SIZE constant defined in hashset.h.

Paths to Files for Saving and Loading

Saving and loading data involves writing to files. The names associated with the files must be specified as a path so that if files are to be saved or loaded from subdirectories, include this as part of the path. For example, the rm.hashset file is provided in the data/ directory and once loading functionality is code, it can be loaded as follows.

>> ./hashset_main
...
HS> load data/rm.hashset       # load existing hashset save file
HS> print
  1 Rick
  2 Morty
  3 Summer
  4 Jerry
  5 Beth
  6 Tinyrick

HS> structure                  # show structure
elem_count: 6
table_size: 5
order_first: Rick
order_last : Tinyrick
load_factor: 1.2000
[ 0] : 
[ 1] : {-1964728321 Tinyrick >>NULL} 
[ 2] : {2066967 Beth >>Tinyrick} 
[ 3] : {-1807340593 Summer >>Jerry} {2546943 Rick >>Morty} 
[ 4] : {71462654 Jerry >>Beth} {74531189 Morty >>Summer} 

HS> expand                     # expand
HS> add Birdperson             # add new items
HS> add Goldenfold

HS> save cur.hashset           # save to file in current directory

HS> clear                      # empty hash table
HS> print                      # show emptiness

HS> load cur.hashset           # reload
HS> print                      # items restored
  1 Rick
  2 Morty
  3 Summer
  4 Jerry
  5 Beth
  6 Tinyrick
  7 Birdperson
  8 Goldenfold
HS> save data/new.hashset      # create another save file in data/ dir

Failing to Load Files

If a file cannot be opened with the load 'file.hm' command, the underlying hashset_load() should print an error of the form

ERROR: could not open file 'somefile.hm'

and the hash_main.c main loop should NOT change the hash set and print the message

load failed

in response to the error. Below is a demonstration of this from the automated tests.

...
HS> print                      # current contents of set
  1 Rick
  2 Morty
  3 Summer
  4 Jerry
  5 Beth
  6 Tinyrick
  7 Birdperson
  8 Goldenfold

HS> load notthere.hashset      # attempt to load file that doesn't exit
ERROR: could not open file 'notthere.hashset'
load failed

HS> print                      # failed load does not modify current set
  1 Rick
  2 Morty
  3 Summer
  4 Jerry
  5 Beth
  6 Tinyrick
  7 Birdperson
  8 Goldenfold
HS> 

Command Echoing: -echo option to hashset_main

Some users may wish to "script" this the main program: allow it to process input commands from a file rather than from the user typing. This notion is discussed in a lab and should be reviewed if it is unfamiliar.

An example of an input script is in data/hashset_demo.script which looks like a lot of user commands:

print
structure
add Rick
add Morty
print
structure
add Summer
add Jerry
add Beth
add Tinyrick
print
structure
contains Jerry
contains Rick
contains Unity
contains President
save rm.hashset
clear
print
structure
add Birdperson
add Squanchy
structure
load rm.hashset
structure
expand
print
structure
add Birdperson
add Squanchy
add Picklerick
structure
expand
structure
bye

The file can be fed directly to the program without needing type it using Unix pipes as per the following:

>> cat data/hashset_demo.script | ./hashset_main -echo

Notice the use of a command line argument for hashset_main: the -echo option. This is a REQUIRED feature which prints commands typed by users to the screen. To implement this, do the following.

Model the structure of command echoing after what is shown in related Lab work.

  • Use a variable in main() to indicate whether command echoing is on or off; set it to off by default
  • Check when main() starts whether command line argument 1 is set to -echo in which case turn echoing on. A condition like the following is useful.

      if(argc > 1 && strcmp("-echo",argv[1])==0) {
    
  • Each command should check for echoing and print the command being run along with any arguments to it. This technique is demonstrated in related lab work.

It will take some work to get the exact placement of echoes correct but will ultimately lead to nice results that involve LITTLE typing like the example below.

>> cat hashset_demo.script | hashset_main -echo 
Hashset Application
Commands:
  hashcode <data>  : prints out the numeric hash code for the given key (does not change the hash set)
  contains <data>  : prints FOUND if data is in the set NOT PRESENT otherwise
  add <data>       : inserts the given data into the hash set, reports existing data
  print            : prints all data in the hash set in the order they were addded
  structure        : prints detailed structure of the hash set
  clear            : reinitializes hash set to be empty with default size
  save <file>      : writes the contents of the hash set to the given file
  load <file>      : clears the current hash set and loads the one in the given file
  next_prime <int> : if <int> is prime, prints it, otherwise finds the next prime and prints it
  expand           : expands memory size of hash set to reduce its load factor
  quit             : exit the program
HS> print

HS> structure
data_count: 0
table_size: 5
order_first: NULL
order_last : NULL
load_factor: 0.0000
[ 0] : 
[ 1] : 
[ 2] : 
[ 3] : 
[ 4] : 

HS> add Rick
HS> add Morty
HS> print
  1 Rick
  2 Morty
...

5.8 Grading Criteria for Problem 3   grading 40

The following criteria will be checked in this problem.

Weight Criteria
  AUTOMATED TESTS via make test-prob3
20 test_hashset.org tests
  20 tests for hashset_main executable, exercises functions in hashset_funcs.c
  All tests run under Valgrind
  1 point per test passed
  Code compiles without warnings (make clean followed by make test-prob3 gives no errors/warnings)
  NOTE: run individual tests via make test-prob3 testnum=4
  MANUAL INSPECTION
15 hashset_funcs.c
  Clear commenting indicating intent during functions "calculate table index" or "iterate through list"
  Relatively short functions: nothing needs to be too long (50 lines tops)
  Clearly written iteration loops to traverse linked lists in hash table functions
  Use of strcmp() to check data during hashset_contains() operation
  Use of hashset_contains() during hashset_add() to check for duplicates
  Concise and commented code for hashset_add() which calculates table index and inserts new node at the front the list via table_next pointers
  Maintains the order_first/order_last fields during add operation
  hashset_free_fields() frees all list nodes and the table array, assigns 0/NULL to all fields
  Uses the ordered list of data to easily free all nodes during hashset_free_fields()
  hashset_save() makes use of hashset_write() with an open file handle to avoid redundant code
  hashset_load() checks that files open successfully before using hashset_free_fields() to de-allocate memory
  File closed after finished during saving and loading
  next_prime() is clearly written and easy to follow; follows the algorithm described in the function documentation
  hashset_expand() uses other functions in such as next_prime(), hashset_add() and hashset_free_fields() to avoid repeating code.
   
5 hashset_main.c
  Comments indicating what high-level command is being implemented in each part of main
  Clear structure of main loop, reading commands, selecting actions
  Use of string functions to identify which command to execute
  Clear efforts to honor -echo option and echo commands
  Clear effort to free all memory prior to exit by clearing hashset on exit

6 Project Submission

6.1 Submit to Gradescope

Refer to the Project 1 instructions and adapt them for details of how to submit to Gradescope. In summary they are

  1. Type make zip in the project directory to create p2-complete.zip
  2. Log into Gradescope, select Project 2, and upload p2-complete.zip

6.2 Late Policies

You may wish to review the policy on late project submission which will cost 1 Engagement Point per day late. No projects will be accepted more than 48 hours after the deadline.

https://www.cs.umd.edu/~profk/216/syllabus.html#late-submission


Web Accessibility
Author: Chris Kauffman (profk@umd.edu)
Date: 2025-03-10 Mon 14:38