Blog

CST 334 – Week 7: Persistence
This week we moved on to the next major part of OSTEP: persistence. The lectures covered OSTEP 36, 37, 39, and 40, from I/O devices up to a working file system.

OSTEP 36: I/O Devices

I/O devices are what make computers useful. Keyboards, mice, screens, network adapters, and hard drives are all examples of I/O devices.

There are two main types of I/O devices: block devices and character devices. A block device, like a hard drive, stores data in fixed-size chunks (blocks) with addresses, allowing random access. Character devices, like keyboards, stream data one byte at a time and are not addressable. The type of device determines how the OS and software interact with it.

The CPU talks to devices through a hardware interface, usually a set of registers. There is a status register to check if the device is busy, a command register to tell it what to do, and a data register to exchange information. The main question the lecture raised is: how does the OS use these registers without freezing while waiting?

Three strategies emerged: polling, interrupts, and DMA. Polling means the CPU keeps checking the status register in a loop until the device finishes. It is simple but wasteful because the CPU just spins. Interrupts let the OS issue a request and then do other work; when the device finishes, it signals the CPU to process the result. DMA (Direct Memory Access) lets the device move data into or out of memory without involving the CPU, which is much more efficient for large transfers.

The practical takeaway is that these mechanisms trade off simplicity for efficiency: polling is easiest to reason about, while interrupts and DMA better preserve CPU time for useful work.

OSTEP 37: Hard Drives

Hard drives have a long history in computing and will likely be around for a long time, even as SSDs continue to replace them. This lecture explained the physical mechanisms that make hard drives work.

A hard drive has one or more platters (spinning metal disks), and each platter has two surfaces. Moving arms with heads read or write data as the platter spins. Accessing data on a hard drive requires three steps: seek time (moving the arm to the right track), rotational delay (waiting for the desired sector to spin under the head), and transfer time (actually reading or writing the data).

There are two ways to address a disk block: CHS and LBA. CHS is an older format, which specifies the cylinder (track), head (surface), and sector. Modern drives use Logical Block Addressing (LBA), which is just a single number for each block. The analogy from lecture is that CHS is like a home address (with a number, street, and city), whereas LBA is like if every house had its own serial number.

Because seek time and rotational delay dominate the cost of a disk access, the order in which the OS services requests matters a lot. This is where disk scheduling comes in; a few scheduling algorithms were mentioned in the lecture and OSTEP.
- SSTF (Shortest Seek Time First): always pick the pending request closest to the current head position. Fast on average, but can starve requests that are far away.
- SCAN / C-SCAN (the elevator algorithm): sweep the head across the disk in one direction servicing requests along the way, then reverse. Fairer than SSTF and avoids starvation.
- SPTF (Shortest Positioning Time First): factors in both seek and rotation. Closer to optimal but requires the OS to know detailed disk geometry, so scheduling has largely moved into the drive itself.
The key idea is that the OS works with the hard drive to find the best algorithm, and modern drives increasingly use their own internal scheduling to approach the optimal result.

OSTEP 39: Files and Directories

This lecture introduced files and directories as the key abstractions for organizing data in a file system.

A file is simply a linear array of bytes that is associated with some low-level ID, like an inode number. The inode stores metadata about the file like permissions, size, and timestamps.

A directory is a special kind of file that contains a structured list of (name, inode) pairs. Listing a directory just reads the directory file and looks up the associated inodes.

A hard link is just another name that points to the same inode. This makes it so that a single file can appear to be multiple files. Removing one name does not delete the data as long as another name still points to the inode.

A symbolic link (symlink) points to a path rather than an inode. That difference matters: symlinks can cross file system boundaries, can point to directories, and can dangle (point to something that no longer exists). Hard links can do none of those things because they are tightly bound to the inode they reference on the same file system.

A partition is a raw slice of a physical disk, while a volume is a logical unit where a file system has been created. A logical volume can consist of a single partition or span multiple disks.

A mount attaches a file system into the operating system’s main directory tree, so its contents appear under some existing path.

OSTEP 40: File System Implementation

This chapter was split into two lectures covering the two main parts of a file system: the data structures used to store files, and the access methods used to interact with stored data.

Data structures

We walked through an example file system with 64 blocks to make the idea concrete. The superblock is the first block in the file system and holds some critical information: where inodes are stored, where data blocks start, how many inodes exist, and so on. The file system looks at the superblock first to understand its own layout.

A bitmap tracks which blocks are free and which are in use — one bit per block. To allocate a new block, the file system scans the bitmap, finds a free block, marks it as used, and gives it to the file. A similar bitmap tracks free inodes.

One way to understand a file system is that it is just a data structure laid out on disk. The same kinds of pointers and structures you might see in a memory-based data structure are written to disk blocks and linked together by block numbers instead of memory addresses. The OS reads the superblock to orient itself, then uses inode numbers and block pointers to navigate.

Access methods

The second half of the chapter walked through what actually happens on common syscalls. Three operations made the cost model clear:
- open(“/foo/bar”) is surprisingly expensive. The OS has to traverse the path one component at a time: read the root inode, read the root directory’s data block, find “foo”, read the foo inode, read its data block, find “bar”, read the bar inode. Every component potentially costs an inode read and a data block read.
- read() is cheaper on an already-open file. The OS already has the inode in memory, so it consults the inode’s block pointers, reads the right data block, and updates the file’s offset.
- write() can be the most expensive. If the write extends the file, the OS must consult the data bitmap to find a free block, update the bitmap, update the inode (new size, new block pointer), and then write the actual data: multiple I/Os for a single logical write.
What was most interesting is that every step in path traversal follows the same pattern: get an inode, read its data block, search for the next entry. There is no special logic for subdirectories because directories are just files. That made the whole traversal algorithm feel intuitive.

Reflection

This is the last section of the course, but there is still more to OSTEP that I may revisit to round out the picture. What changed for me is less the volume of facts and more how I think about software and its relationship to hardware.

Before this course, things like “the file system” or “memory” were opaque boxes, but now I’m able to think of them as data structures with access methods. A path lookup is not magic; it is a sequence of inode and block reads. A context switch is not magic; it is a saved set of registers and a scheduling decision.

The core problems that operating systems solve, like virtualization (abstraction), concurrency, and persistence, show up in many technical contexts. Knowing how things work at the OS level will help me to build a better intuitive understanding of these recurring problems, which should serve me well as I continue to develop my technical skillset.
April 21, 2026
CST 334 – Week 6: Synchronization
This week we continued learning about concurrency, focusing on synchronization. We covered OSTEP 30, 31, and 32, as well as a technique for writing concurrent code called the Anderson/Dahlin method.

OSTEP 30: Condition Variables

This lecture started with an example of why locks alone are sometimes not enough. Locks either let a thread proceed or block it from executing, but have no way to communicate anything beyond “this resource is in use.” If a thread needs to be notified when some condition becomes true, another mechanism is needed.

Condition variables allow signaling between threads. A thread can go to sleep, and another thread can send a signal for it to wake up, or broadcast to wake up multiple threads. The lecture used a temperature monitor as an example: with a lock, another thread monitoring a temperature variable would have to repeatedly poll for changes, wasting CPU time. With a condition variable, it can go to sleep and wake up when the temperature variable changes.

Bounded Buffer Problem

This was a hands-on lecture walking through a thread-safe queue as an example of a bounded buffer. The bounded buffer problem involves a shared, limited storage area (a buffer), where multiple producer threads add data, while consumer threads remove data from it. There are a few problems that can arise with bounded buffers:
1. Buffer overflow: a producer thread can overflow the buffer by adding data when it is full
2. Buffer underflow: a consumer may try to remove data while the buffer is empty
3. Race conditions: threads may attempt to access the buffer at the same time, leading to indeterminate behavior
The demo showed how race conditions can lead to indeterminate behavior. When threads adding to the queue raced against threads trying to read from it, the output could include -1 (returned when the queue is full or empty) in different orders across runs. To prevent race conditions, threads can continually poll the queue to make sure that it isn’t full or empty, but this is inefficient, since the spinning thread will waste CPU time.

To make the queue more efficient, the condition variables is_not_empty and is_not_full were added, signaling to waiting threads when the queue is ready, so that they don’t need to spin and waste CPU.

Anderson/Dahlin Method

This lecture presented a repeatable process for turning a regular class into a concurrent, thread-safe class. The Anderson/Dahlin method is a five-step procedure that uses only locks and condition variables.

First, design the class as normal (without any concurrency), then add concurrency by following these steps:
1. Add a single lock
2. Use the lock around critical sections of the code
3. Add condition variables for situations where a thread will need to wait
4. Add signal and broadcast calls to coordinate threads
5. Add wait conditions within loops, so threads will sleep when the condition variable is false, then check it again when it wakes up
One thing to keep in mind is that state variable names should make the condition obvious. For example, can_push and can_pop make it clear when the push and pop methods can be used.

OSTEP 31: Semaphores

Semaphores are a synchronization primitive that combines the behavior of a lock and a condition variable. It’s based on an integer value, which supports two operations after initialization: increment (signal) and decrement (wait). The integer value of the semaphore represents how many “slots” are available for threads to run.

The initialization value of the semaphore matters. Setting a semaphore to 1 makes it behave like a mutex. Setting it to 0 makes the first wait block immediately, which is useful for ordering.

Threads can also use semaphores to “rendezvous”, so that both threads will wait until the other thread has reached a certain point before proceeding. This can be accomplished by initializing the semaphore to 0, then having the thread signal with one semaphore, while it waits on the semaphore that the other thread will use to signal.

Synchronization Barriers

This lecture introduced the concept of a synchronization barrier, which is like the rendezvous pattern mentioned above, but extended to many threads.

The example in the lecture used two semaphores and a shared counter to create a synchronization barrier:
- A mutex semaphore (initialized to 1) ensures that updates to the counter are atomic
- A barrier semaphore (initialized to 0) blocks all threads until the last one arrives
- A shared count varaible tracks how many threads have checked in
Each thread locks the mutex, increments count, then unlocks it so the next thread can update the counter. After that, the thread waits at the barrier. When the count is equal to the number of threads, that means it’s the last thread, so it uses the barrier semaphore to signal to the other threads that they can all proceed.

The synchronization barrier example made it clear how semaphores can act as both a lock and a condition variable. In the same piece of code, one semaphore acts as a mutex and the other acts as a gate that signals when it is safe to move forward.

Reflection

After reviewing the lectures and readings from this week, I feel like I have a better understanding of concurrency and the kinds of bugs that can appear in concurrent programs. The code examples were especially helpful, since they made the problems more concrete and showed how synchronization tools like condition variables and semaphores can solve common issues. The Anderson/Dahlin method was also helpful, since it provides a structured process for designing thread-safe code.

Looking ahead, next week focuses on persistence, which should pair well with our group project on the Google File System. For the project, I’ve been learning about the challenges involved in designing a distributed file system, so it will be helpful to compare it to the challenges that are involved with a traditional file system.
April 14, 2026
CST 334 – Week 5: Concurrency
This week we switched gears from memory management to concurrency, covering OSTEP chapters 26, 27, 28, and 29.

OSTEP 26: Concurrency and Threads

Concurrency is an important topic for this course because the OS must handle simultaneous activities that can overlap or interfere with each other. For example, two separate processes may attempt to write to the same file at the same time, and the OS needs to be able to manage that somehow. Since concurrency is an issue that the OS has dealt with for a long time, operating systems have developed many techniques to manage concurrency.

One of those techniques is to use threads. A thread is similar to a separate instance of a process, except a multi-threaded program runs multiple threads inside the same process, giving the process multiple points of execution. Threads in a process share the same virtual address space with other threads, allowing threads to interact with each other and requiring less overhead than launching a new process. Each thread has its own execution context, which includes CPU registers and a call stack.

Threads introduce some benefits and drawbacks. One of the benefits is parallelism, which splits work across CPUs to improve performance. Another benefit is responsiveness, since separate threads can handle different tasks, which reduces blocking the program’s execution. Threads are useful when a process needs to handle multiple events simultaneously, while still allowing shared access to the same data.

One of the drawbacks of threads is that they can make a program’s execution indeterminate, since threads will not always execute in the same order. Even in a simple program, execution order can change across runs. A related issue introduced by threads is a race condition, which is a situation where multiple threads “race” to modify shared data. Code where shared data or resources are accessed is called a critical section, where race conditions can be an issue. Since the execution order of threads can vary, race conditions cause unpredictable behavior in programs.

OSTEP 27: Thread API

This chapter introduced the thread API in C. The pthread (POSIX thread) library offers several functions to manage concurrency with threads. A few of the important functions are:
- pthread_create creates a new thread
- pthread_join waits (blocking) until the specified thread completes, optionally capturing a pointer to its return value
- pthread_mutex_lock protects a critical section with a lock, blocking other threads from accessing it
- pthread_mutex_unlock releases control of the mutex so that other threads will be able to access it
- pthread_cond_wait puts a thread to sleep until it receives a signal
- pthread_cond_signal sends the signal to wake a sleeping thread
The main synchronization primitives covered in this course are locks and condition variables. Locks provide safety by preventing other threads from accessing critical sections of code, while condition variables enable coordination between threads through signaling.

OSTEP 28: Locks

Locks provide a mechanism to solve the race condition problem introduced by multiple threads, by allowing a thread to block other threads from accessing a shared resource (critical section) while the lock is held.

There are two main types of locks: spinning and blocking.
- Spinning locks cause a thread to repeatedly check if a lock is available, which is fast but wastes CPU
- Blocking locks cause a thread to go to sleep until the lock is released, which doesn’t waste CPU like a spinning lock, but requires more overhead to handle the context switch
The main goals of lock implementations are correctness, fairness, and performance.
- Correctness: does the lock actually provide mutual exclusion?
- Fairness: is there an orderly process for fulfilling lock requests?
- Performance: does the lock minimize overhead and maximize throughput?
Locks are designed to balance these goals using a combination of hardware support to ensure correctness, queues to provide fairness, and hybrid spinning/blocking strategies to maximize performance.

OSTEP 29: Lock-Based Concurrent Data Structures

The key idea with lock-based data structures is to make the data structure thread safe, so that multiple threads can use it and always produce the correct results. In general, this can be accomplished by integrating locks into the data structure around critical sections of the code, where shared data is modified.

When designing a lock-based data structure, a simple coarse-grained approach is usually a good starting point, where a single “big lock” can be placed around the entire data structure, which should produce a correct result. From there, a more fine-grained approach can be used, but more concurrency adds additional complexity and does not always improve performance.

Reflection

Concurrency is a practical topic that comes up a lot in programming, for example, in distributed systems, databases, and even a simple web server. It’s been helpful to learn more about it, since it’s an important topic to understand. We will continue to learn about concurrency in the next section of the course, and it looks like we’ll be working with threads more in the next programming assignment, which should be helpful.
April 7, 2026
CST 334 – Week 4: Memory Virtualization (cont’d)
This week we continued to learn about how the OS handles virtual memory. The lectures and readings got into the details of how the OS works with hardware to optimize performance and solve problems that arise when dealing with virtual memory, while the assignments helped with learning how the mechanisms involved work.

Topics covered this week

This week’s lectures covered OSTEP chapters 17, 19, 20, 21, and 22.

OSTEP 17: Free-Space Management

This chapter/lecture explained how the OS keeps track of free (unused) memory. The main design goals include the following:
- Correctness: in-use memory should never be corrupted, and free memory should actually be unused; this is the most important goal.
- Speed: memory allocation should be as fast as possible.
- Avoid memory overhead: minimize space used for metadata to maximize space for actual memory allocations.
- Satisfy as Many Requests as Possible: memory allocation should not fail and should not be a bottleneck.
To achieve these goals, the OS can choose from different allocation policies, like first fit, best fit, and worst fit.
- Best fit: finds the smallest chunk of free memory that fits.
- Worst fit: finds the largest chunk of free memory.
- First fit: finds the first chunk of free memory that will fit.
We also discussed two mechanisms, splitting and coalescing, that work together to reduce wasted memory space.
- Splitting breaks large blocks of memory into smaller ones when allocating, so that the free space isn’t wasted.
- Coalescing combines smaller blocks of adjacent free memory into larger blocks, so that it becomes contiguous free space.
Together, these mechanisms keep memory efficient and flexible.

OSTEP 19: Translation Lookaside Buffers (TLBs)

This chapter/lecture explained the TLB and why we need it. Paging involves an extra memory lookup, which makes it slow compared to accessing the memory directly. The TLB solves this problem by providing a hardware cache that returns cache hits in a single clock cycle.

The TLB uses the locality of memory access patterns to determine what is cached:
- Temporal locality: caching memory that was used recently.
- Spatial locality: caching memory addresses that are near recently used ones, since they are more likely to be accessed soon.
We also learned how the TLB performs an address translation:
1. Split virtual address (VA) into VPN and offset.
2. Check the TLB cache for the VPN → PFN mapping.
  - If there is a cache hit, use the PFN from the cache.
  - If there is a cache miss, get the PFN from the page table and cache it for the next translation.
3. Check if the memory access is valid.
4. Use PFN plus offset to form the physical address (PA).
OSTEP 20: Multi-Level Paging

This chapter/lecture introduced a problem with paging and its solution. Each process needs its own page table, and much of the address space for each process is not used, which leads to a lot of wasted memory. The solution is multi-level paging, which only allocates the memory that is necessary and stores an index of page table chunks in the page directory. Both the page directory index and page table index are included in the VPN.

This improves memory utilization, but requires an additional step to perform address translations. The TLB caches the PFN to speed up memory access, so only TLB cache misses will require the additional PD lookup step.

OSTEP 21-22: Swapping

This lecture covered chapters 21 & 22 in OSTEP, which describe the mechanisms and policies involved with swapping memory. Storage space in RAM is limited, so the OS must store some memory pages in secondary storage, which has a lot more space to work with.

Swapping introduces the present bit, which indicates whether a page is in memory, or if the OS must fetch the page from disk. The mechanics of swapping look like this:
1. Check TLB for VPN.
  - If VPN is in the TLB, return the PFN.
2. Otherwise, lookup the VPN in the page table.
3. Check if the present bit is set in the PTE.
  - If it is set, cache the PFN in the TLB and return the PFN.
  - If it is not set, raise a page fault.
The OS handles the page fault by fetching the page from disk and storing it in memory.

The OS needs a policy to determine which pages get swapped out when memory is full. There are a few different policies for this:
1. FIFO: evict the oldest page.
2. LRU: evict the least recently used page.
3. Random: select a random page to evict.
4. Optimal (Belady): evict the page that is used furthest in the future.
The Optimal (Belady) strategy is not achievable in practice, since we can’t know the future, but it can be helpful for comparison when benchmarking.

There were two metrics described that are useful for benchmarking:
1. Cache hit rate: the percentage of requests that result in cache hits. Calculated as (hits / total requests) where total requests can include or exclude compulsory misses, when the page is first requested.
2. Average Memory Access Time (AMAT): the average time to access a page in memory, considering cache hits and misses. Calculated as (hit rate * memory access time) + (miss rate * disk access time).
Reflection

After reviewing the course material for this week and the previous week (which also covered virtual memory), I’m feeling like I have a better handle on the topic. Looking ahead, next week we’re going to learn about concurrency, which should be an interesting shift from memory management into how the OS can do multiple things at the same time.

Concurrency in programming is something that I’m familiar with, but haven’t learned about in detail, so it will be interesting to find out how it’s handled at the OS level. I know that some programming languages support multithreading, while others do not, so it will be a good opportunity to explore this topic more.
March 31, 2026
CST 334 – Week 3: Virtual Memory Fundamentals
This week focused on virtual memory fundamentals and practical memory management in C. The lectures and assignments mostly dealt with understanding how virtual memory works at the OS and hardware level. The main idea was that the OS uses virtual memory to create the illusion that each process has its own private memory, while in reality all processes share the same physical RAM.

Topics covered this week

Some of the topics we covered this week:
- OSTEP 13: Address spaces
  - An address space is the memory view a process thinks it has. The process uses virtual addresses, while the hardware and OS translate those into physical addresses in RAM. This lets many processes run safely at once without each one needing to know where it really sits in physical memory.
  - A typical address space has three main regions
    
    Program Code: the compiled instructions of the program. This memory region is static (it doesn’t grow at runtime), so it sits at the top of the address space and stays fixed.
    
    Heap: memory for dynamically-allocated data. The heap grows upward (toward higher addresses) as the program requests more memory.
    
    Stack: used for function calls, local variables, and return values. The stack grows downward (toward lower addresses) with each function call and shrinks as functions return.
  - Memory management goals
    
    Transparency: programs should be able to act as if they have their own memory.
    
    Efficiency: translation should be fast and not waste too much memory.
    
    Protection: one process should not read/write another process’s memory.
- OSTEP 14: C Memory API
  - Stack vs. heap allocation
    
    Stack memory is managed automatically by the compiler, e.g., when declaring a local variable inside a function. Once the function returns, the stack frame is gone, and any pointer to it becomes invalid.
    
    Heap memory is for data that needs to outlive a single function call. It is managed manually with malloc and released with free.
  - Common memory bugs
    
    Forgetting to allocate memory: if you declare a pointer but never call malloc, then use the pointer, it will point to some random memory address.
    
    Allocating too little memory: not allocating enough memory leads to buffer overflow, which results in undefined behavior.
    
    Not initializing allocated memory: allocated memory should point to the intended data or NULL.
    
    Memory leaks: when allocated memory isn’t explicitly freed, it results in a memory leak, which will degrade performance, especially for long-running processes.
  - Garbage Collection
    
    Many programming languages (but not C) perform automatic garbage collection by freeing unreachable memory objects.
- OSTEP 15: Address Translation
  - Base-and-bounds. The memory management unit (MMU) has two special registers: a base register and a bounds register.
    
    Base register: where memory for a process starts
    
    Bounds register: how much memory is allocated for a process
    
    Address translation
    
    physical address = virtual address + base
- OSTEP 16: Segmentation
  - To avoid wasting memory by requiring large contiguous memory blocks, the OS will separate and store smaller parts of each process’s virtual memory into physical memory.
  - Segmentation treats the code, heap, and stack as separate parts called segments, each with its own base and bounds, which results in better memory utilization.
  - Segmentation introduces a bin packing problem: how should irregular pieces of virtual memory fit together in an optimal way?
- OSTEP 18: Paging
  - Paging sidesteps the bin packing problem by storing virtual memory into fixed-sized chunks instead of irregularly-sized segments.
  - Virtual memory is divided into pages, physical memory into page frames, and a per-process page table maps virtual page numbers (VPNs) to physical frame numbers (PFNs), allowing flexible placement in physical memory.
  - This avoids fragmentation problems, but introduces extra translation overhead.
Reflection

The biggest challenge this week was just dealing with the sheer volume of information. There are a lot of things that operating systems and hardware do to handle memory, and it’s a lot of dense technical information to unpack.

I think the most valuable takeaway this week was gaining more exposure to low-level systems programming and design. It’s been interesting to learn about the design challenges and technical solutions that are involved in making virtual memory work, and it looks like we will continue studying this topic in the next module, so we’ll have some more time to process it all.
March 24, 2026
CST 334 – Week 2: Processes and CPU Scheduling
This week we got into the nitty-gritty details of process management and CPU scheduling. The assignments this week mostly dealt with how the OS decides which process the CPU should run. The lecture material and readings fit together nicely, with each lecture corresponding to a chapter in OSTEP.

Topics covered this week

Some of the topics we covered this week were:
- OSTEP Chapter 4: Processes
  - Processes as the main CPU virtualization abstraction
  - Program versus process
    
    A program is just some data on a disk, which contains instructions on how to run the program, while a process is a program that is actually running. A process has an execution state and data in memory, and a single program can have multiple processes running at any given time
  - Machine state, including memory, registers, the program counter, and stack pointer
  - Process creation and process states
    
    The OS creates a process by reading the program’s code from disk and loading it into memory, allocating memory for the run-time stack and the heap, then allows the program execute by transferring control to the program’s main entrypoint
  - OS process-management data structures, like the task_struct in Linux, which contains information about the execution state of the process
  - Multiprogramming and context switching, which allow multiple processes to appear to run at the same time, keeping the system responsive for users
  - The difference between a mechanism and a policy
    
    A mechanism is how the OS performs some task, like context switching
    
    A policy describes which processes the mechanism should be applied to
- OSTEP Chapter 5: The C process API
  - The process tree represents processes as parent and child nodes, so that a single parent process can have many child processes.
  - OS process creation with the C Process API
    
    The C Process API provides functions to manage processes
    
    fork() clones the calling process, creating a new process that continues execution from the same point as the calling process
    
    wait() lets a parent pause until a child process finishes
    
    exec() replaces the calling process with a new process
- OSTEP Chapter 6: Limited direct execution
  - How does the OS control processes if the process controls the CPU while it is running?
    
    Limited Direct Execution allows processes to run directly on hardware, while allowing the OS to maintain control
  - Kernel mode vs. user mode
    
    Kernel mode allows all instructions to run, allowing direct access to hardware and memory
    
    User mode only allows non-privileged instructions to run, which is how programs normally run
  - Hardware interrupts allow devices to run instructions on the CPU through an interrupt handler
    
    If a program needs to run a privileged instruction, it uses a system call, which triggers a trap into the kernel. The kernel handles the request and then returns control back to user mode
- OSTEP Chapter 7: Process Scheduling
  - Scheduling metrics
    
    Turnaround time (TAT) measures how long it takes a job to finish after arriving
    
    Response time measures how long it takes the system to start running the job after it arrives.
  - Scheduling policies
    
    First-In First-Out (FIFO) or First-Come First-Served (FCFS)
    
    Processes are run sequentially, in the order that they arrive
    
    Shortest Job First (SJF)
    
    The processes that will take the least time to complete will run first
    
    Shortest Time to Completion First (STCF)
    
    Processes that will complete quickly can preempt longer-running processes
    
    Round Robin scheduling
    
    Each process runs for a short time slice, so that processes appear to be running simultaneously
- OSTEP Chapter 8: Multi-Level Feedback Queue (MLFQ)
  - The OS does not know when jobs will arrive or how long they will take, so it uses a multi-level feedback queue (MLFQ) to dynamically balance the priority of processes, optimizing responsiveness and turnaround time
    
    New jobs start at high priority, and the scheduler adjusts their priority based on how they behave
    
    Jobs that use a lot of CPU gradually move down, while jobs that frequently give up the CPU, such as interactive programs, can stay higher
  - Problems
    
    Starvation: lower-priority jobs will never run if there are always higher-priority jobs
    
    Gaming the system: processes may be able to maintain high priority through certain tricks, like requesting I/O before the end of its time slice
  - Solutions
    
    Boosting: intermittently move every job to highest-priority, so that it is guaranteed to run eventually
    
    CPU allotment: give a process a certain amount of CPU time before moving it to a lower priority
Reflection

The most challenging concept this week was calculating scheduling metrics, like turnaround time. This is especially tricky to calculate when dealing with time-sliced scheduling and different job arrival times, since the jobs will all consume part of the CPU’s time, and there are a lot of different factors to keep track of.

For the lab this week, I made Gantt charts with matplotlib to visualize scheduling metrics, which involved creating a scheduling simulation algorithm in Python. This helped a lot with understanding the decisions that the OS makes when running processes.

It’s been interesting to learn about how a program actually runs, and I now have a better understanding of things I’ve used before but hadn’t thought about much. For example, I hadn’t really thought about what happens when entering a command into a command prompt, but now I have a better idea of how a shell can handle user input with the C Process API.
March 16, 2026
CST 334 – Week 1: Introduction to Operating Systems
This week we got started with the course and covered a lot of ground: computer architecture, C programming, the history of Unix and Linux, and the core abstractions that operating systems provide. The reading from OSTEP Chapter 2 paired well with the lectures, and it looks like it should be a helpful resource for the rest of the course.

Topics covered this week

Some of the topics we covered were:
- Computer architecture
  - Basic components of a computer (Von Neumann model)
    
    The CPU fetches instructions from memory and then executes them.
    
    The system’s memory (RAM or volatile storage) stores instructions and the data that the instructions will operate on, while non-volatile storage, like a hard drive, stores persistent data.
    
    I/O devices act as an interface between the system and users, or other systems over the network.
  - Buses connect the different components of the system. Faster buses are closer to the CPU.
  - The storage hierarchy refers to different types of storage used by a computer. The fastest storage, like CPU registers, store the least amount of data, while slower storage, like hard drives, can store more data.
  - The operating system as a resource manager
    
    The OS allocates CPU time, memory, and disk space fairly and efficiently across all running processes.
  - Physical vs. virtual memory
    
    Physical memory is the actual RAM in the computer, while virtual memory is an operating system abstraction that gives each program its own address space and maps those addresses to physical memory.
  - The memory layout of a running program describes how the system stores different kinds of data needed to run a program. For example, the program code and heap are stored at the beginning of the memory block, while the call stack is stored at the end.
- Function calls and the stack
  - The system stores function calls in a LIFO memory structure called the call stack. In an x86 architecture, the esp register points to the top of the stack, and the ebp register points to the stack frame, which contains data related to the function call, e.g., arguments, local variables, and the return address.
- The C programming language is a simple, low-level language created by Dennis Ritchie at Bell Labs in 1973. Many operating systems, like Linux, are implemented in C, and many programming languages, like JavaScript and Go, use the familiar C syntax.
  - Data types
    
    C has primitive data types, like char, int, long, float, and double, which are the basis for derived types like pointers, arrays, and structs.
  - Pointers point to locations in memory, and are useful for passing a reference to some data that a function should modify.
  - Structs are structured data types, similar to objects, but without object-oriented behavior, like inheritance and methods. Structs can be used for creating custom data types, like a String.
  - Memory allocation in C is handled manually by functions like malloc, calloc, and free.
- Linux and the shell
  - Unix and Linux history
    
    Unix was created by Ken Thompson and Dennis Ritchie at Bell Labs in 1969.
    
    Linux was created by Linus Torvalds in 1991 as a clone of Unix. Many programs from the GNU software project were integrated with the Linux kernel to create the Linux operating system.
  - Shell scripting with bash
    
    The shell describes the command-line interface between the user and the OS.
    
    The shell reads commands entered by the user, interprets them, and then asks the operating system (usually through system calls) to execute the requested actions.
    
    The Bourne Again Shell (bash) is a commonly used shell.
Reflection

After reviewing these topics, the area that I’m least familiar with is low-level memory management. I tend to use higher-level programming languages, like Python and JavaScript, where memory management is handled automatically. We explored memory management a bit in this week’s programming assignment, so I’m getting some more experience with it.

Structs and pointers are a bit easier for me to understand, since these concepts are often used in higher-level programming languages. I’m familiar with the difference between passing by value vs. passing by reference and object-oriented programming, so I’ve been able to translate that knowledge to C programming.

I’m also pretty familiar with Linux, and recently earned my Linux+ certification. It looks like next week’s lesson will cover process management. I’m already familiar with Linux concepts like systemd, process kill signals, and controlling background tasks, so it will be interesting to see how that is handled at the hardware level.
March 9, 2026
CST 338: Week 7/8

Looking back on the HW1 assignment, I would probably approach this assignment in a similar way. Now that I have a better understanding of the game’s mechanics, however, I would implement some things differently. Although it wasn’t part of the assignment, creating a game with a Java Swing UI would be a fun challenge. Hangman is a visual game with simple graphical components, so it shouldn’t be too difficult to make a UI for it.

The thing I enjoyed most about this course was learning Kotlin. It’s a modern, expressive, multi-paradigm programming language, which allows for more flexibility than Java. Kotlin supports most Java language features, with a few exceptions, making it possible to follow both object-oriented and functional programming paradigms when designing an application.

I also learned a lot about Android development. Learning how to use Jetpack Compose has been pretty intuitive for me, since I’m already familiar with React. Before this course, I hadn’t done much native programming for mobile devices, having mostly focused on web development, so it was great to learn more about this area of software development. Maybe I’ll learn Swift next, since I have an iPhone, or spend some more time learning React Native to target Android and iOS.

Overall, this course has been helpful for learning more about software development and design patterns. There are a ton of resources for learning more, like Node.js Design Patterns and Patterns.dev. I’ll make sure to spend some more time continuing to learn about this topic.

December 15, 2025
CST 338: Week 5

I reviewed Glenn’s code for the Markov assignment. Glenn’s strategy was to first implement methods that didn’t call any other methods, then implement the methods that called those methods, which makes sense, since you can make sure the methods without any dependencies work before implementing the methods that call other methods.

My strategy was to first scaffold out the code according to the documentation, so that everything was syntactically correct. From there, I implemented the logic as it was described in the documentation, focusing on getting the tests to pass. After that, I reviewed the documentation again and checked the output to make sure it made sense.

I think both of our strategies are reasonable for this assignment. Glenn’s code follows the Google Java Style Guide, and so does my code.

I also worked on developing an Android app this week using Kotlin, which is Google’s recommended language for Android development, and I’ve been liking it so far. I’m finding Kotlin to be more expressive than Java, similar to TypeScript, which is my go-to programming language. Kotlin also works with Java, so both languages can work together in the same project.

Kotlin appears to be gaining more traction in recent years, so it could be a good language to learn.

December 2, 2025
CST 338: Week 4

Project 1 Code Review

I reviewed code from Glenn and Jack for Project 1. One of the first things I noticed from reviewing my teammates’ code was that I overlooked implementing a method in my own code. All of the tests passed on my code, and it seemed like everything was working correctly, but I guess that one slipped by me. It’s helpful to work with a team for this reason, since one person will often see what another person might miss. Other than that, we all completed the assignment and wrote code in a clear and understandable way.

Another thing I did differently was to handle the resistance between between ElementalTypes with a matrix, since the values were given as a table, so a matrix made sense. I also simplified the constructor and setPhrase function a bit to avoid leaking this out of the constructor.

My general strategy for approaching this assignment was to first read the documentation, then scaffold out out the classes and resolve any syntax errors so that the code would compile. From there, I worked on implementing the logic as it was described in the documentation, focusing on getting the tests to pass. After all the tests were passing, I reviewed the output from the tests to make sure it made sense, then cleaned up the code a bit and added some javadocs.

My teammates described their strategies as follows:

I just did what make sense to me at the moment and skip that I don’t get quickly. If I ran into a method that uses another method, I stop and try to do the other method first. Basically I’m solving methods that doesn’t need other method because it makes more sense to me workflow wise. I also try to debug and step into code anytime I can. That all I can think of.

– Glenn

When solving this assignment, I tried my best to follow the documentation and understand why certain tests didn’t pass. I would try different approaches until I figured out what worked and didn’t work.

– Jack

I think my strategy worked for the most part, but in the future, I’ll make sure to review the documentation again, even after the tests are passing, to catch anything that may have been overlooked. I used an automated Google Java Style Guide formatter, and my teammates’ code also appears to follow the Google Java Style Guide.

The most challenging part of this project was probably understanding and implementing the Monster battle mechanics, but it was also the most interesting. It might help to create a state diagram to explain how everything should work together.

Overall, I’m proud of completing this project by implementing code that is clear and concise, while learning more about object-oriented software design in the process. I didn’t do anything to celebrate for this project, but I will be sure to celebrate after the next project, especially since it’s the final project before the holiday break. Maybe I’ll buy a new bike for Christmas, or use the time to work on some personal projects.

November 24, 2025