# Linuxgraphy by Strabo 'define expected-reading 38 min 'define created 19 January 2023 'define edited 04 March 2023 [$pagenav] ### Custom assets *[Licensing information in the last slide, at the bottom of the page]* .bulleted - [url ./img/unix-layers.png UNIX layers image] - [url ./img/unix-dir-tree.png UNIX directory tree] - [url ./mll-set.sh mll-set.sh] ### Presented at :defs ; [$bg] Unofficial [url https://fmi.uni-sofia.bg/en FMI] Computer Science Discord server   *[04.03.2023 || 14:50 - 17:20]* : In front of a tiny audience of other uni students. Exact revision [url https://gitlab.com/Syndamia/syndamiadotcom/-/blob/6feed223ff48fe00cf12fcd63f3f56179c298cae/src/talks/linuxgraphy-by-strabo/index.md here]. ; [$bg] Unofficial [url https://fmi.uni-sofia.bg/en FMI] Computer Science Discord server   *[22.01.2023 || 18:00 - 21:00]* : In front of a tiny audience of other uni students. Exact revision [url https://gitlab.com/Syndamia/syndamiadotcom/-/blob/6eb7a8fff01c6814253a0e738efb53734d297be9/src/talks/linuxgraphy-by-strabo/index.md here]. [$presentation-controller] :title_slide # Linuxgraphy by Strabo *[Made by [url https://syndamia.com Syndamia]]* [$br4] :sqrow [$br1] :font22 2000 years ago [url https://en.wikipedia.org/wiki/Strabo Strabo] published [url https://en.wikipedia.org/wiki/Geography Geography], in which he laid out the world as it was known to the Romans and Greeks. [$br1] Today, I'll lay out Linux as it is known to me. [image https://upload.wikimedia.org/wikipedia/commons/thumb/e/ed/Strabo.jpg/199px-Strabo.jpg] [$br1] :slide35 # Contents :sqrow :column :numbered 1. Historical context 2. Structure of UNIX-likes .numbered 1. Shell 2. UNIX-style file system 2. Kernel [$br2] 3. Linux from the inside .numbered 1. Shells 2. Files and directories 3. Common configuration files 4. System management 4. Software licensing :column :numbered 5. What makes up a useful OS .numbered 1. Bootloader 2. Init system 3. Package manager 4. Desktop manager 6. War on distros [$br4] 7. Demos - virtual machine and setup from zero :slide # 1. Historical context [$br1] #### 1945 - 1955 .p You directly loaded instructions into memory and let it execute your "code" (processor instructions). Nothing else ran on the machine. .p This process was labor intensive: a qualified operator loaded your program, dumped the memory contents, remove any external media, reset the machine and load the next job. #### 1956 - 1959 .p Computers became faster, but a lot of time was spend on managing jobs. .p [url https://en.wikipedia.org/wiki/Resident_monitor resident monitors]: very small programs, which always *[resided]* in memory, and *[monitored]* what the state of the current job was.\n Jobs were loaded in series ([url https://en.wikipedia.org/wiki/Batch_processing batches]). When the current job finished, memory would be dumped and the next one would automatically be loaded and started. :slide #### 1960s [$br1] .p Peripherals (tapes, punch cards, ...) were extremely slow.\n [url https://en.wikipedia.org/wiki/Computer_multitasking#Multiprogramming Multiprogramming]: the current job is waiting for a peripheral, another job would be started. .p [url https://en.wikipedia.org/wiki/Sabre_%28travel_reservation_system%29 Travel reservation system] by American Airlines: where travel agents would search, price and book services. A computer system now had to support: .bulleted - User accounts with concurrent access - Fast response time and ease of use - Storage organization, also following the rise of disks - Data communication links (often phone lines) [$br1] .p More and more business started using computers, thanks to [url https://en.wikipedia.org/wiki/Minicomputer minicomputers], so demand for OS software increased. :slide #### 1964 - 1969 .p IBM's [url https://en.wikipedia.org/wiki/IBM_System/360 System/360] line of computers, each with expansion capabilities, backwards compatibility, all under one instruction set and operating system. .p [url https://en.wikipedia.org/wiki/Multics Multics], an influential operating system that was designed for a General Electric mainframe. Some of it's novel ideas include, but are not limited to: :bulleted - [url https://en.wikipedia.org/wiki/Time-sharing Time-sharing]: sharing resources of a singular machine among many users (via [url https://en.wikipedia.org/wiki/Multiprogramming multiprogramming] and [url https://en.wikipedia.org/wiki/Computer_multitasking multi-tasking]) - [url https://en.wikipedia.org/wiki/Single-level_store Single level store] (aka [url https://en.wikipedia.org/wiki/Virtual_memory virtual memory]): Processes would write into contiguous memory, but in reality it is scattered and managed by the operating system. - [url https://en.wikipedia.org/wiki/Protection_ring Protection rings]: Allowing different levels of access to different resources (programs). In general, this also often requires a [url https://en.wikipedia.org/wiki/Protected_mode CPU protected mode]. - [url https://en.wikipedia.org/wiki/Directory_%28computing%29 Hierarchical file system]: Arbitrary directories with subdirectories, rather than just having files next to each other or having directories which could only contain files. - [url https://en.wikipedia.org/wiki/Shell_%28computing%29 Command processor] (that is separated from the OS): More on that later :slide #### 1969: Death of Multics and birth of UNIX .p Ultimately Multics grew too large, becoming unusable and unmaintainable.\n In the beginning, programming was done mostly by [url https://en.wikipedia.org/wiki/Bell_Labs Bell Labs], with [url https://en.wikipedia.org/wiki/Ken_Thompson Ken Thompson] being one of the developers. [$br1] :sqrow .font18 [image https://upload.wikimedia.org/wikipedia/commons/f/f8/Ken-Thompson-2019.png]\n *[Ken Thompson image by National Inventors Hall of Fame]* [image https://upload.wikimedia.org/wikipedia/commons/1/1b/Ken_Thompson_and_Dennis_Ritchie--1973.jpg ]\n *[Ken Thompson and Dennis Ritchie in 1973]* :column .p Thompson still had some desire to work on operating systems, after Bell Labs pulled out. The tools he made while rewriting [url https://en.wikipedia.org/wiki/Space_Travel_%28video_game%29 his own video game] on a PDP-7 lead to a whole operating system: UNIX. .p Based, expanded on and improved upon many of Multic's ideas, UNIX became the father of modern OSs. :slide # 2. Structure of UNIX-likes .p Ignoring [url https://unix.stackexchange.com/questions/14368/difference-between-posix-single-unix-specification-and-open-group-base-specifi/14415 standards] and [url https://unix.stackexchange.com/questions/87686/what-makes-an-operating-system-unix-like specifications], in my opinion, from the point of a user, a UNIX-like OS has the following components: [$br2] .ordered 1. =[shell]=: A programmable replaceable command-line interpreter, with utilities for managing the whole system and support for pipelines and I/O redirection 2. =[UNIX-style file system]=: [url https://en.wikipedia.org/wiki/Unix_filesystem Filesystem] as a single rooted tree, objects (nodes) in the file system are [url https://en.wikipedia.org/wiki/Inode inodes] and an inode can be [url https://en.wikipedia.org/wiki/Unix_file_types (at least)] either a regular file, directory or devices. Permissions per users and group. 3. =[kernel]=: Program that manages all communication and operations between the hardware and software [$br2] .font18 *[For now we'll generally just look over some of the common stuff and go into detail later, while in Linux.]* :slide35 ## 2.1. shell [$br1] :sqrow :column The operating system is divided into layers, [url https://youtu.be/-FtCTW2rVFM?t=57 like an onion].\n Only the kernel has access to the hardware.\n shells and executables (binaries) have access to the kernel. .p shells provide user interfaces, [url http://en.wikipedia.org/wiki/Shell_%28computing%29 both] command-line and graphical. While on the topic of UNIX, we'll only discuss the former. .centered [image ./img/unix-layers.png] :slide ### Command-line shells .p Command-line shells operate solely with text, often only with ASCII characters. Usually connecting to a computer (to the shell) is said to be done with either a "terminal" ("console") or "teleprinter" ("teletype"). :row .p Video displays became widely available in the late 1970s, so before that access was done with a "teleprinter", a typewriter-printer combo. .width25% .font18 [image https://upload.wikimedia.org/wikipedia/commons/thumb/d/df/ASR-33_at_CHM.agr.jpg/1280px-ASR-33_at_CHM.agr.jpg]\n *[Teletype Model 33 by [url https://commons.wikimedia.org/wiki/User:ArnoldReinhold Arnold Reinhold]]* [$newrow] .p With the advent of computer displays, teleprinters were replaced with terminals, video display-keyboard combo. Thanks to the widely popular [url https://en.wikipedia.org/wiki/VT100 VT100], almost all terminals support [url https://en.wikipedia.org/wiki/ANSI_escape_code ANSI escape codes]. .font18 [image https://upload.wikimedia.org/wikipedia/commons/thumb/9/99/DEC_VT100_terminal.jpg/675px-DEC_VT100_terminal.jpg?20131106192408]\n *[DEC VT100 by [url https://www.flickr.com/people/54568729@N00 Jason Scott]]* :slide .p Today we don't usually use such specialised hardware, but the names persist with slightly different meanings. .p [url https://en.wikipedia.org/wiki/Virtual_console Virtual Terminal] (Console, tty): on some modern UNIX-likes (like Linux), the kernel/OS provides special devices, where a console is directly implemented/simulated with the connected computer and display. .bulleted - systems that don't provide it usually expose another way to connect a console (now external hardware and/or software is required), like a [url https://en.wikipedia.org/wiki/Serial_port serial] port .p [url https://en.wikipedia.org/wiki/Terminal_emulator Terminal Emulator]: a special program that emulates a virtual terminal (within another display system) .p [url https://en.wikipedia.org/wiki/Pseudoterminal Pseudoterminal] (pty): a program that sits between a terminal and a shell (or other program), and makes them both think they're directly connected .bulleted - ssh, a program to remotely connect to a device's shell, establishes a pseudoterminal. Your local terminal thinks it's connected to a local shell and vice versa with the remote shell, however the pty in between handles the true communication. :middle_slide ### Command location and parameters .p Most commands are regular programs, which exist somewhere on the system. When entering a command name, the shell searches for an executable with that name. Searching the whole drive is very inefficient, so the search is limited to the few directories, listed in a variable called "PATH". .p You can give a full or relative path, and that will be executed directly. .p Each command can be given parameters, separated by spaces, each (by convention) being either text values or options (switches). .p Options usually start with a hyphen `[-]` (like `[-c]` or `[-current-time]`) and often support abbreviations (`[-a -b -c]` to `[-abc]`). Newer conventions also allow start with two hyphens `[--]` (`[--current-time]`), though if shown as-is indicate the rest is a string. More rarely you might see `[--OPTION=VALUE]` or just `[OPTION=VALUE]`. :middle_slide ### Thompson shell .p The first UNIX shell is the [url https://en.wikipedia.org/wiki/Thompson_shell Thompson shell]. The two major features that every other shell supports are: :bulleted - I/O redirection: redirection of input and output, allowing insertion of input from a file or storage of output into a file: ``` command [args...] < filepath command [args...] > filepath - pipelines: being able to redirect the output from one command to another without limit ``` command1 [args...] | command2 [args...] | ... :slide ## 2.2. UNIX-style file system .bulleted - Rooted singular tree of directories (the root folder of which is called "root", denoted with a "/") - Each tree object (node) is an [url https://en.wikipedia.org/wiki/Inode i(ndex)node], containing type, ownership, access, internal data and any other metadata, but =[not]= name! - Directory inodes (files) contain a table with rows of names and inode numbers, mapping a file name to it's contents. Each row is called a [url https://en.wikipedia.org/wiki/Hard_link hard link]. Every directory has a hard link to itself, called ".", and a hard link to it's parent, called "..". .p [image ./img/unix-dir-tree.png ] :middle_slide ### Attributes of an inode :bulleted - [url https://en.wikipedia.org/wiki/Unix_file_types file type]: Each inode represents some sort of file (data), but a "file" can also be a directory or device, so we need to know how to handle it.\n There are 7 main types (but there can exist more, depending on OS): [$br2] .bulleted - =[regular]=: just a plain old file - =[directory]=: as explained, a file containing hard links to other files. Each directory is allowed to appear *[only once]* in a single parent. - =[symbolic link]=: points to any file (or directory). It contains the (relative) path to that object (as a simple string), so a symlink not could even be valid. Think of a C++ pointer (pseudocode). ``` symlink /bin/mprog = "/usr/local/bin/mprog"; exec(/bin/mprog); -> Executes /usr/local/bin/mprog :slide :bulleted - =[FIFO special]=: aka [url https://en.wikipedia.org/wiki/Named_pipe named pipe], a special type of regular file, where at any moment only one process can read from it and one can write to it. It takes the concept of shell pipes to inter-process communication. ``` [Process] --write-> [FIFO] x-write-- [Process] [ A ] x--read-- [ ] --read--> [ B ] - =[block special]=: a device which is randomly accessible, like a hard drive or cdrom drive. Think of a C++ array (pseudocode): ``` BlockSpecial[438] = 1 ; var = BlockSpecial[438] - =[character special]=: devices which are accessed via serial streams of input or output, like keyboards, mice, graphics cards and teletypes.\n Think of a C++ stream (pseudocode): ``` CharacterSpecial << 1 ; CharacterSpecial >> var :middle_slide :bulleted - =[sockets]=: file for inter-process communication. Compared to FIFO specials, they can be used by more than two processes, used in both directions and support [url https://en.wikipedia.org/wiki/File_descriptor file descriptors] and packets [$br1] ``` [Process A] <--read&write--> [Socket 1] [Process B] <--read&write--> [Socket 1] ... [Process A] --write--> [Socket 1] --read--> [Process B] --read--> [Process C] :slide #### Side-step into users and groups [$br3] .p A user in UNIX is essentially a small collection of data, most important of it being a =[unique ID]= (number), a =[name]= (string), a =[group id]= (number; files created by the user are in that group) and a =[password]=. .p Groups are also small collections of data, but much simpler, comprised only of a =[unique ID]= (number), a =[name]= (string) and a =[list of users]= that are "in" the group. Their main purpose is to simplify access control. .p User with ID 0 is called root, it is the "system administrator", all actions made by the system itself are done as that user. Every user has their own "home" folder (under "/home/USERNAME/", except for root, which is "/root/"), in which they store personal files, as well as user-specific configuration files. :slide :bulleted - [url https://en.wikipedia.org/wiki/File_system_permissions file permissions]: control the freedom of a user to read (navigate for directories), change and execute files\n File permissions are split into three sets: what the =[owner]= can do, what =[group]= members can do and what =[anyone else]= can do.\n Each set contains a mix of three permissions: =[r]=ead, =[w]=rite and e=[x]=ecute, and each set stores some combination of them. [$br1] .p Each is represented by a bit in the order above (`[101]` means you *[can]* read, *[cannot]* write, *[can]* execute) and the overall permissions are often shown as a three digit number or character string.\n Example: `[111101100]` will be shown as `[754]` or `[rwxr-xr--]` - link count: the count of hard links that point to the inode. A file is deleted =[only when there are no hard links left to it]=. Every directory inode has at least two - one from parent and one from itself ("."), though the latter =[isn't]= counted. Child and parent directories also =[add]= to the count. [$br1] - User ID of owner, Group ID, file size and timestamps :slide ## 2.3. Kernel .font18 [$br1] .p The kernel has the core functionality of the operating system and bridges the gap between programs and hardware. Some important subsystems include scheduling, file, device, process and memory management. .p To preserve the [url https://en.wikipedia.org/wiki/Everything_is_a_file everything is a file] methodology, devices and processes can be handled as files (inside `[/dev]` and `[/proc]`). They (most of the time, in modern kernels) aren't actual files on a hard drive, but "virtual" files, where operations on them are handled in a different manner than normal by the kernel. [$br2] .font35 Linux is =[not]= an operating system, it is a kernel!!! [$br2] .p Two of the important things a kernel does is manage =[processes]= and =[memory]=. :slide35 ### 2.3.1 Processes [$br1] .p A program is some collection of code and other data, stored in a file (or noncompiled code).\n A process is the program in motion, the program itself *[alongside]* certain system states. .p The system states include (but are not limited to): .bulleted - the processor state, including the program counter and register values - memory map, indicating what regions of memory are allocated (to the process) - [url https://en.wikipedia.org/wiki/File_descriptor file descriptors], unique identifiers for files :middle_slide35 .p On a multitasking (multiprocessing) system, which is any modern desktop system, multiple processes are always running "at once". .p In reality, constantly one process is stopped and another is ran, switching between all of them ([url https://en.wikipedia.org/wiki/Context_switch context switching]). All "waiting" processes are put in a prioritized queue ([url https://en.wikipedia.org/wiki/Run_queue run queue]). .p Stopping is either done by the process itself (more on that later), or by the system when the process has been actively running for too much time. :slide35 #### Process Control Block .p All process information is stored in a data structure, called a *[process control block]* (or process descriptor). Not to be confused with a [url https://en.wikipedia.org/wiki/Printed_circuit_board printed circuit board]!\n A PCB's data can be split into three categories: .numbered 1. =[Process identification]= - the ID of the process itself, of it's parent process, of it's owner (user), etc. 2. =[Process state]= - as described, the processor state, etc.\n On a context switch, values in registers are put into the stopped PCB and then are loaded from the running PCB. 3. =[Process control information]= - process state, memory map, privileges, etc. .p A process can be =[forked]=, meaning a new process is created, with the same underlying code (but not process states and ...). :middle_slide35 #### Process states .bulleted - =[Created]=: This is the initial state given to a new process. In this state, the process awaits a "ready" state. - =[Ready]= (Waiting): A process is put in the run queue and awaits being executed ("running" state). - =[Running]=: When the process's program is being currently executed by the CPU. The process can run in either *[kernel mode]*, having access to the kernel and user addresses, or *[user mode]*, having access only to it's own code and data. - =[Blocked]=: When a process cannot continue without external change, usually when waiting for an I/O device. - =[Terminated]=: The process has either completed execution or has been killed. The process itself is called a [url https://en.wikipedia.org/wiki/Zombie_process "zombie process"] (until it is removed). :middle_slide35 #### Exit status .p Every process, before being deleted (and after being in a Terminated state) passes a small number, called the [url https://en.wikipedia.org/wiki/Exit_status exit status] (exit code), to it's parent process. For all intents and purposes, this number is an 8bit unsigned integer, meaning it's value is between 0 and 255. .p Pretty much universally, an exit code of 0 signifies a successful termination, and anything else specifies some sort of error code. Different programs specify the meaning of an exit code in different ways, some assign each number to a specific error, other use the bits representing the number as flags. :slide35 ### 2.3.2 Memory management [$br2] .p In ye olde days, you would save a bit of memory for the operating system and give all of the rest to the current process ([url https://en.wiktionary.org/wiki/monoprogramming monoprogramming]), which we wait until completion. .p This doesn't allow for context switching (disks are slow) and a blocked state forces the CPU to wait. .p Some bad ideas include: depending on the code of all processes to not use the same addresses, use only [url https://en.wikipedia.org/wiki/Position-independent_code relative addresses] (`[-fPIC]` option in `[gcc]`) or use a [url https://en.wikipedia.org//wiki/Relocation_%28computing%29#Relocation_table table] that the kernel fills out with addresses. :slide35 ### Virtual memory [$br2] .fright [image ./img/virtual-memory-by-Ehamberg.png]\n .font18 *[Image by [url https://commons.wikimedia.org/wiki/File:Virtual_memory.svg Ehamberg]]* .p A program (process) works with some (sequential) addresses, and the OS (with the help of [url https://en.wikipedia.org/wiki/Memory_management_unit hardware]) translates/maps the process' addresses (logical/[url https://en.wikipedia.org/wiki/Virtual_address_space virtual addresses]) to real ones (physical addresses). .p Simplified memory management for the program, protected address space for each process and improved flexibility. :slide35 #### Segmentation (without paging) [$br2] .p Initially, virtual memory was implemented with segmentation, where a process' memory is divided into segments for code, stack, heap, etc. Each segment is given a chunk of contiguous memory, which can be resized, and addresses are defined as the base segment (physical) address with an offset. .p In the modern day, segmentation isn't used by itself, since resizing segments could require reordering of memory and [url https://en.wikipedia.org/wiki/Fragmentation_%28computing%29#External_fragmentation external fragmentation], where there simply isn't enough contiguous memory, could occur. :slide35 #### Paging [$br3] .p The virtual and physical memory is divided into fixed-sized chunks of consecutive addresses. A virtual memory chunk is called a =[page]=, while a physical memory chunk is called a (page) =[frame]=. Usually both have the same size of (at least) 4 KiB. .p Every process has a [url https://en.wikipedia.org/wiki/Page_table page table], which maps the process' pages to system frames. Thus every memory address is translated from it's page location to a frame location. :slide35 .p Advantages include: .bulleted - elimination of [url https://en.wikipedia.org/wiki/Fragmentation_%28computing%29#External_fragmentation external fragmentation] - increased flexibility, allowed sharing of memory between processes (useful for shared libraries) - finding free memory is fast and easy, just use the first free pages you see - frames can be scattered - more efficient swapping, moving of memory from RAM to disk and back .p Disadvantages include: .bulleted - longer memory access, because of the page table - [url https://en.wikipedia.org/wiki/Fragmentation_%28computing%29#Internal_fragmentation internal fragmentation], where one or multiple pages might be partially empty, leaving free space that cannot be used :slide # 3. Linux from the inside .p Enough theory, time to have some fun and learn Linux! .p We'll be using [url https://ivandavidov.github.io/minimal/#home Linux Minimal Live] (just a bootable ISO) because it's: .bulleted - very small, comprised of only the Linux kernel, GNU C library and Busybox\n - made by a fellow Bulgarian [$bg] .p We're going to explore and look around the following stuff: .bulleted - =[Shells]=: What a modern shell would often support on a Linux based OS - =[Files and directories]=: Naming conventions and commands to work with them - =[Common configuration files]=: Relevant configuration files, their purpose and syntax - =[System management]=: Other commands with which to modify your system :middle_slide18 #### If you're following at home .p Owing to it's size, not everything we need comes out of the box, so I created [url ./mll-set.sh a script] which installs what's needed. If you want to try it out yourself, after booting up Minimal Linux Live, type out and run: ``` wget -q http://unsecure.syndamia.com/mll-set.sh && chmod +x mll-set.sh && ./mll-set.sh .p You might've noticed that the command above doesn't use the normal [url ./mll-set.sh https://syndamia.com/talks/linuxgraphy-by-strabo/mll-set.sh] link. That is because, out of the box, MLL doesn't really support secure connections (https), so I've made only that file available without it. .p The script, in summary, installs bash, coreutils and shadow "packages" (from [url http://s.minos.io/archive/bifrost/x86_64 http://s.minos.io/archive/bifrost/x86\_64]) to /usr/local/bin, creates /etc/passwd, /etc/bashrc and /etc/profile with some values in them and starts bash. :slide ## 3.1. Shells [$br2] ### Variables and data types .p Before talking about the different features newer shells support, we'll roughly cover variables and data types (mostly [url https://www.gnu.org/savannah-checkouts/gnu/bash/manual/bash.html#Shell-Parameters bash] specifics). .p A variable, also called parameter, is created with the syntax (NO spaces between the =): ``` name=value .p Where `[name]` is either a combination of letters (upper and lower case), numbers and the underscore character, which cannot begin with a number, or one of select few =[positional]= and =[special]= parameters. :slide35 #### Positional and special parameters .p [url https://www.gnu.org/savannah-checkouts/gnu/bash/manual/bash.html#Positional-Parameters Positional parameters] are numbers from 1 to N, where each one corresponds to a shell (or function, or script) argument. .p Some of the [url https://www.gnu.org/savannah-checkouts/gnu/bash/manual/bash.html#Special-Parameters special parameters] are: .bulleted - `[0]`, corresponding to the shell's or script's name, - `[@]` is all positional parameters, separated by spaces - `[#]` is the number of positional parameters - `[?]` is the last executed (foreground) process - `[$]` is process ID of the current shell. .p `[value]`, without any special surrounding characters (more on that later), is treated as a string (but it mustn't have spaces!), however it may also be omitted, in which case it is the empty string `[""]`. :slide35 #### Data types .p In most shells, there are 3 data types: strings, integers and (one-dimensional) indexed arrays. [url https://www.gnu.org/savannah-checkouts/gnu/bash/manual/bash.html#index-declare bash has more], like references and associative arrays, but those aren't universal. .p A value is a =[string]= when surrounded by single `[']` or double `["]` quotes,\n an =[integer]= when it isn't surrounded by anything and composed only of digits (if there are letters, it is a string; it's [url https://tldp.org/LDP/abs/html/untyped.html quite tricky] when a value is interpreted as a string or integer) and\n an =[array]= when surrounded by braces `[()]`, where inside elements are strings or integers, separated by spaces. An indexed array can also be created by specifying the value at any index: ``` name[index\]=value :slide #### Mixing types and operations [$br1] ``` name=value .table |= Type =||= string operation =||= integer operation =||= array operation =| |:string:| |:value:| |:0:| |:value when index is 0, empty string otherwise:| |:integer:| |:empty string, but sometimes value:| |:value:| |:value when index is 0, empty string otherwise:| |:array:| |:operation is done on element at index 0:| |:operation is done on element at index 0:| |:value:| :slide #### Parameter expansion .p With [url https://www.gnu.org/savannah-checkouts/gnu/bash/manual/bash.html#Shell-Parameter-Expansion parameter expansion], actions are done with the general variable (entity) or on strings and arrays values. Operations on integers are done with [url https://www.gnu.org/savannah-checkouts/gnu/bash/manual/bash.html#Arithmetic-Expansion arithmetic expansion]. .p Parameter expansion is started with the character `[\$]` and either a name or curly braces. Retrieving a value is done with the forms `[\$name]`, `[\${name}]` and `[\${name[index]}]` for arrays. .p Some parameter expansions in bash: .bulleted - `[\${#parameter}]` length of characters - `[\${parameter:offset}]` and `[\${parameter:offset:length}]` return substring from index offset - `[\${parameter/pattern/string}]` return a string, where every match on pattern is replaced by string - `[\${name[@]}]` a (space separated) sequence of indices from 0 to length (for defined values) of array name :slide #### Arithmetic expansion .p Arithmetic expansions allow evaluation of [url https://www.gnu.org/savannah-checkouts/gnu/bash/manual/bash.html#Shell-Arithmetic arithmetic expressions]. They're in the form `[\$(( expression ))]`. .p Some useful expressions are: .bulleted - `[++name]`, `[name++]`, `[--name]`, `[name++]` postfix and prefix incrementation and decrementation - `[+]`, `[-]`, `[*]`, `[/]`, `[%]` normal arithmetic operators, `[**]` exponentiation - `[~]`, `[<<]`, `[>>]`, `[&]`, `[^]`, `[|]` bitwise operators - `[==]`, `[!=]`, `[>]`, `[>=]`, `[<]`, `[<=]` logical comparison - `[!]`, `[&&]`, `[||]` logical not, and, or - `[expr ? exprTrue : exprFalse]` conditional - `[=]`, `[+=]`, ..., `[<<=]`, ... assignment operators - `[expr1 , expr2]` consecutive expressions and `[(expr)]` bracing .p Expressions work with both variable names and integers. :middle_slide ### Bourne shell .p Most modern shells support a lot of the more defying features of the [url https://en.wikipedia.org/wiki/Bourne_shell Bourne Shell], which include: :bulleted - [url https://en.wikipedia.org/wiki/Job_control_%28Unix%29 Job control]: Management of currently running groups of processes\n :bulleted - `[&]`: A process can be ran in the background with an ampersand (&) at the end of the command: ``` command [args...] & - `[Ctrl+z]`: suspend the currently running (foreground) job - `[bg]`: start a suspended job in the background - `[fg]`: resume the last job to be put in the background and make it the current job with which we're interacting - `[jobs]`: list all active (background) jobs :middle_slide :bulleted -   :bulleted - Every job is identified with a =[Job ID]= (number), which you can use by prepending a percentage sign. Example, to resume the job with JID 4: ``` fg %4 - For managing everything, usually there is a *[job table]*. Upon shell termination, the shell tells all jobs in that table to terminate and waits for them.\n `[disown]`: remove a job from the job table - `[kill]`: send a signal to the process or job. Then that process will have to handle it accordingly.\n Common ones are `[-KILL]` to immediately stop the process, `[-QUIT]` to quit it, `[-ABRT]` to cancel the current action, `[-TERM]` shut down (orderly), `[-STOP]` shut down (forcefully) :middle_slide27 :bulleted - [url https://en.wikipedia.org/wiki/Here_document heredoc]: File literal, meaning it is a user "string" which is interpreted as a file .bulleted - start with `[<< NAME]`, where NAME can be anything you want, it is used to mark the beginning and end of a heredoc. `[<<]` is also a redirection symbol - on every new line write out your text, all characters will be preserved - to end it, write out `[NAME]` on the beginning of a new empty line. Example: ``` cat << MYFILE This is some text MYFILE - control operators: Control what command is executed, depending on [url https://en.wikipedia.org/wiki/Exit_status exit status] .bulleted - `[expr1 && expr2]`: Run expr2, only if expr1 exited successfully - `[expr1 || expr2]`: Run expr2 if expr1 exited unsuccessfully - `[expr1 ; expr2]`: Run expr2 after expr1 (sequential execution) :middle_slide27 :bulleted - [url https://en.wikipedia.org/wiki/Redirection_%28computing%29 redirection]: Outside of having `[<]` and `[>]` for I/O redirection, often there is also :bulleted - `[>>]`: Acts like `[>]`, putting text into a file, but rather than overwriting everything, it appends it - `[<<]`: As explained, for heredocs - `[<<<]`: To the right is a string ([url https://en.wikipedia.org/wiki/Here_document#Here_strings herestring]), and it is interpreted as a file ``` cat <<< "Hello World!" - =[Standard input]=, =[standard output]= and =[standard error]= are all =[files]= with which the shell works. Typing text in the shell puts it into stdin, command output are put into stdout (and shown to the user) and errors are put into stderr (also shown to the user). .p Each one of them is numbered from 0 to 2, and you can specify redirection by appending or prepending it to the redirection symbol.\n .p .bulleted - `[N>outputfile]`: redirects the output from standard stream N into outputfile - `[N>&M]`: redirects the output from standard stream N into standard stream M - `[N<inputfile]`: redirects input file contents to standard stream N - `[N<&M]`: redirects standard stream M to standard stream N :middle_slide27 :bulleted - [url https://en.wikipedia.org/wiki/Test_%28Unix%29 built-in test command]: With `[test]` you can do conditional expressions. You can also often use brackets `[[ args... ]]` instead of `[test args...]` :p :bulleted - Some of the available file checks .bulleted - `[-e FILENAME]`: FILENAME exists - `[-d FILENAME]`: if FILENAME is a directory - `[-h FILENAME]`: if FILENAME is a symbolic link - `[-w FILENAME]`: you can write in FILENAME - Some of the available string checks .bulleted - `[-n STR]`: string STR has nonzero length - `[-z STR]`: string STR has length zero - `[STR1 = STR2]` and `[STR1 != STR2]`: self explanatory - Some of the available number checks .bulleted - `[INT1 -eq INT2]`: equal integers - `[INT1 -gt INT2]`: INT1 > INT2 - `[INT1 -gt INT2]`: INT1 >= INT2 - Operators: `[!]` - negation, `[-a]` - binary AND, `[-o]` - binary OR, parentheses for grouping (escaped with `[\\]`) :slide :bulleted - [url https://en.wikipedia.org/wiki/Shell_script scripting support]: Running commands from top to bottom of a file .p .bulleted - `[#]`: Everything between a # and the end of the line denotes a comment - `[#!]`: If on the first line of the file, is treated as a [url https://en.wikipedia.org/wiki/Shebang_%28Unix%29 shebang], which specifies what interpreter to be used for the script - compound commands: Structures for flow control\n In all examples, `[test-command]` is any command (or combination with control operators), and logic is based on exit code. :p :bulleted - conditional (if) ``` if test-command ; then commands-if-true elif test-command ; then commands-if-true else commands-if-true fi `[elif]` statements can appear multiple times, both elif and else are optional :slide :bulleted -   [$br1] :bulleted - conditional (case) ``` case word in (pattern | ...) commands-if-true ;; ... esac .p Where the beginning brace can be ommited.\n Patterns are according to [url https://www.gnu.org/savannah-checkouts/gnu/bash/manual/bash.html#Pattern-Matching bash's pattern matching] - loops ``` while test-command; do commands-if-true; done ``` for name in words...; do commands-if-true; done If `[in words...]` is ommited, positional arguments are used :middle_slide35 ## 3.2. Files and directories .bulleted - `[/dev]`: devices - `[/proc]`: process information [$br3] .bulleted - `[/bin]`: main command programs (binaries), used by users - `[/sbin]`: command programs, used for the system operation - `[/usr]`: other system resources .bulleted - `[/usr/bin]`: all other user command binaries :slide #### Everyday file system commands .bulleted - `[mkdir]`: create directories - `[rm]`: remove a (hardlink to) file or directory - `[mv]`: move file (or directory) to another directory - `[cp]`: copy a file - `[ln]`: create hard (or soft, with `[-s]`) links to files (inodes) - `[ls]`: list directory contents #### Other useful file system commands .bulleted - `[shred]`: overwrite file contents - `[dd]`: do many things with files, like modifying and duplicating data across devices and partitions, (basic) data recovery and generating files - `[du]`: estimate file usage - `[stat]`: display status for file or file system - `[chwon]`: change owner of an inode - `[chgrp]`: change group of an inode - `[chmod]`: change file modes - `[find]`: search for a file inside a directory (recursively) :slide #### Everyday text (file) commands .bulleted - `[echo]`: prints a given string to standard output - `[cat]`: sequentially prints files to standard output - `[less]`: view files with support for scrolling - `[head]`: print the first lines (or characters) of a file - `[tail]`: print the last lines (or characters) of a file #### Other useful text (file) commands .bulleted - `[grep]`: searches for matches in a file, using simplified regular expressions. `[egrep]` for extended regex and `[fgrep]` for string match (as-is). - `[sed]`: filter and transform text from files, according to regex - `[tr]`: translate, map (more than one to one for example) and delete characters - `[cut]`: remove parts of text from each line - `[strings]`: find embedded text inside binary files - `[truncate]`: shrink or extend the size of a file - `[wc]`: count words inside file - `[diff]`: compare files and show their differences :middle_slide #### Device commands .bulleted - `[mount]`: bind another file system to a location in the file system - `[umount]`: remove binding to a file system - `[lsusb]`: lists USB devices - `[lspci]`: lists PCI devices #### System information commands .bulleted - `[date]`: display current date and time - `[hostname]`: show (or set) the systems hostname (network name) - `[free]`: display free memory on the system :slide #### User and group management .bulleted - `[useradd]`: creates a new user (with a password) - `[userdel]`: remove a user and all of his files - `[usermod]`: modify a user account, including his name and groups - `[passwd]`: change password of a user. With `[-l]` user could be locked (impossible to log in) [$br3] .bulleted - `[groupadd]`: create a new group - `[groupdel]`: remove a group - `[groupmod]`: modify an existing group [$br3] .bulleted - `[w]`: show who is logged in and what they are doing - `[who]`: show who is logged in - `[whomai]`: print username - `[login]`: establish a new connection to the system :slide :bulleted - `[/etc]`: system configuration files [$br2] :bulleted - `[/etc/group]`: the data for all groups is stored here, in the format ``` GROUPNAME:PASSWORD:GID:USERS... - `[/etc/passwd]`: the data for all users is stored here, in the format ``` USERNAME:PASSWORD:UID:GID:USERINFO:HOMEDIR:SHELL - `[/etc/shadow]`: stores other user data, especially the password as an encrypted string, in the format ``` USERNAME:PASSWORD:LASTCHANGED:MINBETWEENCHANGES:MAXBETWEENCHANGES:WARNEXPIRE:INACTIVEDAYS:EXPIRE :slide27 :bulleted -   :bulleted - `[/etc/fstab]`: read on boot by mount to setup devices in fs ``` # device-spec mount-point fs-type options dump pass /dev/sda1 / ext4 defaults 1 1 UUID=123A-456B /boot vfat noauto,noatime 1 2 LABEL=Vault /mnt/Vault auto nosuid,nodev,nofail 0 0 - `[/etc/hosts]`: list of host names (domain names) and their corresponding IP address, when a DNS server doesn't do the job ``` # ipaddress domains 127.0.0.1 localhost mywebsite.com something.else 62.44.101.138 my.uni - `[/etc/bashrc]`: global defaults and aliases used by the bash shell - `[/etc/motd]`: message of the day (unexecuted text), shown after login but before the shell is ran :middle_slide35 :bulleted - `[/usr]`: other system resources .bulleted - `[/usr/lib]`: library files - `[/usr/local]`: local, system software - `[/usr/share]`: architecture independent data, like manuals - `[/var]`: variable data, like logs and cache - `[/home]`: home directories of all users - `[/lib]`: libraries and kernel modules - `[/mnt]`: mounted temporary filesystems - `[/root]`: home directory for the root user :slide # 4. Software licensing .p It is important to note that most software in Linux, including the kernel itself, is under a variety of open licenses. An open license is a license which allows others to reuse the original work, under some restrictions. .p For software, such a license is applied to the source code from which the original application was made. Some commonly used ones, with a (NON-LEGALLY BINDING) summary include: .bulleted - [url https://en.wikipedia.org/wiki/MIT_License MIT]: the author is not responsible for anything that the code does - [url https://en.wikipedia.org/wiki/BSD_licenses BSD-3-Clause]: use the same license (copyleft), the author is not responsible for anything that the code does - [url https://en.wikipedia.org/wiki/GNU_General_Public_License GNU GPL]: disclose the original project and use the same license ([url https://en.wikipedia.org/wiki/Copyleft copyleft]) .p This is important, since pretty much everything can be freely modified by anyone (for the better or worse, but generally better). :slide35 # 5. What makes up a useful OS [$br1] .p A Linux-based operating system is nothing more than the Linux kernel, combined with an assortment of programs.\n Overall, the main elements that make up a useful *[desktop]* operating system (alongside a command-line shell) are: [$br1] .numbered 1. =[Bootloader]=: load the kernel and operating system 2. =[Init system]=: well managed way to start everything inside the OS 3. =[Package manager]=: way to manage our binaries 4. =[Desktop manager]=: graphical shell :slide ## 5.1. Bootloader .p The computer boot process is a relay race. .p In a "standard" BIOS-MBR boot configuration: ``` BIOS -> Master Boot Record -> Active Parition -> Bootloader -> -> Boot menu (optional) -> Kernel -> Everything else .bulleted - [url https://en.wikipedia.org/wiki/BIOS BIOS]: a small chip with code that is ran upon power up - [url https://en.wikipedia.org/wiki/Master_boot_record MBR]: small amount of space at the very start of the disk, that contains some (bootstrap) code and partition information. The BIOS hands off execution to the bootstrap code. - Active partition: MBR loads and runs code from the beginning of the partition (boot sector), which starts the bootloader - [url https://en.wikipedia.org/wiki/Bootloader Bootloader]: the first complex program, which manages kernels, shows a menu, allows for debugging and so on :middle_slide .p The BIOS is baked into the motherboard, the MBR is pretty universal (and it's less than 512 bytes) and the kernel we know and love. .p That leaves us with the need for a bootloader. On Linux, the most used one is called [url https://en.wikipedia.org/wiki/GNU_GRUB GRUB]. .p It's config is found in `[/etc/default/grub]` (which is then fully generated with `[grub-mkconfig]`) and can be "installed in the partition" with `[grub-install]`. ``` GRUB_DEFAULT=0 GRUB_TIMEOUT=5 GRUB_TIMEOUT_STYLE=menu GRUB_CMDLINE_LINUX="" GRUB_CMDLINE_LINUX_DEFAULT="" GRUB_GFXMODE=640x480 :slide27 ## 5.2. Init system .p Now we need to load the user stuff, like a network manager, the graphical user interface and so on.\n In Minimal Linux Live, this is done by some hard written scripts, which aren't very modular or extensible. .p [url https://en.wikipedia.org/wiki/Init Init systems] use configuration files, called service, to start a program in some way. :bulleted - [url https://en.wikipedia.org/wiki/Systemd systemd], which also aims to also provide easy system configuration ```ini [Unit] Description=Service [Service] Type=simple User=root ExecStart=/usr/local/sbin/script.sh [Install] WantedBy=multi-user.target - [url https://en.wikipedia.org/wiki/OpenRC openrc], a simple and modern system :slide ## 5.3. Package manager [$br1] .p Programs are just executable files, usually placed in a specific folder, like `[/usr/bin]`. .bulleted - MLL comes with ~100 executables (and ~230 after my script) - my Debian server has ~1000 installed packages - my current desktop Gentoo setup has ~1500 installed packages - my old Fedora setups have reached ~3000 packages on multiple occasions .p Updating all of them means replacing all of the binaries, means tracking where each one came from, installing a new one might require installing multiple others and so on. .p On modern systems, managing binaries is automated by an application called a package manager. :middle_slide35 .p Additional benefits of package management automation include: .bulleted - checking for package validity (and tampering) - utilising multiple server mirrors - automatic dependency management and installation - version caveats - (indirectly) improved stability because of maintainer compatibility checks [$br1] ``` apt install PKG apt remove PKG apt upgrade :slide ## 5.4. Window systems .p A widow system is your graphical shell: the graphical way to interact with your computer (the kernel). It's main components are: [$br1] :bulleted - =[Window manager]=: the overall system with which you create, remove, resize and work with windows. They are generally separated into two types: .bulleted - *[Stacked]*: this is what you know, windows are just rectangles floating about, and can be put one over each other - *[Tiled]*: windows take up a predefined (but changeable) portion of the screen - =[Compositor]=: adds an unseen window buffer, allowing additional processing, before a window is shown. This powers transparency, animation effects and more. - =[Session manager]=: graphical application that manages the window managers (outside WMs), think of it as a login window :slide27 # 6. War on distros .p A distribution is a complete set of of all applications that you might use. .p Overall, there are 4-5 distributions from which 90% of all other distros are based upon: .numbered 1. [url http://www.slackware.com/ Slackware]: as the oldest still maintained distro, it served as inspiration for a lot of other distros. It aims to be as simple and as close to UNIX as possible. 2. [url https://www.debian.org/ Debian] (and [url https://ubuntu.com/ Ubuntu]): it is the most popular distribution (if we also include Ubuntu). Ubuntu aims to be modern and fancy, while Debian aims to be as stable as possible, which makes it the most popular in server usage. 3. [url https://www.gentoo.org/ Gentoo]: the distribution which defined non-binary software distribution 4. [url https://archlinux.org/ Arch Linux]: a very popular and modern distribution with the goal of providing the latest and greatest software. .p We haven't touched too much, but Linux distributions aren't limited to desktop or server computers. .numbered 1. [url https://en.wikipedia.org/wiki/Android_%28operating_system%29#AOSP AOSP]: the base of the whole Android operating system 2. [url https://en.wikipedia.org/wiki/OpenWrt OpenWRT]: a router operating system :slide35 # 7. Demos - virtual machine and setup from zero [$br2] .p Now for the best part: we'll be installing a Linux distribution on a virtual machine and then directly on a laptop, running Windows 10! .p I've chosen to install [url https://linuxmint.com/ Linux Mint], since it is generally targeted towards newcomers and will feel familiar enough to Windows users. .p Currently there won't be an official recording of the process, you should've been here live! :title_slide35 # Thank you for your time :slide18 # Sources .p All of these contain, at most, small paraphrased sentences in the slides. They served as personal educational and reference tools. .bulleted - [url https://en.wikipedia.org/ wikipedia] and [url https://stackexchange.com/ stackexchange] have been used extensively and linked where appropriate. - [url https://people.cs.rutgers.edu/~pxk/416/index.html CS416 - Operating Systems Design] by [url https://people.cs.rutgers.edu/~pxk/rutgers/ Paul Krzyzanowski] at [url https://www.cs.rutgers.edu/ Rutgers State University] - [url https://web.archive.org/web/20220713165958/https://homepage.cs.uri.edu/~thenry/resources/unix_art/ch02s01.html Origins and History of UNIX, 1969-1995] from [url https://web.archive.org/web/20220611160552/https://homepage.cs.uri.edu/~thenry/resources/unix_art/index.html The Art of Unix Programming] by Eric Steven Raymond - [url https://www.youtube.com/watch?v=EY6q5dv_B-o Ken Thompson interviewed by Brian Kernighan at VCF East 2019] by [url https://vcfed.org/ Vintage Computer Federation] - [url http://www.cs.uah.edu/~kkeen/CS590/Pseudo%20Terminals.ppt Pseudo Terminals presentation] from [url http://www.cs.uah.edu/~kkeen/CS590/ CS590] by [url http://www.cs.uah.edu/~kkeen/kkeen.html Kevin J. Keen] at [url https://www.uah.edu/ University of Alabama in Huntsville] - [url https://teaching.idallen.com/dat2330/04f/notes/links_and_inodes.html Hard links and Unix file system nodes] from [url https://teaching.idallen.com/dat2330/04f/ DAT 2330 Introduction to Operating Systems] by [url http://www.idallen.com/ Ian D. Allen] - [url https://www.cs.uic.edu/~jbell/CourseNotes/OperatingSystems/9_VirtualMemory.html Virtual Memory, OS Course Notes] by [url https://www.cs.uic.edu/~jbell/ Dr. John T. Bell] at [url https://www.cs.uic.edu University of Illinois Chicago],\n which is directly based on [url https://archive.org/details/operating-system-concepts-9e Operating System Concepts, 9th Edition] by [url https://codex.cs.yale.edu/avi/ Abraham Silberschatz] - [url http://csapp.cs.cmu.edu/3e/home.html Chapter 9 Preview - Virtual Memory] from [url http://csapp.cs.cmu.edu/ Computer Systems: A Programmer's Perspective] by [url http://www.cs.cmu.edu/~bryant Randal E. Bryant] and [url http://www.cs.cmu.edu/~droh David R. O'Hallaron] - [url https://www.cse.iitb.ac.in/~mythili/teaching/cs347_autumn2016/notes/07-memory.pdf Memory Management, Lecture Notes] by [url https://www.cse.iitb.ac.in/~mythili/ Mythili Vutukuru] - [url http://www.cs.iit.edu/~cs561/cs351/VM/paging%20advantages%20and%20disadvantages.html Paging advantages and disadvantages] by Charles R. Bauer at [url http://www.cs.iit.edu/ Computer Science Departmemnt of Illinois Institute of Technology] - [url https://www.gnu.org/savannah-checkouts/gnu/bash/manual/bash.html Bash Reference Manual] by the [url https://www.gnu.org/ GNU project] and [url https://man.openbsd.org/ksh ksh manual] by the [url https://www.openbsd.org/ OpenBSD project] - [url https://www.in-ulm.de/~mascheck/bourne/v7/ Manual of the Bourne Shell on Version 7], archived by [url https://www.in-ulm.de/~mascheck/ Sven Mascheck] - [url https://www.linuxbase.org/betaspecs/fhs/fhs/ch04.html /usr hierarchy] from [url https://www.linuxbase.org/betaspecs/fhs/fhs/index.html Filesystem Hierarchy Standard] by [url https://www.linuxfoundation.org/ Linux Foundation], Daniel Quinlan, Paul 'Rusty' Russell, Christopher Yeoh - [url https://www.pathname.com/fhs/pub/fhs-2.3.pdf Filesystem Hierarchy Standard] by Rusty Russel, Daniel Quinlan and Christopher Yeoh - [url https://refspecs.linuxbase.org/LSB_3.1.1/LSB-Core-generic/LSB-Core-generic/usernames.html#FTN.AEN26405 User & Group Names] from [url https://refspecs.linuxbase.org/LSB_3.1.1/LSB-Core-generic/LSB-Core-generic/book1.html Linux Standard Base Core Specification] by [url https://en.wikipedia.org/wiki/Free_Standards_Group Free Standards Group] - [url https://docstore.mik.ua/orelly/linux/run/ch05_07.htm Managing User Accounts] from [url https://docstore.mik.ua/orelly/linux/run/index.htm Running Linux, Third Edition] by Matt Welsh, Matthias Kalle Dalheimer, and Lar Kaufman - [url https://neosmart.net/wiki/mbr-boot-process/ The BIOS/MBR Boot Process] by [url https://wiki.neosmart.net/ NeoSmart's Knowledgebase] # Licensing .p Images on slides "2.1. shell" and "2.2. UNIX-style file system" are made by me and [url /license.html licensed as content].\n The script [url http://syndamia.com/talks/linuxgraphy-by-strabo/mll-set.sh mll-set.sh] is licensed under MIT (though I don't see it as substatial enough to be licensed under anything).