Deyao’s Blog

The Unexpected Joy of Using Paper

2024-03-16T00:00:00+00:00

Ever since I got an iPad with a digital pen around eight years ago, I have continuously been trying to stay away from paper. I thought paper was old-fashioned and outdated. It takes up a lot of space, is heavy, and cannot be organized easily without tools like folders. Even with folders, it cannot be nested infinitely and logically the way files on a computer can. Moreover, anything I can do on paper I can also do on an iPad, and more. More importantly, I think technology is the future, so why should I use something that is literally thousands of years old, and seemingly used only by old people who don’t understand new technology, when I can use an iPad that is only a few decades old?

That was what I used to think until I started using paper again a few weeks ago by accident. I had forgotten to bring my iPad to the library one day, so I just stole some paper from the printer and borrowed a pen from a friend to work on some math problems.

At the end of the day, I discovered that for some reason my mental process was much clearer, and I solved more difficult problems than I used to. So I began to print out the lecture notes and problem sheets I use. This method has worked perfectly well so far.

I wondered why this was the case and came up with three reasons: less mental overhead, paper working like multiple screens, and the elimination of a false sense of progress.

Less Mental Overhead

Our mental capacity, despite what we wish and might sometimes feel, is limited, so we would want to minimize the mental overhead to maximize efficiency. Mental overheads are basically things that don’t really contribute to the task at hand, but are necessary to be mentally aware of, such as keeping computers charged, making sure WiFi is connected, etc.

Mental overhead is large when using a computer. Many things need to work correctly to keep a computer running smoothly: the app you are in, the other apps you have open, and how many tabs are open and whether they are taking up too much RAM and slowing the computer down. Also, when writing on an iPad, I need to make sure that my hand doesn’t accidentally activate the touchscreen, which means placing my palm directly and clearly on the screen to activate palm rejection. Keeping track of all this means constantly making sure that I don’t accidentally break something. And when things inevitably go wrong, I have to spend quite some time fixing them, and I tend to get distracted by other things on the computer while doing so.

But when using paper, I have far fewer things to keep track of. There is no way a sheet of paper can fail except through predictable and understandable physical damage.

I also suspect that multiple stacks of paper are easier to track mentally than multiple opened browser windows because we have dedicated brain circuits to track physical objects but we have to use the much slower, costly and error-prone conscious process to keep track of files open on a computer.

Basically Multiple Monitors on Steroids

If you are a tech nerd, you know how pleasant it is to have multiple monitors. You can keep your reference on one screen while working on another screen without having to juggle multiple windows.

You can achieve the same functionality with paper as with an extra monitor if you are primarily working with text, as I do when studying maths. Moreover, you can have three, four or however many pieces of paper spread out in front of you, instead of being limited to just a few due to cost concerns.

Eliminating False Sense of Progress

When I use computers, sometimes I like to click around and do random things when I get stuck on a task. For example, one of my favourite things to do on a computer is to upgrade software. I would feel like I have achieved something even though in reality I haven’t. Other illusions include arranging my windows, changing keyboard and mouse settings, tinkering with network settings to make the internet faster, and just clicking around aimlessly. These things keep me busy and fool me into thinking I am being productive even when I am not.

But, if I am using paper, there is nothing to do other than work with the content on the paper. I can keep good track of my progress and not be fooled by this illusion.

It’s a Tradeoff After All

Paper is good for certain things, but certainly not for all things. I still prefer typing essays on a computer to writing them on paper because typing is just so much faster. There are still, obviously, many things that cannot be done on paper, such as coding.

But, for what it is good at, being a medium for text, it undoubtedly is better than any other means of conveying text, even including e-readers.

After all, technology is just a tool. As science and engineering advance, we get better and more powerful technology like better screens, faster internet connections and faster computation. But these are only better in the sense that they can solve more problems, or problems that can’t be solved by old technology. That is, they are more useful tools for a wide variety of purposes. Nevertheless, a tool is still a tool: tools should be used only when there is a need for them, and the tool most fit for the purpose should be used. So, when the purpose is specialized and the scope is limited, such as understanding text, I think paper is a perfect tool, and better than any newer technology we have.

ChatGPT’s Frequent Request Refusals

2024-02-18T00:00:00+00:00

Recently, I was trying to investigate how printing works at my university. I know it sends files with some protocol over HTTPS because I had to enter an HTTP URL into a system dialogue to add the printer, as well as my university email and password, but I was not sure how it works exactly.

I was particularly interested in knowing how it handles authentication and if there’s any vulnerability that can lead to users’ email and password being stolen. I used Wireshark to analyse the traffic and found out it was encrypted with TLS.

So I needed to find a way to decrypt the traffic. After some Googling, I found that I could set the SSLKEYLOGFILE environment variable when launching applications like Google Chrome. This tells the app to write the secrets for each TLS session (the per-session keys, not the server’s private key) into a file, which network traffic analyzers like Wireshark can then use to decrypt the captured traffic.
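The same mechanism is not limited to browsers. As a minimal sketch, here is how a Python program can ask its TLS library to write a key log. The function name `make_logging_context` is made up for illustration; `keylog_filename` is an attribute of `ssl.SSLContext` in Python 3.8 and later, and `ssl.create_default_context()` also picks up SSLKEYLOGFILE from the environment on its own.

```python
import ssl


def make_logging_context(keylog_path):
    """Return a client-side TLS context that appends session secrets to keylog_path.

    make_logging_context is a hypothetical helper name used for illustration.
    """
    context = ssl.create_default_context()
    # Programmatic equivalent of launching the program with SSLKEYLOGFILE set.
    context.keylog_filename = keylog_path
    return context
```

Any connection made through this context would append its session secrets to the file, which can then be given to Wireshark. Whether a particular system service offers a hook like this depends entirely on the TLS library it was built with.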

This method works with browsers, but since the printing was probably initiated by some system process, I didn’t know how to pass environment variables to it, and even if I could, I was not sure it would respect the environment variable.

Google didn’t immediately return any useful result, so I did the next best thing I could do: I asked ChatGPT. I told it I wanted to achieve something similar to SSLKEYLOGFILE for Chrome but for the entire system. To my utter surprise, ChatGPT refused my request, saying it was unethical and illegal. I knew that ChatGPT refuses to assist with actually committing illegal activities, but what I was trying to do was far from illegal or unethical.

ChatGPT Refused requests

This led me to test whether it would refuse more requests that are not illegal at all. Here are some examples of refused requests:

Fork Bomb: I asked ChatGPT why my fork bomb does not work in Python. It refused to answer me as well, citing that it can “cause serious harm”.
Reverse Shell: I asked it how to use a reverse shell. It refused with the classic “I’m sorry, but I can’t assist with that.” This was complete nonsense because the prerequisite for using these techniques is that an attacker gains arbitrary code execution on a system and by that time, the system is totally compromised. By that time you have much bigger problems than a fork bomb.

Intriguingly, as I was writing this, I asked it about reverse shells again, two weeks after it initially denied my request, and it actually answered with helpful basic information, including commands to launch one (though I didn’t test whether they work).

Degree of safety

LLMs are trained on a vast number of sources on the internet, which, mind you, has everything, so there are obviously many things in there that are off-limits, illegal or unethical. So it’s important to moderate the model, or as OpenAI calls it, “safety and alignment”. Safety and alignment basically consist of rejecting certain requests and making sure the output doesn’t contain certain information, like how to cook meth.

But taken to the extreme, it stops the spread of useful information and knowledge. For example, telling people about cybersecurity might be useful for hackers, but it is also useful for those who are interested in knowing how computer systems work and how they can defend themselves from attacks. Crucially, cybersecurity is freely discussed on the internet, search engines like Google return useful results, and learning about it is not illegal or unethical in any sense.

Refusal doesn’t even work

Refusing to answer cybersecurity-related questions doesn’t help prevent attacks from happening. This is because actual hackers have much better tools. They can also use a plethora of open-source models (or even train one themselves) that don’t have such restrictions, e.g. dolphin-mixtral. Unsurprisingly, it tells you how to cook meth. Setting it up wasn’t even difficult and required nothing except a decently powerful GPU like an Nvidia RTX 3080. Actual well-sponsored hacker groups undoubtedly have much better tools and resources, so ChatGPT refusing these requests accomplishes little other than stopping curious individuals from learning about cybersecurity.

Printer protocol

After some intensive Googling and document reading, I found out that the transparent mode in mitmproxy was exactly what I needed. I set it up and successfully captured and decrypted the traffic. It turns out there is nothing special about the authentication protocol; it just sends the username and password as plaintext in the HTTP header. Luckily, the traffic is encrypted with TLS. Still, I think there are better ways, like using OAuth, in case the traffic gets intercepted after it is decrypted somewhere else or the private TLS key is leaked, but that’s probably for next time.
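To show why credentials in a header are only as safe as the TLS around them, here is a small sketch that assumes the scheme is something like HTTP Basic authentication. That is an assumption for illustration; the post does not say exactly which header the printer uses. The helper names are made up. Basic authentication only Base64-encodes `username:password`, so anyone who sees the decrypted header can reverse it.

```python
import base64


def basic_auth_header(username, password):
    """Build the value of an Authorization header for HTTP Basic auth."""
    token = base64.b64encode(f"{username}:{password}".encode("utf-8"))
    return "Basic " + token.decode("ascii")


def decode_basic_auth(header_value):
    """Recover (username, password) from a Basic Authorization header value."""
    scheme, _, token = header_value.partition(" ")
    if scheme != "Basic":
        raise ValueError("not a Basic credential")
    username, _, password = base64.b64decode(token).decode("utf-8").partition(":")
    return username, password
```

Nothing here needs a key: decoding is a single function call, which is why a token-based scheme like OAuth limits the damage if the header ever leaks.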

Applications of First Principles

2024-01-26T00:00:00+00:00

We often overlook first principles due to their self-evident nature, as our focus lies in uncovering less apparent truths. In mathematical contexts, while clear first principles, such as the definition of differentiation, exist, their direct application may not lead us far in practical derivations. Instead, we leverage a multitude of clever methods devised by brilliant minds over centuries, allowing us to work efficiently without constant contemplation of the foundational principles.
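As a small worked example of this point, differentiating x^2 directly from the limit definition takes a few lines of algebra:

```latex
\frac{d}{dx}x^2
  = \lim_{h \to 0} \frac{(x+h)^2 - x^2}{h}
  = \lim_{h \to 0} \frac{2xh + h^2}{h}
  = \lim_{h \to 0} (2x + h)
  = 2x
```

The power rule, \frac{d}{dx}x^n = n x^{n-1}, gives the same answer in one step, and nobody goes back to the limit each time they differentiate a polynomial.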

Applying first principles without considering their broader consequences can be a perilous mindset, exemplified by the notion “you only live once.” While it urges us to seek meaningful and joyful experiences, its straightforward application could lead to a myopic pursuit of pleasure without productivity. However, it’s important to realise that it’s better to be productive to maximise happiness in the long term rather than in the short term, so it becomes the intermediate trick we use and we temporarily forget about the ultimate goal to focus all our attention on practical methods.

This perspective underscores the complexity hidden beneath the surface of first principles. Institutions often have primary goals akin to first principles, guiding their actions and influencing intermediate objectives. Despite a cynical interpretation of a government’s primary goal as resource extraction from citizens and land, intermediate goals like building infrastructure contribute positively to public welfare. Similarly, while private companies’ primary goal may be profit maximization, their intermediate goals of delivering quality products and services benefit consumers.

In essence, understanding and navigating first principles involves appreciating the intricate layers of consequences and adopting pragmatic strategies that align with both short-term and long-term objectives.

First Principles

2024-01-11T00:00:00+00:00

That is the difference between mathematics and physics. Mathematicians, or people who have very mathematical minds, are often led astray when “studying” physics because they lose sight of the physics. They say: “Look, these differential equations—the Maxwell equations—are all there is to electrodynamics; it is admitted by the physicists that there is nothing which is not contained in the equations. The equations are complicated, but after all they are only mathematical equations and if I understand them mathematically inside out, I will understand the physics inside out.” Only it doesn’t work that way. Mathematicians who study physics with that point of view—and there have been many of them—usually make little contribution to physics and, in fact, little to mathematics. They fail because the actual physical situations in the real world are so complicated that it is necessary to have a much broader understanding of the equations. — Richard Feynman, The Feynman Lectures on Physics

What are First Principles

First principles are basic truths about a complex system from which all other truths can be logically derived. In mathematics, these principles manifest as axioms—self-evident truths. In the realm of physics, they take the form of fundamental laws, such as Newton’s Laws of Motion and Maxwell’s equations for electrodynamics.

We generalise the concept of first principles to other fields of knowledge beyond science. When analyzing and predicting the behavior of individuals or organizations, we can identify their ultimate objectives as the first principle, with intermediate goals representing logical implications. For instance, one might posit that the foundational assumption in economics is that individuals act rationally to maximize their utility or happiness.

There are also more cynical first principles that we might observe. For example, we might see that the first principle of a government is to extract resources from citizens and territory for the benefit of those in power, as opposed to prioritizing the welfare and utility of the public. Similarly, one might argue that the core principle guiding private companies is to maximize profit for shareholders rather than focusing on providing customers with desired products.

The consequences of First Principles are complex

The first principles are really important to understand. Newton’s discovery of the fundamental laws of motion, for example, paved the way for precise calculations of celestial body motion and the strategic placement of satellites in orbits tailored to our needs, such as synchronizing with the Earth’s rotation every 24 hours.
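As a rough sketch of that kind of calculation: for a circular orbit, setting gravity equal to the centripetal force gives r^3 = GM T^2 / (4 pi^2), so the period alone fixes the orbit radius. The function name is made up for illustration, and the constant is the standard gravitational parameter of the Earth. One rotation of the Earth relative to the stars takes about 86164 seconds, slightly less than 24 hours.

```python
import math

GM_EARTH = 3.986004418e14  # Earth's gravitational parameter, in m^3 / s^2


def circular_orbit_radius(period_seconds, gm=GM_EARTH):
    """Radius in metres of a circular orbit with the given period.

    Derived from GM / r^2 = (2 * pi / T)^2 * r, i.e. gravity = centripetal.
    """
    return (gm * period_seconds ** 2 / (4 * math.pi ** 2)) ** (1 / 3)


# Radius measured from the centre of the Earth, in kilometres.
print(round(circular_orbit_radius(86164.1) / 1000), "km")
```

This ignores everything except Newtonian gravity from a point-mass Earth, which is exactly the "first principles only" view the paragraph describes.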

Similarly, comprehending the first principle of rational agents is key to unraveling the workings of economic activities and making reasonably accurate predictions. Remarkably, the ability to derive precise consequences from first principles using logic alone, without direct observation, is a testament to the power of foundational understanding.

However, we should be careful not to overestimate the power of first principles. Because the first principles are quite simple on the surface, one might be tempted to underestimate the need to fully learn their implications. After all, why waste time learning about consequences when you can work them out yourself in your head? In reality, the resulting consequences are complex and not at all obvious. In the realm of mathematics, the derivation of complex and profound theorems from basic axioms and definitions exemplifies the depth and richness that emerges from seemingly straightforward principles. Often entire fields in maths, like set theory, are built upon really simple, straightforward and intuitive axioms.

The complexity of consequences is also apparent in systems like Conway’s Game of Life, where intricate behavior arises from simple rules.
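The entire rule set fits in a few lines. This is a minimal sketch, representing the board as a set of live (x, y) cells; the representation and names are my own choice rather than any standard one.

```python
from collections import Counter


def step(live_cells):
    """Advance a set of live (x, y) cells by one generation."""
    # Count, for every cell, how many live neighbours it has.
    neighbour_counts = Counter(
        (x + dx, y + dy)
        for x, y in live_cells
        for dx in (-1, 0, 1)
        for dy in (-1, 0, 1)
        if (dx, dy) != (0, 0)
    )
    # A cell is alive next turn with exactly 3 neighbours,
    # or with 2 neighbours if it is already alive.
    return {
        cell
        for cell, count in neighbour_counts.items()
        if count == 3 or (count == 2 and cell in live_cells)
    }


# The "glider": five cells that crawl diagonally across the board forever.
GLIDER = {(1, 0), (2, 1), (0, 2), (1, 2), (2, 2)}
```

Nothing in `step` mentions movement, yet the glider reappears one cell further along the diagonal every four generations, which is not something most people would predict just from reading the two rules.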

Moreover, recognizing these consequences is not an innate skill for most individuals. People are not usually able to work out the consequences by themselves in a vacuum, even though a theoretically infinitely intelligent being could derive them. This underscores the importance of not only understanding first principles but also acknowledging their limitations and the need for practical exploration and observation in complex real-world scenarios.

Single points of failure in IPv4

2023-12-25T00:00:00+00:00

In my previous blog post, I made the bold claim that hacks like NAT and HTTP’s Host header were sufficient for our current internet needs, good enough to work around the limited number of IPv4 addresses. However, a recent revelation about government surveillance through Apple’s and Google’s notification servers has led me to reconsider. These IPv4 workarounds, I’ve realized, introduce critical vulnerabilities: they create centralized points of failure, in stark contrast to the decentralized ethos of the Internet Protocol.

What are Notification Servers

Notification servers, operated by Apple and Google, are what deliver push notifications to devices. When an app wants to send a notification, it first has to send it to a notification server. Your device then periodically checks with these servers for new notifications. There is no other way for an app to send notifications to Android and iOS users. This mechanism allows Apple and Google to see all your notifications from every app you use.

Recently, it was revealed that Apple and Google had been quietly sharing users’ notification histories with law enforcement. With the benefit of hindsight, this was bound to happen when everyone’s valuable information is conveniently gathered in one easy-to-access place. Although it might appear that forcing everyone to use centralised notification servers is driven by some hidden agenda, there are good reasons why apps cannot simply send notifications directly to users.

Why Notification Servers are needed

The rationale behind these centralized servers lies in the requirements of push notification technology: timely delivery and high availability. In an ideal world, each device would operate its own server, accepting incoming messages directly from various sources. This is essentially what notification servers do: they passively gather messages. However, NAT makes this impossible for the device itself. A device behind NAT must initiate an outbound request before it can receive anything back, which costs CPU time and battery. That cost also scales linearly with the number of sources. With tens or hundreds of different apps on your phone all wanting to send you notifications, periodically sending packets to all these different servers is impractical.

The IPv6 Game-Changer

Enter IPv6, where receiving messages from various sources is straightforward. Devices can actively listen for incoming connections without the need for constant outbound communication to each source, drastically reducing CPU usage. Most importantly, CPU usage stays constant, independent of the number of sources for notification. This negates the need for centralised notification servers, essentially turning your phone into a notification server.
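Here is a minimal sketch of what "listening" means in practice, assuming the device has a globally reachable address and no firewall in the way. It uses plain UDP datagrams rather than any real push protocol, and the function names are made up. The point is that one socket is opened once, and any number of senders can reach it without the device contacting each of them first.

```python
import socket


def open_listener(host="::", port=0):
    """Open a single UDP socket that any sender can reach directly.

    "::" means all IPv6 addresses of this machine; port 0 lets the
    operating system pick a free port.
    """
    family = socket.AF_INET6 if ":" in host else socket.AF_INET
    listener = socket.socket(family, socket.SOCK_DGRAM)
    listener.bind((host, port))
    return listener


def receive_notifications(listener, count, timeout=2.0):
    """Wait for `count` datagrams and return (sender_address, text) pairs."""
    listener.settimeout(timeout)
    received = []
    for _ in range(count):
        payload, sender = listener.recvfrom(4096)
        received.append((sender[0], payload.decode("utf-8")))
    return received
```

The device does the same amount of work to keep this socket open whether three apps or three hundred might send to it, which is the constant-cost property described above.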

Beyond Notification Servers: IPv6 and Decentralized Communication

The deeper issue with IPv4 lies in its structural limitation: for two devices to communicate over the internet, at least one must have a global address, not hidden behind NAT. Since most consumer devices use NAT, communication often relies on third-party servers like WhatsApp, Telegram, or Gmail. While end-to-end encryption offers security, the dependence on central servers brings its own set of problems, including potential outages and privacy concerns. It is also nearly impossible to distribute truly open and free messaging tools and social media. Open-source initiatives like Matrix, a messaging platform, and Mastodon, a social media platform, require users to run a server, which is quite difficult to set up and expensive to scale.

In contrast, IPv6 facilitates a more decentralized, robust internet ecosystem. It paves the way for open-source messaging platforms, making them as straightforward to set up as downloading an app. There is no need for the complicated process of finding a computer that is up 24/7 with a globally reachable IP and configuring the network. This shift could herald a return to the early, decentralized days of email, where messages were sent directly to the recipient’s computer, denoted by the part of the email address after @, free from third-party control and the associated risks: there is no downtime because one intern pushed a wrong config, no terms of service to agree to that put you at the mercy of the companies, and no massive data leaks that harm everyone.

In essence, although IPv4 has worked so far thanks to a series of hacks that largely resolve the problem of address depletion, IPv6 brings a lot more benefit than merely a large address space: it redefines our relationship with the internet, enhancing privacy, reliability, and freedom in digital communication.

The backward compatible hack that keeps the web together

2023-12-10T00:00:00+00:00

The internet is nothing short of a modern miracle. It’s astonishing that I can video call friends and family in China from halfway across the globe in the UK with almost seamless connectivity. My luggage gets lost in transit and things I try to send through the mail get stopped by customs, yet tiny changes in electrical current somehow manage to get through dozens of networking devices, run by different groups of people with varying technical ability and agendas, and make it to the other end with remarkable reliability.

This robust and advanced state of the internet we enjoy today owes much to standardization, good algorithm designs and, just as importantly, a series of clever backwards-compatible hacks. These innovations have introduced new features while maintaining connectivity through older hardware and software that have yet to be updated.

In this blog, I aim to explore some of these ingenious hacks that keep the internet functioning smoothly and the lessons they offer. By delving into these technological marvels, we can appreciate the intricacies and brilliance behind our daily digital interactions.

Backwards compatible hacks

Network Address Translation

In my previous blog post, I touched on the significance of Network Address Translation (NAT) and its operational basics. Without delving into the details already covered, let’s remember the key function of NAT: it enables a multitude of devices, far exceeding the 2^32 limit, to connect to the internet using 32-bit addressing. NAT achieves this by intelligently using additional packet information at the receiving end to correctly route the data.

Basically Everything Uses HTTP

HTTP, which stands for HyperText Transfer Protocol, is the backbone of internet connectivity, initially designed for transmitting only text, like the content on this blog page. If we only consider its original intent, it might seem logical to restrict HTTP to text transmission and delegate other tasks to specialized protocols like SSH for computing, FTP for file transfer, and some hypothetical Video Transmission Protocol to transmit videos.

However, in reality, HTTP’s application has expanded far beyond its initial design. It is now used for a wide array of tasks: sending files, streaming live videos, and executing commands through RESTful APIs. This versatile usage underscores a recurring theme in technological evolution — using technologies in ways they were not originally intended to be used.

A major reason for HTTP’s predominance is its widespread support. Many networks, like the free WiFi at Edinburgh Airport, restrict protocols like SSH or FTP, but not HTTP, as blocking it would render most websites inaccessible. This prevalence makes HTTP a more reliable choice for various internet activities.

Another example of HTTP’s reliability over other protocols comes from my experience using traceroute, a tool that maps the journey of a message across the internet. While its default probes (ICMP or UDP, depending on the platform) often time out, indicating blockages, switching to TCP on port 80, which HTTP uses, yielded successful and informative results. This experience highlights HTTP’s robustness and wide acceptance in diverse network environments.

Base64 Encoding

Imagine needing to send an image through a channel that only supports text. This problem can be resolved using Base64 Encoding. This method transforms any arbitrary data into a string of Latin letters, numbers, and select symbols.

One application of Base64 encoding is in email digital signatures with GPG. Digital signatures, which look like random data, can be cumbersome to transfer via email. Encoding them with Base64 is a very good hack because it gets them past most email filters and won’t accidentally break poorly programmed email servers or clients, not to mention the convenience of copy and paste.

In essence, Base64 encoding is a clever workaround, enabling the transfer of any data type through text-only channels. This method overcomes the limitations of channel compatibility, offering a practical alternative to overhauling communication systems or facing data transfer restrictions. While this approach may introduce some data inefficiency, its benefits in versatility and accessibility are significant.
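A short sketch using Python’s standard `base64` module shows both the round trip and the cost. The two wrapper names are made up for illustration. Every 3 bytes of input become 4 text characters, so the encoded form is about a third larger.

```python
import base64


def to_text_channel(data: bytes) -> str:
    """Encode arbitrary bytes as ASCII text that a text-only channel can carry."""
    return base64.b64encode(data).decode("ascii")


def from_text_channel(text: str) -> bytes:
    """Recover the original bytes on the receiving side."""
    return base64.b64decode(text)


raw = bytes(range(256))          # every possible byte value, including unprintable ones
encoded = to_text_channel(raw)   # only letters, digits, '+', '/' and '=' padding
print(len(raw), "bytes ->", len(encoded), "characters")
```

The receiving side gets back exactly the bytes that were sent, and nothing in between ever has to handle a byte that is not ordinary text.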

TLS 1.3 version number

Long story short, TLS 1.3 is the latest iteration of the Transport Layer Security protocol, designed to safeguard internet activities from malicious interception, such as password theft or webpage tampering. However, implementing TLS 1.3 faced unexpected challenges.

The protocol specification includes a field indicating its version. While updating this field to reflect TLS 1.3 seemed straightforward, trials by Chrome and Firefox revealed that it disrupted numerous network connections. The root cause was the incompatibility of some network devices designed to passively listen to and potentially filter traffic, which crash if they see any version number higher than 1.2 in the version field. This is a very stupid programming mistake if you ask me, but given TLS 1.2’s prolonged dominance, the mistake is understandable: there was simply no real traffic with a version beyond 1.2 to test against.

To resolve this, the Internet Engineering Task Force (IETF) essentially resorted to using a hack. Instead of altering the version number, they moved the version negotiation to an extension. This effectively makes old programs think the connection still uses 1.2 while new programs can look elsewhere to find the actual version, maintaining backwards compatibility.
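The idea can be sketched with a heavily simplified model of the ClientHello message. The dictionary layout and function names here are my own invention and not a real parser; the numbers are the ones used on the wire: 0x0303 means TLS 1.2, 0x0304 means TLS 1.3, and 43 is the type of the supported_versions extension.

```python
TLS_1_2 = 0x0303
TLS_1_3 = 0x0304
SUPPORTED_VERSIONS = 43  # extension type that carries the real version list


def legacy_view(client_hello):
    """What an old middlebox reads: only the fixed version field."""
    return client_hello["legacy_version"]


def negotiated_version(client_hello, server_supports=(TLS_1_3, TLS_1_2)):
    """What an up-to-date server does: prefer the extension if present."""
    offered = client_hello.get("extensions", {}).get(SUPPORTED_VERSIONS)
    if offered is None:
        # Simplification: treat an old client as offering only its legacy version.
        offered = [client_hello["legacy_version"]]
    for version in server_supports:
        if version in offered:
            return version
    raise ValueError("no TLS version in common")


modern_hello = {
    "legacy_version": TLS_1_2,  # frozen at 1.2 so old devices stay calm
    "extensions": {SUPPORTED_VERSIONS: [TLS_1_3, TLS_1_2]},
}
```

An old device inspecting `modern_hello` sees 1.2 and lets it through, while a new server reads the extension and picks 1.3.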

WebSocket

WebSocket, at its core, is functionally similar to TCP, offering full-duplex communication between a client and server. However, WebSocket extends some additional capabilities over standard TCP connections.

One significant advantage of WebSocket is its operation over the same TCP port used by HTTP. This alignment with HTTP’s port is highly beneficial, as previously discussed. Leveraging the HTTP port ensures broad support and acceptance, reducing the likelihood of encountering blocks or connectivity issues.

In contrast, utilizing alternative TCP ports often leads to compatibility challenges. Many firewalls and internet service providers, like those managing public WiFi networks (for example, at Edinburgh airport), restrict or block these non-standard ports. This limitation makes WebSocket’s compatibility with HTTP’s port an essential feature for reliable and accessible web communications.
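A WebSocket connection actually begins life as an ordinary HTTP request asking to be upgraded, which is why it passes through anything that allows HTTP. Below is a small sketch of that opening handshake, following RFC 6455. The function names are made up; the GUID is the fixed constant from the RFC.

```python
import base64
import hashlib

WEBSOCKET_GUID = "258EAFA5-E914-47DA-95CA-C5AB0DC85B11"  # fixed value from RFC 6455


def upgrade_request(host, path, key):
    """The HTTP request a client sends to start a WebSocket connection."""
    return (
        f"GET {path} HTTP/1.1\r\n"
        f"Host: {host}\r\n"
        "Upgrade: websocket\r\n"
        "Connection: Upgrade\r\n"
        f"Sec-WebSocket-Key: {key}\r\n"
        "Sec-WebSocket-Version: 13\r\n"
        "\r\n"
    )


def websocket_accept(sec_websocket_key):
    """The Sec-WebSocket-Accept value the server must send back."""
    digest = hashlib.sha1((sec_websocket_key + WEBSOCKET_GUID).encode("ascii")).digest()
    return base64.b64encode(digest).decode("ascii")
```

To an intermediary that only understands HTTP, the start of the exchange looks like one more GET request on the usual port. Only after the server answers with the matching accept value do both sides switch to WebSocket frames on the same TCP connection.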

Backwards Compatibility Hacks Widen Limited Communication Channels

Conceptually, a protocol essentially describes metadata attached to a message, instructing intermediary systems on the nature of a message and how to handle it without them needing to understand the content of the message. In this view, the backward-compatible changes add new metadata instead of changing the original metadata for new and improved functionality.

In the context of Network Address Translation (NAT), the strategy is not to modify the IP address format but to use additional internet traffic data to determine the appropriate traffic routing when the IP address alone is insufficient.

Similarly, in the ubiquitous use of HTTP, the modification isn’t in the format of the metadata (HTTP headers), but in altering the content of the data block or the message body.

For Base64 encoding, instead of using binary data that might cause issues in systems with inadequate data processing capabilities, the data is formatted to resemble text, ensuring compatibility with text-processing systems.

In the case of TLS 1.3, rather than changing the version number, which could confuse older systems, a new field is added to signal the version to newer systems without causing conflicts.

With WebSocket, the innovation lies in not altering the TCP port number but in changing the data transmitted over the TCP connection to mimic different functionalities.

A secondary commonality in these backwards-compatible solutions is their ability to enhance the capabilities of communication channels originally designed for limited functions. NAT expands the number of devices connectable to the internet, Base64 encoding enables different data types to be sent through text-only channels, and WebSocket provides enhanced functionalities without needing port changes.

Complete Redesign is Also Needed

All of these hacks are clever, but they often come at the cost of bandwidth efficiency or latency. So there are times when making a backwards-incompatible change is necessary. HTTP/2 is a successful example. It can be faster than its predecessor largely because it cuts down on repeated connection handshakes by sending many requests over a single connection. This is not possible with a backwards-compatible change because it requires changing how requests are framed on the connection. It was quickly adopted: by May 2022 it had a 68% share of all HTTP traffic, which is quite impressive for a protocol introduced only in 2015.

IPv6 is another example of a backwards-incompatible change, although it has found mixed success. I am not very certain about my opinion of it because, at least for now, the IPv4 hacks work and switching to IPv6 does not seem to provide much more benefit. However, I learned something new this week that slightly changed my opinion. I might write about it in the next blog post (or not).

NAT Is Good, I hope it still exists for IPv6

2023-11-12T00:00:00+00:00

I used to hate Network Address Translation (NAT) because it made hosting anything so much more complicated. It also makes devices waste a lot of power because they have to constantly poll a server to receive push notifications. IPv6 is supposed to address this problem, so I decided to experiment with it. Although the experiment failed (I still mostly use IPv4), I began to see several huge advantages of NAT.

Briefly, How NAT Works

IP addresses are like physical addresses: they tell the routers between two devices where to send data. Theoretically, each device should have its own IP address so it can be unambiguously addressed. Because IPv4 addresses are only 32 bits long (about 4.3 billion in total), it’s impossible to give every device its own IP address. Thankfully, most of the internet works on TCP or UDP, which use port numbers intended to address different programs running on the same computer. For example, an SSH server might listen on port 22 while an HTTP server might listen on port 80. Nothing other than convention ties a program to a particular port.

Network Address Translation (NAT) is a hack that uses port numbers to address different devices rather than different programs on the same device. Many devices connect to the same NAT router, and the router forwards traffic arriving on different ports to different devices. For example, port 22 can lead to an SSH server on computer A behind NAT and port 80 to an HTTP server on computer B behind NAT, but to an outsider it looks like the two servers run on the same computer. NAT essentially allows multiple devices to share the same IP address. Your home router does NAT automatically. You can manually tell the router which outside port should map to which port on which computer, but for ordinary outgoing connections the mapping is created automatically so that normal people don’t have to configure anything to use the internet.

One problem with NAT is that if you want to make a service persistently available, such as a website, you must control the router that performs NAT so you can tell it to always associate an outside port with the web server that you run (this is called port forwarding). This is not always possible. Sometimes your home router is the NAT router, so you can easily set up port forwarding. But sometimes IPv4 addresses are so scarce that your internet service provider controls the NAT router and your home router shares one IP address with several other homes, as is the case for my home in China.
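To make the port bookkeeping above concrete, here is a minimal sketch of a NAT table in Python. The class `ToyNat`, its method names, and all the addresses are hypothetical names made up for illustration. A real NAT router also tracks the remote address and protocol and expires idle mappings, none of which is modelled here.

```python
class ToyNat:
    """Toy port-based NAT: many private hosts share one public IP address."""

    def __init__(self, public_ip, first_port=40000):
        self.public_ip = public_ip
        self.next_port = first_port
        self.inbound = {}   # public port -> (private ip, private port)
        self.outbound = {}  # (private ip, private port) -> public port

    def forward_port(self, public_port, private_ip, private_port):
        """Port forwarding: a permanent mapping set by whoever controls the router."""
        self.inbound[public_port] = (private_ip, private_port)

    def send_out(self, private_ip, private_port):
        """A device inside opens a connection; the router picks a public port for it."""
        key = (private_ip, private_port)
        if key not in self.outbound:
            self.outbound[key] = self.next_port
            self.inbound[self.next_port] = key
            self.next_port += 1
        return (self.public_ip, self.outbound[key])

    def receive(self, public_port):
        """A packet arrives from outside: deliver it if a mapping exists, else drop it."""
        return self.inbound.get(public_port)


nat = ToyNat("203.0.113.7")
nat.forward_port(22, "192.168.1.10", 22)      # SSH server on computer A
nat.forward_port(80, "192.168.1.11", 8080)    # HTTP server on computer B
print(nat.receive(22))                        # ('192.168.1.10', 22)
print(nat.send_out("192.168.1.12", 51000))    # ('203.0.113.7', 40000)
print(nat.receive(40000))                     # ('192.168.1.12', 51000), a reply gets back in
print(nat.receive(443))                       # None: unsolicited traffic is dropped
```

The last line is the hosting problem in miniature: without a forwarding rule, nothing from outside can reach a device unless that device spoke first.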

NAT Enables Networking Freedom

Recently I’ve been unhappy with the WiFi quality and switching speed of the router provided by my internet service provider, so I bought a router myself to fix this problem. Connecting it to the network is trivial. I just had to run a cable from the ISP router to my router and set my router to treat the connection to the ISP router as its internet connection. My router automatically acts as a NAT router, among other things. To my ISP-provided router, it just appears as one device even though there are multiple devices connected to my WiFi.

Getting IPv6 to work was a lot more complicated because I couldn’t figure out how to get the ISP router to allocate a block of IPv6 addresses from its pool to my router. I asked my roommate, who has his own router, how he got it to work with IPv6, and he told me that he just uses NAT for IPv6 (NAT66). Unfortunately, I couldn’t figure out how to enable NAT66 on my router, so I eventually gave up. This made me realize one major advantage of NAT that I hadn’t thought about previously.

It allows you to connect a lot of devices to the same network even if the internet connection you get only allows you to connect one device. For example, you can make a cellular connection your main internet connection through NAT because the network carrier thinks all the devices in your home are a single device. Or if you buy internet on a plane that only allows one device to connect at a time so they can sell you more connections, you can get all your devices online by using a router. In both cases, the internet service provider has no way of knowing exactly how many devices are connected to the network, so it cannot implement price discrimination. If providers knew how many devices you have, they could start charging a premium on top of the network traffic you incur, just like how Apple charges disproportionately for RAM and storage.

Disappearance of NAT Can be Bad for You

Everything works with NAT today, not because NAT is functionally indistinguishable from having no NAT, but because companies design their online products with NAT in mind. Even if you don’t have websites to run, your day can be ruined if they stop doing so. For example, because of NAT, push notifications currently work (roughly) by having your device poll a server periodically. You can control your smart IoT devices from outside your home only because there is a central server that your mobile phone sends commands to and your smart home devices fetch commands from. Without NAT, and with every device globally routable over IPv6, a device might instead just have a process listening on a port to receive push notifications and commands. Devices that are still behind NAT would then have parts of their functionality broken.

The lack of need for NAT will mean programs and devices will not be designed to function with NAT and gradually spell the death of it. This means your ISP will be able to know and control the number of devices connected to its network. This will most likely lead to them implementing price discrimination. For example, they can charge you more if you have more IoT devices because that implies you have a bigger house and thus can afford a higher price, even though the devices don’t take up any bandwidth. Your cellular company can stop you from using mobile hotspot or ask you to pay more for the functionality.

Sure, there are ways to work around this problem, such as using a VPN, but all of them leave detectable traces, so they will be detected and guarded against. It will be like using an ad blocker or the Tor network nowadays: perfectly possible, but companies will try to detect it and ban it. The media can also create a narrative that the people who use these technologies are hackers or pirates.

Thankfully, this is not the reality now. Anyone who has an internet connection can use it in whatever ways they want, including connecting an arbitrary number of devices, using their own router instead of the ISP-provided one, while giving the ISP limited information about individual devices. I hope it will stay the same even without NAT for IPv6.

Why I Use Windows on Desktop Rather than Linux

2023-10-29T00:00:00+00:00

I love using Linux on servers. I run web servers, write code, and do experiments with interesting projects using Linux (specifically Debian and Ubuntu). This blog post explains why I don’t use Linux on desktop.

I am sorry. I really love Linux and open-source software, but realistically it never worked very well for me. I think this is partially due to the small user base that Linux has and partially due to design choices like the lack of backward compatibility, and the lack of “bloatware”.

I had experimented with using Linux on Desktop for a while. I tried various distros like Ubuntu, Fedora, and Manjaro. I even built an Arch with i3, Rofi, and Polybar (I use Arch btw). Most of the time, installing and getting apps to work takes a long time, especially if I am learning about it for the first time.

Backward Compatibility is Bad on Linux

Many open-source projects value new features, optimization, and the “right” way to write code more than maintaining backward compatibility, which often means maintaining a bad design. As a programmer, I too hate having hacky workarounds and messy code lying around in the code base. However, as a user, I don’t care. I just want my app to work even if the program runs a little slower or takes a bit more RAM and storage space.

One example of broken backward compatibility is glibc dropping the DT_HASH section from its own libraries (in version 2.36) in favor of the better-implemented DT_GNU_HASH, because the developers of glibc (rightly) think that everyone should be using DT_GNU_HASH. But this broke software like Easy Anti-Cheat, which relied on DT_HASH. Granted, fixing many breaking changes is trivial: renaming a variable, pointing a path somewhere else, or even just recompiling the code to use the new ABI. However, as a developer, it is only trivial if you know the root cause, such as which line of code is causing the problem. Finding the problem takes time because the place where the software fails is usually not where the problem originates, so one needs to spend a lot of time narrowing down the issue. Adding insult to injury, debugging is made more difficult because all the search results are outdated. Fixing issues is even harder as a user of a program because I don’t have access to the source code or the experience to know how to look for the bug.

There are workarounds to these breaking changes. One project that tries to make programs break less often is Flatpak. Flatpak essentially uses container technology to allow apps to version-lock their dependencies and to manage multiple versions of the same runtime library on a system.

In contrast to Linux, Microsoft really prioritizes backward compatibility. One example of this is the Excel “bug” that treats 1900 as a leap year. This bug was introduced in Lotus 1-2-3, and Microsoft copied the behavior to ensure compatibility with Lotus 1-2-3. This “bug” was never fixed and is even included in the formal specification of Excel to ensure that spreadsheets that used to work continue to work, even though the behavior is not correct. There are more examples of Microsoft maintaining backward compatibility at the cost of correctness and ease of use for new developers. The Win32 ABI is still maintained to this day with its numerous flaws and idiosyncrasies. Developer Arek Hiler even wrote a blog post titled “Win32 Is The Only Stable ABI on Linux”. UTF-8 character encoding is still not the default and is marked as beta because some programs still use non-UTF-8 encodings for non-English characters, such as a Chinese stock trading app called Zhao Shang Zheng Quan. A new version of PowerShell uses a different folder for the profile path to avoid conflicts and maintain backward compatibility with the old version.
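The 1900 quirk is easy to see in code. Below is a small sketch, not anything taken from Excel itself: `is_leap_year` is the ordinary Gregorian rule, and `excel_serial_to_date` is a hypothetical helper name that converts a serial number from Excel’s default 1900 date system (where serial 1 is 1 January 1900) into a real calendar date. Serial 60 stands for 29 February 1900, a day that never existed, so every later serial is shifted by one.

```python
from datetime import date, timedelta


def is_leap_year(year):
    """Gregorian rule: every 4th year, except centuries not divisible by 400."""
    return year % 4 == 0 and (year % 100 != 0 or year % 400 == 0)


def excel_serial_to_date(serial):
    """Convert a 1900-date-system serial number to a real date.

    Returns None for serial 60, the fictional 29 February 1900.
    """
    if serial < 1:
        raise ValueError("serial numbers start at 1")
    if serial == 60:
        return None
    if serial < 60:
        # Before the phantom day, serial 1 is 1900-01-01.
        return date(1899, 12, 31) + timedelta(days=serial)
    # After the phantom day, everything is shifted by one.
    return date(1899, 12, 30) + timedelta(days=serial)


print(is_leap_year(1900))        # False
print(excel_serial_to_date(59))  # 1900-02-28
print(excel_serial_to_date(60))  # None
print(excel_serial_to_date(61))  # 1900-03-01
```

Fixing the “bug” would mean renumbering every date after February 1900 in every existing spreadsheet, which is exactly the kind of breakage Microsoft avoids.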

Microsoft’s obsession with backward compatibility even extends to UI elements. I used to laugh at Windows for still having two settings pages: one called Settings, which took its current form in Windows 10, and the other called Control Panel, which looks much as it did in Windows 7 and dates back far earlier. The two pages share a lot of the same functions but have different UI layouts, and the inconsistency makes the OS look very ugly. I used to wish for a complete overhaul and unification, but now I understand and appreciate the reasons for including both programs. I can still change my settings in the same way as I did 10 years ago. Every guide, even ones written for Windows 7, still works. Even though I haven’t tried, I suspect hacky scripts that interact with the computer through graphical UI elements and mouse clicks would more or less still work with minimal changes.

Lack of Bloatware Makes Installing Software Difficult

Just like the lack of backward compatibility, the lack of “bloatware” contributes to a complicated user experience on many Linux distributions. Many packages on Linux ship with the bare minimum to give the user fine control over the features they want to include and to reduce “bloat”. For example, when I install xorg-server on Arch (a program that basically coordinates GUI applications and how their windows are displayed on the screen), it doesn’t come with xinit, which is used to start xorg-server. The reason is that xorg-server and xinit are independent programs, and there are many other ways to start xorg-server that do not use xinit. I appreciate this modular approach, but I think most people would want xinit with xorg-server.

It took me quite a while, especially as I was installing xorg for the first time, to realise xinit was missing. I was trying to figure out if I had installed xorg-server wrong or if my PATH variable was messed up. When something doesn’t work, it always takes time to narrow down the cause and fix it, even if the fix is just one simple install. I would rather have spent the extra bandwidth and disk space installing packages I didn’t need than waste a lot of time hunting down the exact missing package. Also, many distributions don’t even come with fonts for other languages like Chinese. It takes a while to find out what the exact Chinese font package is called and how to install it, especially the first time.

The lack of bloat does make the system use minimal RAM and storage. When there is nothing running, my Windows installation on my desktop takes a whopping 10GB whereas my Linux installation only uses a modest 2GB. But I think the extra usage is a price worth paying for a simplified user experience.

Sunk Cost Fallacy and My Struggle with It

2023-10-15T00:00:00+00:00

I went to have a driving test a few days ago and I failed because of one simple mistake — moving off without giving way. I was quite devastated because I had spent 40 hours plus thousands of pounds taking driving lessons. Rationally, that is just the sunk cost and should be ignored but I still feel very upset and even lost sleep because of it. That made me reflect on why I experienced this.

Sunk Cost Should be Ignored

A good way to see if a cost is sunk is to consider the alternative. If I had passed the driving exam, I would have spent the same time and money on it. The cost has been paid either way, so it is a sunk cost and should be ignored.

With that in mind, let’s consider this the right way. In general, when making a decision, only the difference in cost and benefit between two choices should be considered. Since sunk cost exists for any choice you make, it should be disregarded. Currently, I have two choices, to continue trying to get a driver’s license or not. The difference in cost is the 15 hours of lessons my instructor asks for, and the difference in benefit is whether or not I get a driver’s license.
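Here is the same argument as a few lines of Python, purely as an illustration. The 40 hours already spent and the 15 further hours of lessons come from my situation above. The value of 10 hours that I put on holding a license is a made-up number, and `net_value` and `best_option` are hypothetical helper names. The point is only that subtracting the same sunk cost from every option never changes which option comes out on top.

```python
def net_value(benefit, future_cost, sunk_cost=0.0):
    """Value of an option after subtracting every cost, including the sunk one."""
    return benefit - future_cost - sunk_cost


def best_option(options, sunk_cost=0.0):
    """options maps a name to (benefit, future_cost); returns the best name."""
    return max(options, key=lambda name: net_value(*options[name], sunk_cost))


# Everything is measured in hours. The benefit of 10 is an assumed figure.
options = {
    "continue lessons": (10.0, 15.0),  # (benefit of a license, 15 more hours of lessons)
    "stop": (0.0, 0.0),
}

for sunk in (0.0, 40.0):
    print(sunk, best_option(options, sunk))
# 0.0 stop
# 40.0 stop
```

Whether I count the 40 hours or not, the answer is the same, which is exactly why they can be left out of the comparison.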

Does the benefit outweigh the cost? No. The answer seems very clear. It is not worth it because I don’t have much of an opportunity to drive now. I live close to the lecture halls and biking is sufficient. Also, I would be better off spending the time catching up on some coursework in order to get a higher grade than getting a driver’s license. So the decision is made. Time to move on. Right?

Not quite. I struggled to move on because I thought that I had spent so much time and money practicing driving and studying for the theory test for essentially absolutely nothing in the end. If I had just continued learning and eventually passed the test, the time and money would not have been wasted. Even though I kept reminding myself the time and money already spent is a sunk cost and should not be considered in any circumstances, something about wasting time and money for nothing is so deeply upsetting that I could not just let the thought go.

Why Sunk Cost Fallacy

Disregarding sunk cost was the conclusion of logical and rational thought, but committing the fallacy seems to be an emotional response. So it got me thinking that considering sunk costs might have had an evolutionary advantage in the past.

In a general sense, sunk cost does matter if the goal gets easier to achieve with more effort and if giving up halfway means you have to spend more effort the next time you try to achieve it. For example, if you are chasing prey, the longer you have chased it, the easier it is to catch because the prey gets more tired, compared to well-rested prey that you might want to catch next time. The cost of continuing is lower than the cost of restarting later towards the same goal of getting food to survive. Crucially, the sunk cost is roughly the same as the difference in cost between the two alternative choices. We keep track of sunk cost as a proxy for the cost difference because actively comparing the costs and benefits of two alternatives takes significantly more brain power than keeping track of the sunk cost. That is why, I think, we evolved to value sunk cost.

As society evolves, the environment becomes vastly different. In many cases, the more effort we spend on something we struggle to achieve, the more future cost it involves. For example, the more you spend on a failing project, the more it serves to prove that either the project is more difficult or you are worse at it than you thought. Similarly, if you keep losing money on an investment, it means either the investment is bad or you suck at investing. The alternative, moving on to a completely new project, might cost less than continuing the same project for the same amount of profit. In other words, the higher the sunk cost, the higher the future cost gets compared to the alternative of moving on to another project. The decision is usually very difficult because you not only have to overcome the sunk cost fallacy but also admit that you made a wrong decision in the first place. In my case, I have to admit that it was wrong to try to get a driver’s license in the first place, as I underestimated the difficulty and overestimated its usefulness.

Resolution

Society changes way faster than humans can evolve. This is why we live with many unfortunate vestiges of evolutionary advantages, the sunk cost fallacy being one of them. That’s why we have to make decisions carefully and, if it helps, comfort our emotions by framing the decision in some other way. For example, by thinking that I will get my driver’s license eventually after this short break, I can feel a lot better about the decision.

To update or not to update, that is the question

2023-10-01T00:00:00+00:00

Picking up a coding project that I haven’t touched for a year or two, I have come to expect that nothing about it still works and I need to update all of its components and my code to get it to work again. I didn’t realize how strange this was until I compared it to other things. If I leave a book unattended for many years, it won’t just fall apart for no reason and if I pick up my camera after a few years, it will still take pictures.

So why is this not the same with software? Why do we have to deal with constant updates that risk introducing new bugs into existing systems that work perfectly fine?

Security Update

The main culprit: security updates. Because everything is connected to the internet all the time, your devices face constant threats. Your phone can get hacked just by reading a message because of a stack overflow, a sandbox escape, etc. These are bugs (logical errors made by programmers) that grant control to unauthorized parties. Compare this to things that are not connected to the internet: unless someone has physical access to something, they cannot control it. Because the laws of physics are written by God, there aren’t any bugs or exploits, so things that don’t connect to the internet never need to be updated to fix security vulnerabilities.

You might ask: I was fine using the software for so long before a fix was applied, so why would I not be fine after the security fix is released? The answer: no one knew about the bug beforehand (or at least, few people knew), but after the fix is published, especially for open-source software, everyone knows about it and can develop exploits for unpatched systems. Your risk of getting hacked therefore increases after a security update is published.

New Incompatible Features

Security updates explain why we need updates even if we don’t want them. Updates break existing features for a different reason. It’s because of the tradeoff between maintaining backward compatibility with old technology and the new features and quality of life improvement brought by new technology.

In real life, we make this tradeoff all the time. We updated our method of communicating from writing letters to sending text messages over the internet because the benefits are huge, even though that meant learning new ways of communicating and adapting our lifestyle around them. But we didn’t update our keyboard layout to something more efficient because it’s difficult to learn a new keyboard layout for a small gain in efficiency.

The same tradeoff applies to the software world. While it is technically easy to maintain all the backward compatibility of a piece of software, this sometimes comes into conflict with new features. Usually, an update to break backward compatibility is made to fix problems with confusing design and introduce new features that are not allowed by the old design.

Windows has a good reputation for maintaining backward compatibility. One example is the Control Panel. When Microsoft introduced a new settings app in Windows 8, it kept the Control Panel that Windows 7 users knew, and the Control Panel is still in Windows to this day, 14 years after Windows 7 was released. This confuses new users because there are two very different programs that do the same thing. It looks ugly because they use different themes. The advantage is clear though. I can still follow tutorials written years ago, even those written for Windows 7, and what I remember from my childhood still works.

For other software, the thought process is the same. The specification of a programming language gets updated to fix confusing designs that cause pitfalls for new and unfamiliar users, or to make it easier to maintain and develop programs. Depending on the popularity of the new version, developers decide if they should drop support for the older version. It took the developers of Python more than 10 years to drop support for the older version 2 after the new version 3 came out. For Perl, version 5 was so popular that its successor, version 6, was eventually renamed (to Raku) because of the incompatibility and people’s desire to keep using the old one despite its numerous pitfalls and stupid design choices.
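Two of the best-known Python 2 to 3 breaks can be checked from Python 3 itself. This is only a small sketch: `runs_on_python3` is a hypothetical helper name, and it only checks whether a snippet parses on the interpreter running it, without executing the snippet.

```python
def runs_on_python3(source):
    """Return True if the snippet parses on this (Python 3) interpreter."""
    try:
        compile(source, "<example>", "exec")
        return True
    except SyntaxError:
        return False


print(runs_on_python3('print "hello"'))   # False: the Python 2 print statement is gone
print(runs_on_python3('print("hello")'))  # True
print(3 / 2)    # 1.5 (Python 2 gave 1 for the same expression)
print(3 // 2)   # 1, the explicit way to ask for the old behaviour
```

The division change is the nastier of the two: old code still runs, it just quietly produces different numbers.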

Maintaining backward compatibility is like not changing your room layout because, if you change it, the robot cleaner gets confused and stops working, even though the current layout has been shown to be illogical. Whether to change the layout depends on how important the robot is, how difficult it is to update it to work with the new layout, and how much more logical the new layout is.

When software gets updated, it becomes harder and harder to develop fixes for the older versions because the code bases diverge over time. When developers decide to drop support for an old version (usually meaning they stop applying new security fixes to it), users are forced to upgrade to the new version lest they leave their machines vulnerable to hacks. This is why programs constantly need updates, and why updates sometimes break things that already work.