TL;DR.

This post provides an overview of the various technologies I refer to in this blog. This list is not extensive, yet still emphasizes the importance of understanding, and applying, technological solutions. To excel in the continually-evolving development landscape, adapting to new tools needs to be an integral part of my daily routine.

An Introduction.

This list evolves as I adopt new ways of solving problems. This list will also expand as new technologies are added to my toolkit.

The purpose of this post is to centralise the descriptions of the techniques, and technologies, I employ in my workflow.

The Big Picture.

I usually describe the technologies I use within the posts that I write. As a result, my tool descriptions are usually repeated throughout my blog. This is the result of trying to ensure each post is self-contained. (Sometimes, however, I may need to divide a topic across multiple posts.) Repetitive tool descriptions will happen when using the same tool on multiple projects. Therefore, having a repository of these descriptions makes creating new posts a little easier.

Below are descriptions of many technologies I use to build my solutions.

What is Anaconda and Miniconda?

Python projects run in virtual environments. These isolated spaces are used to manage project dependencies. Different environments are used specifically to avoid conflicting versions of the same program, for instance, different versions of Python.

venv is a virtual environment manager that is bundled with Python since version 3.3. However, venv is not the only environment manager. Anaconda is a Python and R distribution for scientific computing that includes the conda package manager. Miniconda, a bootstrap version of Anaconda, is a virtual environment manager that is small, FREE, and also includes the conda package manager, Python, and other packages that are required or useful to a developer, like pip and zlib.

http://www.anaconda.com/ ↗,

https://docs.anaconda.com/free/miniconda/index.html ↗, and

https://solodev.app/installing-miniconda.

Miniconda must be installed (conda -V) before continuing with this post.

What is AnythingLLM?

AnythingLLM is a free, easy-to-use, and versatile document chatbot. It is made for people who want to chat with, or create, a custom knowledge base using existing documents and websites. RAG (Retrieval-Augmented Generation) is the process of creating custom knowledge bases, which conveniently sidesteps the need for finetuning an LLM (Large Language Model) with LoRA (Low-Rank Adaptation) or QLoRA (Quantised LoRA). The information from the documents and websites are converted, and saved, to a vector database so an LLM can use this data to improve its answers my queries. AnythingLLM can work with many users, or just a single user, from one installation. This makes AnythingLLM great for those who want a ChatGPT experience, and complete privacy, while supporting multiple users in the same setup.

https://useanything.com/ ↗.

What is Astro?

Astro is a frontend meta-framework. A meta-framework makes use of the resources provided by other, underlying frameworks. In the case of Astro, it can import components from React, Svelte, Vue, and other frameworks by using simple directives. There are hundreds of available integrations and API hooks that can be used to customize a project to an exact use case. Highlights of using Astro include Islands, Agnostic UIs, Server-Side Rendering, Zero JS (by default), Content Collections, and Custom Integrations.

https://astro.build/ ↗.

What is a BRD?

A Business Requirements Document (BRD) is used to clarify specific goals, the resources needed to reach those goals, the estimated costs involved, and the projected timeline for completion.

A well-rounded BRD is typically presented to a diverse audience that includes non-technical stakeholders and key decision-makers. This document serves as a critical tool for communicating the project's objectives, scope, and requirements in a manner that is accessible to individuals with varying levels of technical expertise. By clearly outlining the business needs, expected outcomes, and strategic alignment, the BRD ensures that all stakeholders have a comprehensive understanding of the project's goals and the value that the project aims to deliver. It clearly explains the projects' value proposition to managers, investors, and decision makers by showing how the project addresses market demands, the unique benefits the project offers, and how the demand and benefits align with the overall business strategy. This document ensures that stakeholders will understand the technical decisions that drive the project toward meeting its' business goals.

Additionally, the BRD includes detailed technical information that bridges the gap between non-technical stakeholders and the technical teams responsible for implementing the technologies. The technical requirements, system architecture, integration points, and other technical aspects are explained in a way that ensures everyone involved is on the same page. By providing this level of detail, the BRD helps turn business needs into clear technical tasks. This makes communication and collaboration between non-technical and technical teams easier, reduces misunderstandings, and ensures the project runs smoothly.

The inclusion of both high-level business information and detailed technical specifications in the BRD ensures that everyone involved in the project is aware of the business goal, and how technology helps achieve that goal. This comprehensive approach reduces the risk of misunderstandings, aligns expectations, and supports the smooth progression of the project from conception to completion. By addressing the needs and concerns of a mixed audience, the BRD plays a pivotal role in the successful buy-in of complex projects.

This is a business document for business people, not a technology document for technicians and engineers. It is designed to identify the resources and processes that are used to create a digital product. The focus of this business document is to determine how to convert an investment of time, effort, energy, creativity, resources, and money into a viable, digital asset.

What is CHMOD?

The chmod command is used to change the mode of files and directories. Given that ‘u’ is user, ‘g’ is group, and ‘o’ is others, I can change the modes of files and directories using ‘r’ for read, ‘w’ for write, and ‘x’ for execute. For example:

sudo chmod -R u=rwx /media

where:

sudo means to run the command using the root account privileges,
chmod means to (ch)ange the (mod)e of files and directories,
-R means to switch (-) to the (R)ecusive feature,
u means to select the (u)ser account,
rwx means to apply (r)ead, (w)rite, and e(x)ecute permissions, and
/media means to apply the permission changes to the /media directory, and subdirectories.

NOTE: The /media subdirectories are affected due to the (-R)ecursion flag used in the command.

What is CHOWN?

The chown command is used to change the owner of a file.

[pending]

What is Chroma?

Chroma is an open-source vector database used for saving, and using, embeddings. Its main job is to store embeddings, with related meta data, that is used by large language models. Chroma works well as a foundation for text-based search engines and is perfect for handling lots of unstructured and partly structured data.

https://www.trychroma.com/ ↗.

What is Cloudflare?

Cloudflare is a worldwide network of data centres that offer content and services that make websites and apps faster, safer, and more reliable. It's one of the biggest networks in the world. Companies, non-profits, bloggers, and anyone with an online presence enjoy quicker, more secure websites and apps because of Cloudflare. It supports millions of internet properties, and its network grows by tens of thousands daily. Cloudflare handles internet requests for millions of websites and manages, on average, 55 million HTTP requests per second.

https://www.cloudflare.com/ ↗.

What is CUDA?

CUDA is a parallel computing platform and programming model created by NVIDIA. It has been downloaded over 20 million times and helps developers speed up their applications using GPU accelerators. CUDA is used in many fields, not just high-performance computing and research. For example, pharmaceutical companies use CUDA to find new treatments, cars use it to improve self-driving, and stores use it to analyse customer data for recommendations and ads.

Some people think CUDA, launched in 2006, is just a programming language or an API. But with over 150 CUDA-based libraries, SDKs, and tools, it's much more than that. NVIDIA keeps innovating, and thousands of GPU-accelerated applications use the NVIDIA CUDA platform. CUDAs flexibility and programmability make it the top choice for developing new deep learning and parallel computing algorithms.

CUDA also helps developers easily use the latest GPU features, like those in the NVIDIA Ampere GPU architecture.

https://blogs.nvidia.com/blog/what-is-cuda-2/ ↗.

What is Curl?

Curl is a command-line utility that is used to transfer data from, or to, a server. Curl provides a number of options, including:

Resuming transfers,
User authentication,
Bandwidth limiting,
Proxy support, and
Much more.

With curl, you can download (or upload) data using a supported protocol, such as:

DICT,
FILE,
FTP,
FTPS,
GOPHER,
GOPHERS,
HTTP,
HTTPS,
IMAP,
IMAPS,
LDAP,
LDAPS,
MQTT,
POP3,
POP3S,
RTMP,
RTMPS,
RTSP,
SCP,
SFTP,
SMB,
SMBS,
SMTP,
SMTPS,
TELNET,
TFTP,
WS, and
WSS.

What is DaVinci Resolve?

DaVinci Resolve is a video editing, colour grading and correction, visual effects, motion graphics, and audio post-production application for macOS, Windows, and Linux. It was originally developed by da Vinci Systems as da Vinci Resolve. Then in 2009, when da Vinci Systems was acquired by Blackmagic Design, the application was rebranded as DaVinci Resolve.

https://www.blackmagicdesign.com/ ↗.

What is DeepSeek-R1?

According to DeepSeek-R1:14B…

“DeepSeek R1 is a groundbreaking, free, and open-source state-of-the-art reasoning model that leverages reinforcement learning for training without the need for supervised fine-tuning. Developed by DeepSeek, this innovative model stands out for its ability to achieve performance metrics comparable to leading closed-source models from major tech companies like Google, OpenAI, and Anthropic. Its open-source nature, combined with advanced architecture and training methodologies, mark a significant milestone in AI research. DeepSeek is accessible, and valuable, to researchers and developers worldwide.“

What is Distrobox?

Distrobox is a tool that allows me to run any Linux distribution within my terminal. It enables both backward and forward compatibility with any software, and the freedom to use whichever distribution I'm most comfortable with. Distrobox uses Podman, Docker, or Lilipod to create containers using the Linux distribution of my choosing. It aims to run any software, on top of my host system, without any hassle.

https://distrobox.it/ ↗.

What is Docker?

TLDR: Docker is used to containerise an app (and its dependencies) so it can run on any platform.

Docker is a tool for easily deploying, and running my applications on any platform. I can package an application with all its code, libraries, dependencies, and tools, which allows me to deploy that app as a single bundle. Docker guarantees that my application will run on any computer that also runs Docker.

https://ubuntu.com/tutorials/how-to-run-docker-inside-lxd-containers ↗,

https://solodev.app/installing-docker, and

https://www.docker.com/ ↗.

Docker must be installed (docker -v) before continuing with this post.

What is Docker Compose?

Docker Compose is a tool that helps define and run applications with multiple containers. It makes development and deployment smoother and more efficient. Compose makes it simple to control my whole application stack, managing services, networks, and volumes in one easy-to-understand YAML configuration file. I can create, and start, all the services from my configuration file with just one command. Compose works in all environments, like production, staging, development, testing, and CI/CD. It also has commands to manage my applications' entire lifecycle.

https://docs.docker.com/compose/ ↗.

What is a Docker Container?

Docker Containers are ephemeral (temporary) system simulations that run on PCs. Containers, which are created from Docker Images, are virtual spaces that separates my running applications from the base OS and other apps. Containers are easy to run on other systems that also support Docker.

https://docs.docker.com/guides/walkthroughs/what-is-a-container/ ↗.

What is Docker Desktop?

TL;DR: Docker Desktop is used to build, share, and run containerized applications and microservices.

Docker Desktop is a developer utility for my Mac, Linux, or Windows environment that lets me build, share, and run containerized applications and microservices. It provides a straightforward GUI (Graphical User Interface) that lets me manage my applications, containers, and images directly.

https://docs.docker.com/desktop/ ↗.

What is Docker Hub?

Docker Hub is a place where developers and open source contributors can find, use, and share container images. It lets developers host public repos for free or private repos for teams and businesses. Docker Hub is the biggest collection of container images, with content from community developers, open source projects, and software companies creating and sharing their code in containers.

https://docs.docker.com/docker-hub/ ↗.

What is a Docker Image?

Docker Images are immutable (permanent) files that are used to run code in Docker Containers. They also serve as a guide for building Docker Containers, like a template. Images are the starting point when using Docker. They are similar to a snapshot in virtual machine (VM) environments.

https://docs.docker.com/build/building/packaging/ ↗.

What is Docker Scout?

Docker Scout helps keep software safe by analysing images, quickly finding vulnerabilities, giving suggestions to fix problems, and more. It is made for developers and works well with Docker. With Docker Scout, I spend less time looking for, and fixing, issues and more time working on my code. The Docker Scout CLI plugin is a powerful, yet flexible terminal interface.

https://docs.docker.com/scout/ ↗.

What is Docker Swarm?

Docker Swarm mode is part of the current versions of Docker and helps manage a group of Docker Engines called a swarm. With the Docker CLI, I can create a swarm, put app services into it, and control how the swarm works. Docker Swarm mode is built directly into the Docker Engine.

https://docs.docker.com/engine/swarm/ ↗.

What is a Domain Name?

The Internet uses an octet (8-bit) numbering system to identify the locations of the numerous, online resources. These locations are called IPs, or Internet Protocol v4 addresses. (IP v6 addresses are also available, but they are rarely used.) Trying to remember 4 sets of numbers is tiresome so, in the early 1970s, domain names were introduced. When I type a domain name into the address bar of a browser, that name resolves to a specific IP address thanks to the Domain Name System, or DNS. DNS is the phone book of the Internet and uses a distributed service that resolves domain names to IP addresses. (New domains can take up to 48-hours to resolve.) Now that the browser knows the IP address, it can start interacting with the services at that given location. The computer at that specific IP address may be hosting any number of services, including (but not limited to):

Web servers,
Email servers,
Gaming servers, and
FTP (File Transfer Protocol) servers.

Every device gets an IP address when it connects to the Internet. These dynamic IP addresses are pooled by each Internet Service Provider, or ISP. When a device disconnects from the Internet, the IP address is released and returned to the ISP pool. Companies (and people) that use domain names typically rent space on someone else's servers. They go to their domain name provider, point the domain name(s) to the DNS (Domain Name System) run by the host, activates some settings on the hosting service, and the domain name eventually points to the provided service (web hosting, email server, etc.) Less frequently, people like me rent static IPs from our ISPs, pass our domain name(s) through Cloudflare, use Cloudflare to resolve our domain names to our static IP addresses, and host our own services (web servers, email servers, etc.) on our own premises.

What is Elia?

Elia is a terminal-based app for interacting with LLMs. It's designed to be keyboard-focused, efficient, and fun to use. It saves your conversations in a local SQLite database and lets you interact with different models. You can chat with proprietary models like ChatGPT and Claude, or with local models using ollama or LocalAI.

https://github.com/darrenburns/elia ↗.

What is FFmpeg?

FFmpeg is a top multimedia tool that can decode, encode, transcode, mux, demux, stream, filter, and play almost any format created by humans or machines. It supports both old and new formats, whether made by a standards group, the community, or a company. FFmpeg is also very portable: it can be compiled, run, and tested on Linux, Mac OS X, Windows, BSDs, Solaris, and more, across different build environments, machine types, and setups.

https://ffmpeg.org/ ↗.

What is Flowise?

Flowise is an open-source, low-code platform that helps me easily create customized AI workflows and agents. It simplifies the development of AI applications, which usually require many iterations, by allowing for quick changes from testing to production. Chatflows link AI models with various tools like memory, data loaders, and cache, along with over a hundred other integrations including LangChain and LlamaIndex. This setup enables the creation of autonomous agents and assistants that can perform diverse tasks using custom tools. I can build functional agents and OpenAI assistants, or opt for local AI models to save costs. Flowise supports extensions and integrations through APIs, SDKs, and embedded chat features. It is platform-agnostic, meaning Flowise can work with local, open-source AI models in secure, offline environments using local data storage. It is compatible with various platforms and technologies like Ollama, HuggingFace, AWS (Amazon Web Services), Azure, and GCP (Google Cloud Platform), offering flexibility in deployment.

https://docs.flowiseai.com/ ↗.

What is Gatsby?

Gatsby is an open-source, React-based, static site generator (SSG) framework. It is used to build websites that are performant, scalable, and secure. I can also pull data from a headless CMS.

https://www.gatsbyjs.com/ ↗.

What is Git?

Git is a version control manager created by Linus Torvalds in 2005 and, since then, this project has been maintained by Junio Hamano. This utility helps me track the changes I make to my code. Git is used by development team members, who collaborate on software projects, to merge their individual changes to a local, or remote, repository (repo) at the end of each day.

https://git-scm.com/book/en/v2/Getting-Started-What-is-Git? ↗.

Git must be installed (git -v) before continuing with this post.

What is GitHub?

GitHub hosts a collection of remote repos where local changes to a code base can be saved to an off-site location. These remote repos can either be public or private. Alternatives to GitHub include GitLab and Bitbucket.

https://docs.github.com/en/get-started/using-git/about-git ↗.

What is GitLab CE?

GitLab CE (Community Edition) is an open source, end-to-end software development platform with built-in version control, issue tracking, code review, CI/CD, and more.
I can self-host GitLab CE on my own servers, in a container, or on a cloud provider.

https://gitlab.com/free-releases/gitlab-ce ↗.

What is GPT Pilot?

GPT Pilot is the main engine that powers the Pythagora VS Code extension. (The developers of Pythagora want it to be the first, true AI assistant for developers.) Together, GPT Pilot and Pythagora do more than just fill in missing code or help with pull request messages. GPT Pilot helps Pythagora act like a real AI developer that can write complete features, fix bugs, discuss problems, request reviews, and do many other tasks related to coding.

What is Jekyll?

Jekyll is a free, Ruby-based, open-source tool for creating static websites. I don't need to know Ruby; It just needs to be installed. It is ideal for various static websites, such as personal blogs, portfolios, and documentation sites, eliminating the need for databases or content management systems. Jekyll works with text and markdown files, converting them into secure static HTML files, which can be easily hosted without complex systems. It is secure, does not require a database or server-side scripts, and is highly customizable. Additionally, Jekyll has a large community that provides inspiration and assistance.

https://jekyllrb.com/docs/ ↗.

What is Jupyter Notebook?

Jupyter Notebook is an open source web application that lets me create, and share, documents that include live code, equations, visuals, and text. It is managed by the maintainers at Project Jupyter and started as part of the IPython project. The name "Jupyter" reflects the main languages it supports: Julia, Python, and R. It comes with the IPython kernel for Python programming, but I can also use over 100 other kernels.

https://jupyter-notebook.readthedocs.io/en/latest/ ↗.

What is LangChain, LangSmith, LangeServe, and LangGraph?

LangChain is a Python and JavaScript library that helps me build language model applications. It uses these models to help with tasks like answering questions, creating text, or performing other tasks.

LangSmith is a tool for creating professional LLM applications. It helps me debug, test, check, and keep an eye on chains, and smart agents, made with any LLM framework, and is designed to work smoothly with LangChain.

LangeServe is a platform for deploying and maintaining my LLM applications. It helps me turn any LangChain runnable, or chain, into a REST API, making my LangChain projects ready for real-world use.

LangGraph is a tool for creating complex applications with LLMs that involve multiple parts working together. It is designed to work with LangChain and adds new features to the LangChain Expression Language, helping me manage several chains (or parts) through many steps in a loop.

https://python.langchain.com/docs/get_started/introduction ↗,

https://docs.smith.langchain.com ↗,

https://www.langchain.com/langserve ↗, and

https://python.langchain.com/docs/langgraph ↗.

These LangChain tools must be installed (pip show langchain) before continuing with this post.

What is Langflow?

Langflow is a user-friendly interface for LangChain, built using react-flow. It lets me easily create, and test, different setups by dragging and dropping components and using a chat box. Each part of the graph in Langflow works on its own. It's designed to help me quickly try out new ideas and build AI applications without needing to write much code. Langflow comes with ready-to-use components that I can mix and match to build AI apps. It fits perfectly with the tools I already use and makes it simple to develop basic, or advanced, AI applications. Langflow is all about making AI integration straightforward while expanding what I can do with AI in both tests and real-life use cases.

https://docs.langflow.org/ ↗.

What is LiteLLM?

LiteLLM is a proxy that provides LLMs with IP addresses. It is a unified interface that calls 100+ LLMs using the same Input/Output format, supporting OpenAI, Huggingface, Anthropic, vLLM, Cohere, and even custom LLM API services.

https://litellm.ai/ ↗.

LiteLLM must be installed (litellm -v) before continuing with this post.

What is LXD and LXC?

The LXD (LinuX Daemon) is the container manager that is used to create, and manage, LXCs (LinuX Containers). It is a background service that can automatically start LXCs when the host system boots, or stop any container from starting at all.

A LXC (LinuX Container) is an isolated, OS-level virtualization which, for efficiency, uses the Linux kernel of the host system. It is a virtual environment where system processes within the LXC can not affect other containers, or the host system, without specifically running certain commands.

https://ubuntu.com/tutorials/how-to-run-docker-inside-lxd-containers↗,

https://ubuntu.com/server/docs/containers-lxd↗,

https://ubuntu.com/server/docs/containers-lxc↗, and

https://solodev.app/installing-lxd-and-using-lxcs.

What is MAX?

MAX (Modular Accelerated Engine) is a set of tools that makes it easier for me to work on AI projects. It includes the MAX Engine, MAX Serving, and the Mojo programming language. These tools help me create, and run, the next generation of machine learning solutions.

https://www.modular.com/max ↗.

What is MemGPT?

MemGPT is a memory manager that empowers LLMs (large language models) to overcome their limited context windows. It learns about me, modifies its' personality, and when connected to an LLM, assumes the duties of a chatbot. When MemGPT is connected to my own filesystem, tools, databases, and APIs, a new level of LLM proficiency emerges. For instance, combining MemGPT with the AutoGen facility blends pair programming abilities with an infinite memory.

What are Microservices?

Microservices are small, independent "components" that communicate over well-defined APIs. They are usually developed, and maintained, by small teams of programmers. Each microservice is:

Small,
Autonomous,
Loosely coupled,
Focused on one task,
Independently deployable,
Aligned with a bounded context, and
An abstraction at the level of the problem domain.

The microservice architecture enables an organization to deliver large, complex applications rapidly, frequently, reliably, and sustainably - a necessity for deploying my 12 Startups.

https://www.ibm.com/topics/microservices ↗.

What is a Multi-Agent Workflow?

An agent is an entity that is powered by an LLM (large language model). The agent will also have its own prompt, a selection of tools, and some custom code that helps with collaboration. A multi-agent workflow is built with at least two agents that collaborate with each other in order to achieve my specifically defined outcome.

https://microsoft.github.io/autogen/blog/2023/12/01/AutoGenStudio/ ↗,

https://blog.langchain.dev/langgraph-multi-agent-workflows/ ↗,

https://docs.crewai.com/core-concepts/Collaboration/ ↗.

What is NGINX?

NGINX is a free, open-source tool for web hosting, reverse proxying, caching, load balancing, media streaming, and more. It began as a web server focused on top performance and stability. Besides its HTTP server features, NGINX can also work as an email proxy server (IMAP, POP3, and SMTP) and a reverse proxy and load balancer for HTTP, TCP, and UDP servers.

https://www.nginx.com/resources/glossary/nginx/ ↗.

What is NGINX Reverse Proxy?

A reverse proxy is a server that sits behind the firewall in a network and directs client requests to appropriate backend servers. The NGINX Reverse Proxy is a reconfigured NGINX server that provides an additional level of abstraction and control. This go-between server ensures network traffic between clients and servers flows smoothly.

https://www.nginx.com/resources/glossary/reverse-proxy-server/ ↗.

What is NodeJS, NPM, NVM, and PNPM?

NodeJS is a free, JavaScript (JS) server runtime. It runs JS code as single-threaded, non-blocking, asynchronous programs, which makes it very memory efficient. NodeJS performs many system-level tasks, but is commonly used to run JS servers.

NPM (Node Package Manager) is the world's largest software registry. Open source developers use it to share packages with each other. Many organizations use NPM to manage private development as well. Most developers interact with NPM using the CLI (Command Line Interface), and usually ships with NodeJS.

NVM (Node Version Manager) is used to switch between versions of NodeJS. NVM works on any POSIX-compliant shell (sh, dash, ksh, zsh, bash) and runs on Linux distros, macOS, and Windows WSL.

PNPM (Performant Node Package Manager) is a JavaScript package manager, similar to NPM.

https://nodejs.org/en/learn/getting-started/introduction-to-nodejs ↗,

https://docs.npmjs.com/about-npm ↗,

https://github.com/nvm-sh/nvm#intro ↗, and

https://solodev.app/installing-node-and-npm-with-nvm ↗, and

https://pnpm.io/ ↗.

NOTE: NPX is now part of the NPM CLI.

What is Nomic Embed?

Nomic Embed is a text embedding model that is open source and fully reproducible. It can process text up to 8,192 characters long and uses openly available, curated data for training. Text embeddings are crucial for modern NLP (natural language processing) applications that offer enhanced search, or mechanisms that generate responses for LLMs (large language models). Embeddings models turn text into vectors, which are then used in various tasks such as clustering for visualization, classifying information, and retrieving data.

https://blog.nomic.ai/posts/nomic-embed-text-v1 ↗.

What is Ollama?

Ollama is a tool for downloading, setting up, and running LLMs (large language models). It lets me access powerful models like Llama 2 and Mistral and helps me run them on my local Linux, macOS, and Windows systems.

https://ollama.com/ ↗.

Ollama must be installed (ollama -v) before continuing with this post.

What is Open WebUI?

Open WebUI is a flexible, feature-rich, and easy-to-use web interface that runs locally and works completely offline. It supports OpenAI-compatible APIs and the Ollama LLM manager. Ollama is a free, open-source manager that lets me run multiple LLMs on my PC. It uses llama.cpp, an open-source library that helps run LLMs on less powerful hardware. Ollama also includes a feature for downloading LLMs.

https://github.com/open-webui/open-webui ↗.

What is pipx?

pipx is a tool that helps me install and run Python applications in isolated environments. It's similar to macOS's brew, JavaScript's npx, and Linux's apt. It works with pip, but it focuses on installing and managing Python packages that I can run from the command line as applications.

pip is a general-purpose package installer for both libraries and apps that have no environment isolation. pipx is made specifically for application installation, as it adds isolation yet still makes the apps available in my shell: pipx creates an isolated environment for each application and its associated packages. pipx does not ship with pip, but installing it is often an important part of bootstrapping my system.

https://pipx.pypa.io/stable/ ↗.

What is Pythagora?

Pythagora is a tool that helps create apps from the ground up using large language models. It's an extension for VS Code and runs on GPT Pilot, one of the best code generators around. While designed around GPT-4, I have adjusted the GPT Pilot settings so Pythagora can work with local LLMs (large language models).

What is Python?

Python is an easy-to-understand programming language that is ideal for quickly developing applications. It features built-in tools for code organization and code reuse, as well as support for using C and C++ custom extensions. Both the Python interpreter and the standard library are freely available across all major platforms.

Python is a favourite language among programmers because it speeds up the development process. Unlike compiled languages, enhanced productivity is the result of allowing immediate editing, testing, and debugging of your source code.

Debugging in Python is very straightforward; instead of crashing the process that is running your code, the interpreter presents exceptions, or stack traces, for uncaught errors. The debugger also offers:

A source-level debugger for inspecting variables,
A way of evaluating expressions,
A way of setting breakpoints, and
Support for step-by-step code walk-throughs.

These features showcase Python's debugging versatility. The debugger itself is written in Python, and adding print statements to your source code is the fastest debugging approach. This is another benefit of Python's edit-test-debug cycle.

Thanks to this section of the video, you now know that Python:

Is made up of a language, interpreter, and standard library,
Supports custom extensions made with C and C++, and
Uses the edit-test-debug cycle as a coding practice.

https://docs.python.org/3/tutorial/index.html↗.

What is the Python Interpreter?

The Python interpreter is a computer program that converts human-readable, high-level program statements into computer-readable, low-level machine code. In other words, the interpreter translates the high-level, human-readable commands into low-level, computer-executable commands. High-level, human-readable programming languages require low-level, computer-executable translations. In the case of Python and JavaScript, these translations are provided by runtime interpreters. Other high-level languages, like C and Rust, use compilers and linkers to build executable files and libraries.

https://docs.python.org/3/tutorial/interpreter.html ↗.

What is PyTorch?

PyTorch is an open-source deep learning framework known for its flexibility and ease of use. It works well with Python, a popular language among machine learning developers and data scientists. PyTorch is a complete framework for building deep learning models, which are often used in tasks like image recognition and language processing. Since it is written in Python, most machine learning developers find it easy to learn and use. PyTorch was created by developers at Facebook AI Research and other labs. It combines fast and flexible GPU-accelerated back-end libraries from Torch.ch with an easy-to-use Python frontend. This makes it great for quick prototyping, readable code, and supporting many types of deep learning models. PyTorch allows AI engineers to use a familiar programming style while still creating graphs. It was open-sourced in 2017, and its' Python base has made it popular with machine learning developers.

https://pytorch.org/↗.

What is RSA?

Rivest-Shamir-Adleman (RSA) is a type of encryption that is popular in many products and services. It uses two connected keys to lock and unlock data. There is a private key and a public key. The public key can be shared with anyone, but the private key is kept secret by the person who made the keys. With RSA, the public key can lock the data, but only the private key can unlock it.

https://en.wikipedia.org/wiki/RSA_(cryptosystem)↗.

What is Rust?

Rust is a systems programming language that empowers everyone to build reliable and efficient software. It is blazingly fast and memory-efficient: with no runtime or garbage collector, it can power performance-critical services and run on embedded devices. Rust is focused on performance, memory safety, and safe concurrency.

https://www.rust-lang.org/↗.

What is Scrapy?

Scrapy is a fast, open-source, extensible, and powerful Python-based scraping framework that is used to extract website data. Although it was originally designed for web scraping, Scrapy can also use APIs to extract data, or even as a general purpose web crawler. Like other frameworks, there is a steep learning curve, however experienced systems operators and software developers should be able to understand the documents, install this utility, and comfortably use Scrapy without too much effort.

https://scrapy.org/↗.

What is SSH?

The Secure Shell (SSH) protocol is a way to safely send commands to a computer over an unsecured network. SSH uses special codes to check and protect connections between devices. It also allows for tunnelling, or port forwarding, which lets data packets go through networks they couldn't before. SSH is often used to control servers from a distance, manage infrastructure, and transfer files. It lets administrators manage servers and devices remotely. Older methods like Telnet sent commands that anyone could see. But SSH uses a secure connection, which is why it's called Secure Shell.

https://www.cloudflare.com/learning/access-management/what-is-ssh/↗.

What is Streamlit?

Streamlit is a free, open-source framework that helps me make web apps from data scripts using Python, even without front-end knowledge. With Streamlit, I can quickly transform my data scripts into shareable apps. After creating an app, I can deploy, manage, share, and interact with that app using the Streamlit Community Cloud.

https://streamlit.io/↗.

What is Twinny?

Twinny is an AI code completion plugin for VS Code. It is also compatible with the VSCodium editor. Twinny is designed to seamlessly work with locally-hosted LLMs (large language models), frameworks, and tools. It supports FIM (fill in the middle) code completion by providing real-time, AI-based suggestions that help as I type my code. I can also discuss my code with an LLM via the sidebar, where I can:

Get explanations for how a function works,
Ask the LLM to generate tests,
Request code refactoring,
and more.

https://marketplace.visualstudio.com/items?itemName=rjmacarthy.twinny↗.

What is VS Code?

Visual Studio Code (VS Code) is a simple, yet powerful, code editor that works on my computer with versions for Windows, macOS, and Linux. It natively supports JavaScript, TypeScript, and Node.js. Other programming languages and technical abilities are available to VS Code through the use of appropriate Extensions.

https://code.visualstudio.com/↗.

What are Web Page Scrapings?

Web page scraping, or data scraping, is the process of collecting content and data from websites, usually to save it for later use and analysis. Moving information from a website to a spreadsheet is a simple form of web scraping. But when I talk about 'web scrapers,' I'm referring to software 'bots' that automatically visit websites, find important data, and pull out useful information. Almost any data on a website can be scraped and is widely used in data analytics. Web scraping has a dark side and can be used to steal bank details or personal information. It is important to understand the risks, ethics, and legality of web scraping.

https://beautiful-soup-4.readthedocs.io/en/latest/

What are Web URL Scrapings?

URLs (uniform resource locations) describe where specific assets (webpages, email services, downloads, etc.) are found. These are usually fed into the address bar of my browser either manually or via a link on a webpage. URL scraping is the process of finding all the URLs at a specified, domain-name address and then saving the results to a file or database.

What is WhisperAI?

From the GitHub page ↗:

Whisper is a speech recognition model. It is trained on a large set of audio types and can perform multiple tasks like recognizing different languages, transcribing spoken text, and translating multiple languages to English.

https://github.com/openai/whisper ↗:

The Results.

Technology encompasses an array of tools, platforms, and frameworks that aid in various tasks such as software development, web hosting, data management, and more. From Python and Git for coding, to Docker and LXC for virtual environments, each technology has its unique role and benefits. Tools like AnythingLLM and Ollama leverage AI capabilities to enhance my experience and efficiency. Understanding these technologies and their applications can significantly augment my ability to navigate and excel in today's digital world.

In Conclusion.

Technology is a vast, diverse, and constantly expanding arena. Each tool, platform, and framework plays a unique role in tasks like software development, web hosting, data management, and more.

In the ever-evolving digital landscape, understanding these technologies can give me a significant edge.

What's your favourite tech tool and why? Let's discuss below!

Until next time: Be safe, be kind, be awesome.

#Technology #TechTalk #Python #Docker #Git #Cloudflare #Streamlit #DigitalWorld #AnythingLLM #Ollama #Anaconda #Chroma #DockerContainers #DockerCompose #DockerDesktop #DockerHub #DockerImages #DockerScout #DockerSwarm #GitHub #GitLabCE #LXC #LXD #Miniconda #NGINX #NodeJS #NPM #NVM #RSA #SSH #WebHosting #DataManagement #SoftwareDevelopment #AI #VirtualEnvironments

What is Technology?

Or: A Long List of Short Descriptions.

Table of contents