Exploratory Review of AI Concepts and Tools

AI Concepts

LLM stands for Large Language Model in the context of artificial intelligence and machine learning. It is a type of deep learning model trained on massive amounts of text data to understand and generate human-like language. Examples include GPT-4, Claude, Gemini, and LLaMA.

Transformer is a deep learning architecture introduced by Vaswani et al. in the 2017 paper "Attention Is All You Need". It changed how models process sequential data such as text, and it forms the backbone of models like GPT, BERT, Claude, and Gemini. A Transformer uses self-attention to weigh the importance of the other words in a sequence when encoding each word, which lets it capture context even between words that are far apart.
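
To make the self-attention idea concrete, here is a minimal sketch of scaled dot-product attention in plain Python. It is a simplification under stated assumptions: the toy token vectors are made-up numbers, and they are used directly as queries, keys, and values, whereas a real Transformer first applies learned projection matrices and runs several attention heads in parallel.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(queries, keys, values):
    """Scaled dot-product attention over a sequence of token vectors."""
    d_k = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Score this token against every token in the sequence.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in keys]
        weights = softmax(scores)
        # The output is the weighted average of the value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values)) for j in range(len(values[0]))]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights

# Three toy 2-dimensional token vectors (illustrative numbers only).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
# Assumption: tokens double as queries, keys, and values (no learned projections).
outputs, weights = self_attention(tokens, tokens, tokens)
```

Each row of `weights` shows how much one token attends to every token in the sequence, regardless of how far apart they are. Here the third token, which overlaps with both of the others, gives itself the largest weight.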

Prompt Engineering 🗣️✍️ → 🧠💬 is the practice of crafting inputs (prompts) that guide large language models (LLMs) like GPT-4, Claude, or Gemini toward accurate, useful, or creative outputs.
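
As an illustration, the sketch below contrasts a vague prompt with a structured one that states the task, gives context, shows examples, and fixes the output format. The `build_prompt` helper and its field names are hypothetical, chosen only for this example. No model is called; the code just assembles the text that would be sent.

```python
def build_prompt(task, context, examples, output_format):
    """Assemble a structured prompt from labeled parts."""
    lines = [f"Task: {task}", f"Context: {context}", "Examples:"]
    for sample_input, sample_output in examples:
        lines.append(f"  Input: {sample_input}")
        lines.append(f"  Output: {sample_output}")
    lines.append(f"Respond in this format: {output_format}")
    return "\n".join(lines)

# A vague prompt leaves the model to guess what is wanted.
vague_prompt = "Tell me about this review."

# A structured prompt spells out the task, the setting, and the expected answer shape.
structured_prompt = build_prompt(
    task="Classify the sentiment of the customer review.",
    context="Reviews come from an online electronics store.",
    examples=[("Battery died in a week.", "negative"), ("Works perfectly.", "positive")],
    output_format="one word: positive, negative, or neutral",
)
print(structured_prompt)
```

Including a couple of worked examples in the prompt, as done here, is commonly called few-shot prompting.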

RAG 🔍📄 ➕ 🧠 = 🧠💬 stands for Retrieval-Augmented Generation, a technique that enhances large language models (LLMs) by giving them access to external information at inference time. On its own, an LLM like GPT or Claude is limited to what it saw during training. RAG addresses this by combining: (1) Retrieval: pulling relevant documents or facts from an external knowledge base (e.g., a database, website, or set of PDFs). (2) Augmented Generation: feeding those retrieved results into the LLM so it can generate more accurate and up-to-date responses.
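
The two steps can be sketched in a few lines of Python. This is a toy version under loud assumptions: retrieval ranks documents by simple word overlap (production systems typically use embeddings and a vector index), the knowledge base is three invented sentences, and the final prompt is only built, not sent to a model.

```python
import re

def tokenize(text):
    """Lowercase a string and split it into a set of words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def retrieve(question, documents, top_k=1):
    """Retrieval step: rank documents by how many words they share with the question."""
    q_words = tokenize(question)
    ranked = sorted(documents, key=lambda d: len(q_words & tokenize(d)), reverse=True)
    return ranked[:top_k]

def build_augmented_prompt(question, documents, top_k=1):
    """Augmentation step: place the retrieved text in front of the question."""
    context = retrieve(question, documents, top_k)
    context_block = "\n".join(f"- {doc}" for doc in context)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context_block}\n"
        f"Question: {question}"
    )

# Invented knowledge base, for illustration only.
knowledge_base = [
    "The office is closed on public holidays.",
    "Refunds are processed within 14 days of the return request.",
    "Support is available by email on weekdays.",
]
question = "How many days does a refund take?"
prompt = build_augmented_prompt(question, knowledge_base)
print(prompt)
```

The LLM then answers from the retrieved sentence rather than from its training data alone, which is what keeps the response current.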

Fine-tuning 📊 ➕ 🧠🛠️ = 🎯 is the process of taking a pre-trained LLM (such as GPT, LLaMA, or Claude) and training it further on custom, domain-specific data to specialize its behavior.
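
The same principle can be shown at toy scale. In the sketch below, a one-feature linear model stands in for the LLM: its "pre-trained" parameters are assumed values, and training simply continues from them on a small invented domain dataset. Real fine-tuning updates millions or billions of weights, but the shape of the procedure (start from existing weights, keep training on new data) is the same.

```python
def mean_squared_error(w, b, data):
    """Average squared gap between predictions w*x + b and targets y."""
    return sum((w * x + b - y) ** 2 for x, y in data) / len(data)

def train(w, b, data, learning_rate, epochs):
    """Plain gradient descent on a one-feature linear model y = w*x + b."""
    n = len(data)
    for _ in range(epochs):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b

# Assumed "pre-trained" parameters, as if learned from general data (y = 2x).
pretrained_w, pretrained_b = 2.0, 0.0
# Small invented domain dataset that follows a slightly different rule (y = 2x + 1).
domain_data = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]

loss_before = mean_squared_error(pretrained_w, pretrained_b, domain_data)
# Fine-tune: continue training from the pre-trained values, not from scratch.
tuned_w, tuned_b = train(pretrained_w, pretrained_b, domain_data, learning_rate=0.05, epochs=200)
loss_after = mean_squared_error(tuned_w, tuned_b, domain_data)
print(loss_before, loss_after)
```

Starting from the pre-trained values means the model only has to learn the difference between the general and the domain-specific rule, which is why fine-tuning usually needs far less data than training from scratch.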

AI Agent 🤖🧠 is an intelligent system, often powered by a Large Language Model (LLM), that can autonomously perceive, reason, and take actions to accomplish a goal, typically by interacting with tools, environments, or users.
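
A minimal sketch of that perceive, reason, act loop is shown below. Everything in it is hypothetical: the two tools are toy functions, and `decide` is a hand-written rule standing in for the reasoning an LLM would normally supply.

```python
def calculator(expression):
    """Tool: add the integers in an expression such as '2 + 3'."""
    return str(sum(int(part) for part in expression.split("+")))

def lookup(term):
    """Tool: look a term up in a tiny hard-coded glossary."""
    glossary = {"rag": "Retrieval-Augmented Generation"}
    return glossary.get(term.lower(), "unknown")

TOOLS = {"calculator": calculator, "lookup": lookup}

def decide(goal, observations):
    """Stand-in for the LLM's reasoning step: choose the next action."""
    if observations:
        return ("finish", observations[-1])  # a tool result exists, so report it
    if any(ch.isdigit() for ch in goal):
        return ("calculator", goal)
    return ("lookup", goal)

def run_agent(goal, max_steps=5):
    """Loop until the decision step says the goal is met or steps run out."""
    observations = []
    for _ in range(max_steps):
        action, argument = decide(goal, observations)  # reason
        if action == "finish":
            return argument
        observations.append(TOOLS[action](argument))   # act, then perceive the result
    return "gave up"

print(run_agent("2 + 3"))
print(run_agent("RAG"))
```

The `max_steps` cap is a common safeguard in agent loops, so that a confused decision step cannot run forever.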

LangChain 🗣️ → 🔗🧠 → 🔧 → 💬 is an open-source framework that helps developers build LLM-powered applications, especially ones that go beyond simple prompts by enabling reasoning, memory, tool use, and multi-step workflows. In an application stack it plays the role of the AI brain (logic, reasoning, LLM interaction).
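
The chaining idea can be illustrated without the library itself. The sketch below is plain Python and deliberately does not use LangChain's actual API: the `Chain` class, the step functions, and `fake_llm` are all hypothetical names invented here to show how multi-step workflows and memory fit together.

```python
class Chain:
    """Runs a list of steps in order, passing a shared state dict along."""

    def __init__(self, steps):
        self.steps = steps
        self.memory = []  # past (input, output) pairs kept across calls

    def run(self, user_input):
        state = {"input": user_input, "history": list(self.memory)}
        for step in self.steps:
            state = step(state)
        self.memory.append((user_input, state["output"]))
        return state["output"]

def format_prompt(state):
    """Step 1: build a prompt that reflects how much history exists."""
    turns = len(state["history"])
    state["prompt"] = f"[{turns} earlier turns] User: {state['input']}"
    return state

def fake_llm(state):
    """Step 2: hypothetical stand-in for a model call; it only echoes the prompt."""
    state["output"] = f"Echo of <{state['prompt']}>"
    return state

chain = Chain([format_prompt, fake_llm])
first = chain.run("hello")
second = chain.run("again")
print(first)
print(second)
```

A framework like LangChain provides ready-made versions of these pieces (prompt templates, model wrappers, memory, tool integrations), so the application code mostly describes how they connect.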

Multi-modal 🧠 + 🖼️ + 🔊 + 📝 refers to AI systems that can process and understand multiple types of data (modalities), such as text, images, audio, video, or structured data, either separately or in combination.

AGI 🧠🤖🌐 (Artificial General Intelligence) is a hypothetical AI system with the ability to understand, learn, and apply knowledge across a wide range of tasks, much as a human can.

AI Tools

Cherry Studio is an open-source desktop AI client for Windows, macOS, and Linux. It integrates multiple large language models (LLMs), both cloud-based (such as OpenAI, Gemini, and Anthropic) and local models served through backends such as Ollama or LM Studio, and lets you switch between them within a conversation. 🌐 Website 📘 Docs 🐙 GitHub

n8n is a source-available (fair-code licensed) workflow automation tool with built-in AI capabilities that lets you connect various services (APIs, databases, webhooks, etc.) and automate tasks without writing much code, though custom code is also supported for flexibility. In an application stack it plays the role of the automation backbone (triggering, routing, integration). 🌐 Website 📘 Docs 🐙 GitHub ➡️ Workflows