Skip to content

Oreilly

Cloud Computing

  • AWS: a cloud infrastructure platform offered by Amazon. It provides scalable compute, storage, networking, and managed services for hosting applications and data systems.

  • Cloud Platforms: provide on-demand infrastructure and managed services for computing, storage, networking, and application deployment across distributed data centers.

Data Engineering

  • Data Warehouse: a centralized system optimized for analytical queries, reporting, and historical data analysis across structured datasets.

  • Apache Spark: a distributed data processing engine designed for large-scale batch and streaming workloads.

  • Kafka: a distributed event streaming platform used for real-time data pipelines and message-driven architectures.

  • Data Lake: a centralized storage system that holds raw, structured, semi-structured, and unstructured data at scale.

  • Streaming & Messaging systems: enable real-time data flow between producers and consumers with low latency.

  • NoSQL Databases: store non-relational data and are optimized for scalability, flexibility, and high-throughput workloads.

AI & ML

  • Generative AI: focuses on models that create new content such as text, images, code, or audio based on learned patterns.

  • Machine Learning: enables systems to learn patterns from data and make predictions without explicit programming.

  • Artificial Intelligence (AI): encompasses systems that perform tasks requiring reasoning, perception, or decision-making.

  • Deep Learning: uses multi-layer neural networks to model complex patterns in large datasets.

  • Reinforcement Learning: trains agents to make sequential decisions through reward-based feedback.

  • NLP: enables computers to understand, interpret, and generate human language.

  • MLOps: focuses on deploying, monitoring, and maintaining machine learning models in production environments.

Software Architecture

  • Object-Oriented: an architecture that organizes software around objects that encapsulate data and behavior.

  • Distributed Systems: consist of multiple networked components that coordinate to achieve a common goal.

  • Domain-Driven Design: structures software around core business concepts and domain logic.

  • Architectural Patterns: provide reusable solutions to common system design problems.

IT / Ops

  • Kubernetes: a container orchestration platform for deploying, scaling, and managing containerized applications.

  • Docker: enables applications to be packaged and run in isolated, portable containers.

  • GitHub: a platform for source control, collaboration, and software development workflows.

  • Computer Networking: the design and operation of systems that transmit data between computers and devices.