Oreilly
Cloud Computing¶
-
AWS: a cloud infrastructure platform offered by Amazon. It provides scalable compute, storage, networking, and managed services for hosting applications and data systems.
-
Cloud Platforms: provide on-demand infrastructure and managed services for computing, storage, networking, and application deployment across distributed data centers.
Data Engineering¶
-
Data Warehouse: a centralized system optimized for analytical queries, reporting, and historical data analysis across structured datasets.
-
Apache Spark: a distributed data processing engine designed for large-scale batch and streaming workloads.
-
Kafka: a distributed event streaming platform used for real-time data pipelines and message-driven architectures.
-
Data Lake: a centralized storage system that holds raw, structured, semi-structured, and unstructured data at scale.
-
Streaming & Messaging systems: enable real-time data flow between producers and consumers with low latency.
-
NoSQL Databases: store non-relational data and are optimized for scalability, flexibility, and high-throughput workloads.
AI & ML¶
-
Generative AI: focuses on models that create new content such as text, images, code, or audio based on learned patterns.
-
Machine Learning: enables systems to learn patterns from data and make predictions without explicit programming.
-
Artificial Intelligence (AI): encompasses systems that perform tasks requiring reasoning, perception, or decision-making.
-
Deep Learning: uses multi-layer neural networks to model complex patterns in large datasets.
-
Reinforcement Learning: trains agents to make sequential decisions through reward-based feedback.
-
NLP: enables computers to understand, interpret, and generate human language.
-
MLOps: focuses on deploying, monitoring, and maintaining machine learning models in production environments.
Software Architecture¶
-
Object-Oriented: an architecture that organizes software around objects that encapsulate data and behavior.
-
Distributed Systems: consist of multiple networked components that coordinate to achieve a common goal.
-
Domain-Driven Design: structures software around core business concepts and domain logic.
-
Architectural Patterns: provide reusable solutions to common system design problems.
IT / Ops¶
-
Kubernetes: a container orchestration platform for deploying, scaling, and managing containerized applications.
-
Docker: enables applications to be packaged and run in isolated, portable containers.
-
GitHub: a platform for source control, collaboration, and software development workflows.
-
Computer Networking: the design and operation of systems that transmit data between computers and devices.