Google · data
Preparation guide for Machine Learning Engineer positions at Google India. Covers their Phone Screen → DSA × 4 → System Design → Behavioral process with technical, behavioral, and HR questions.
What is the difference between a model parameter and a hyperparameter?
Tip: Parameters are learned from data during training (weights, biases). Hyperparameters are set before training and control the learning process (learning rate, number of layers, batch size). You tune hyperparameters with cross-validation; you do not tune parameters directly.
What is transfer learning and when is it most beneficial?
Tip: Transfer learning uses a model pre-trained on a large dataset as a starting point. Most beneficial when: labelled data is scarce, compute budget is limited, or domains are similar (ImageNet to medical imaging). Fine-tune the last layers; freeze early layers.
Explain backpropagation. What is it actually computing?
Tip: Backpropagation computes the gradient of the loss function with respect to each weight using the chain rule. It propagates error signal from output layer backwards. The gradient tells the optimiser (SGD/Adam) how to adjust each weight to reduce loss.
What is the vanishing gradient problem? How is it addressed in modern deep learning?
Tip: In deep networks with sigmoid/tanh activations, gradients shrink exponentially during backprop — early layers learn very slowly. Solutions: ReLU activations, residual connections (ResNet skip connections), batch normalisation, gradient clipping for RNNs.
What is model drift? How do you detect and handle it?
Tip: Data drift: input distribution shifts over time. Concept drift: the relationship between inputs and output changes. Detect with: monitoring prediction score distributions, input feature statistics, and business KPIs. Handle with: scheduled retraining, online learning.
How would you deploy a machine learning model to production at scale?
Tip: Serve via REST API (FastAPI + uvicorn), containerise with Docker, orchestrate with Kubernetes. For low-latency: ONNX runtime or TensorRT. Use a model registry (MLflow) for versioning. A/B test new models via traffic splitting. Monitor prediction latency and drift.
What is the difference between batch inference and real-time inference?
Tip: Batch: run inference on a large dataset offline, results stored (e.g. daily churn predictions). Real-time: single-sample inference on demand (e.g. fraud detection during a transaction). Batch: higher throughput, cheaper. Real-time: low-latency requirement, more infrastructure complexity.
Tell me about an ML model you trained, validated, and deployed. What was the end-to-end pipeline?
Tip: Cover all stages: problem framing, data collection/cleaning, feature engineering, model selection, training, evaluation, deployment, monitoring. Highlight one non-trivial decision at each stage. Show you understand the full MLOps lifecycle, not just modelling.
A model performs well in testing (90% accuracy) but poorly in production (65%). What are the likely causes?
Tip: Training-serving skew: test data distribution differs from production. Target leakage: a feature in training was not available in production. Check feature distributions at serving time vs training time. Log and analyse production inputs.
How do you version control ML models and datasets?
Tip: Model versioning: MLflow, DVC, or Weights & Biases — track hyperparameters, metrics, artifacts per experiment. Dataset versioning: DVC or cloud storage with immutable versioned paths. Never overwrite a dataset used in training.
What is the purpose of a validation set vs a test set? Why is it wrong to tune on the test set?
Tip: Validation set: used during development to compare models and tune hyperparameters. Test set: held out completely, used ONCE at the very end. Tuning on test data leaks information — the test accuracy then overestimates true generalisation.
TensorFlow vs PyTorch — which do you prefer for production ML and why?
Tip: PyTorch: Pythonic, easier debugging, dominant in research and increasingly in production. TensorFlow: mature serving stack (TF Serving, TFLite), better mobile/embedded support. Know one deeply rather than both superficially.
Take a full scored mock interview tailored to your resume. Get feedback on technical depth, clarity, structure, confidence, and relevance — free to start.