Question 1

What is fractional GPU allocation?

Accepted Answer

Fractional GPU allows you to allocate only a portion of a GPU (12.5% to 100%) and pay only for what you use. This enables cost-effective GPU computing for smaller workloads without paying for an entire GPU.

Question 2

What GPUs are available on PodStack?

Accepted Answer

PodStack offers NVIDIA A10G (24GB), L40S (48GB), A100 (40GB and 80GB), and H100 (80GB) GPUs with fractional allocation from 12.5% to 100% of a GPU.

Question 3

How does pay-per-minute billing work?

Accepted Answer

With PodStack's pay-per-minute billing, you only pay for the exact time your GPU resources are running. No minimum charges, no hourly rounding - just precise billing down to the minute.

Question 4

What is the difference between GPU Pods and VMs?

Accepted Answer

GPU Pods are containerized workloads with persistent storage, ideal for training jobs. GPU VMs provide full virtual machines with root access and GPU passthrough, suitable for more complex setups requiring full OS control.

Question 5

How do I get started with PodStack?

Accepted Answer

Sign up at cloud.podstack.ai, add credits to your account, and launch your first GPU pod in minutes. PodStack provides pre-configured templates for PyTorch, TensorFlow, and LLM fine-tuning to get you started quickly.

Question 6

How is PodStack different from RunPod?

Accepted Answer

PodStack is a developer-first GPU cloud with capabilities RunPod does not offer. PodStack's proprietary PodVirt platform provides fractional GPU allocation (as low as 12.5% of a GPU) and zero egress fees, backed by an ISO 27001 certified organisation. Built by ex-Oracle engineers, not stitched-together open source.

Question 7

Is PodStack better than E2E Networks for GPU cloud?

Accepted Answer

PodStack offers several advantages over E2E Networks: fractional GPU allocation via proprietary PodVirt (E2E has no fractional GPU), zero egress fees (E2E charges for egress), per-minute billing (not hourly), and one-click ML templates for vLLM, Unsloth, and ComfyUI. PodStack is an ISO 27001 certified organisation.

Question 8

How does PodStack compare to CoreWeave and Lambda Labs?

Accepted Answer

CoreWeave and Lambda Labs focus on reserved capacity and full-GPU rentals. PodStack is a developer-first GPU cloud with per-minute billing, zero egress fees, instant self-serve provisioning, and fractional GPU allocation via proprietary PodVirt - not available on either platform. PodStack also offers fast cold starts for pods and VMs.

Question 9

How is PodStack different from Spheron Network?

Accepted Answer

Spheron is a Web3 decentralized GPU marketplace with token-based billing and no SLA guarantees. PodStack is an SLA-backed, ISO 27001 certified, enterprise-ready GPU cloud with standard payment methods, predictable per-minute billing, and enterprise legal standing. Regulated, not decentralized.

Question 10

Who founded PodStack?

Accepted Answer

PodStack was founded in 2024 by Saurav Kumar (CEO, IIIT Bengaluru alumnus, ex-Oracle Principal Engineer), Vishal Gupta (CMO, IIT Kharagpur and IIM Lucknow alumnus, Co-Founder of AnoCloud), and Harsh Manvar (CTO, ex-Oracle, recognized Docker Captain, CNCF Ambassador, Google Developer Expert, and Google Champion Innovator with 500+ Kubernetes answers on Stack Overflow reaching 1M+ developers).

Question 11

What is GPU as a Service?

Accepted Answer

GPU as a Service (GPUaaS) allows you to rent high-performance NVIDIA GPUs on-demand without purchasing expensive hardware. PodStack's GPUaaS provides instant access to A10G, L40S, A100, and H100 GPUs for AI/ML training, LLM fine-tuning, model inference, and deep learning research - all with pay-per-minute billing and no long-term commitments.

Question 12

Can I use PodStack for LLM fine-tuning?

Accepted Answer

Yes, PodStack is ideal for LLM fine-tuning. You can use A100 (40GB or 80GB) and H100 (80GB) GPUs with pre-configured PyTorch and Hugging Face environments. PodStack supports popular fine-tuning frameworks like LoRA, QLoRA, and full fine-tuning with persistent storage for datasets and model checkpoints.

Question 13

Does PodStack support PyTorch and TensorFlow?

Accepted Answer

Yes, PodStack fully supports PyTorch, TensorFlow, JAX, and all major ML frameworks. Pre-configured GPU environments come with CUDA, cuDNN, and popular ML libraries pre-installed. You can also use custom Docker images with any framework of your choice.

Question 14

What makes PodStack a strong AI infrastructure platform?

Accepted Answer

PodStack combines enterprise-grade GPU infrastructure with startup agility. ISO 27001 certified and founded by ex-Oracle engineers (IIIT Bengaluru), IIT Kharagpur & IIM Lucknow alumni, and recognized cloud-native experts (Docker Captain, CNCF Ambassador, Google Developer Expert). PodStack offers proprietary PodVirt fractional GPU technology, per-minute billing, zero egress fees, and instant self-serve provisioning. Not a weekend project - 11 years of infrastructure DNA, CNCF Ambassador-grade engineering, and NVIDIA-certified hardware.

Question 15

How does PodStack handle security and compliance?

Accepted Answer

PodStack is an ISO 27001 certified organisation. Workloads run on operator-owned hardware in access-controlled data centres with isolation between tenants, encrypted transport, and audit logging. The platform is designed to support customer data-protection obligations, including under India’s DPDP Act.

Question 16

What are GPU egress fees and does PodStack charge them?

Accepted Answer

GPU egress fees are charges for transferring data out of a cloud provider's network. AWS, GCP, and most GPU clouds charge significant egress fees that inflate total cost. PodStack charges zero egress fees - download your model weights, datasets, and inference results without hidden charges. Combined with per-minute billing and fractional GPUs, PodStack keeps total cost of ownership low.

Question 17

Can I host vLLM and deploy LLM inference on PodStack?

Accepted Answer

Yes, PodStack provides one-click templates for vLLM hosting, Unsloth fine-tuning, and ComfyUI deployment. Launch a vLLM inference server on NVIDIA L40S or A100 GPUs with pre-configured environments. From JupyterLab to production inference - no re-platforming needed. All with zero egress fees.

Question 18

How does PodStack keep GPU costs lower than AWS and GCP?

Accepted Answer

PodStack lowers total GPU cost through zero egress fees, per-minute billing (not hourly), and fractional GPU allocation. AWS and GCP charge premium rates plus egress fees and hourly rounding that inflate bills. PodStack's proprietary PodVirt platform means you pay only for the GPU fraction and the minutes you actually use. Contact sales for a quote tailored to your workload.

Launch.
Train. Serve.
Operate.

Most teams don't have a GPU problem.
They have a GPU-stack problem.

Scarce, overpriced GPUs

A fragmented stack

Slow to production

Lock-in & egress fees

Three products. One platform.

QuickPods

TrainPods

Inference

Run your own GPU cloud — DC Suite

Developer Tools

What ships in the box

Why datacenters license it

Not sure which GPU? Ask the advisor.

Need something custom? Talk to us.

Reserved GPUs

High Performance Labs

Developer Tools

Ready to accelerate your ML workflow?

Cookie Preferences

Launch.Train. Serve.Operate.

Most teams don't have a GPU problem.They have a GPU-stack problem.

Scarce, overpriced GPUs

A fragmented stack

Slow to production

Lock-in & egress fees

Three products. One platform.

QuickPods

TrainPods

Inference

Run your own GPU cloud — DC Suite

Developer Tools

What ships in the box

Why datacenters license it

Not sure which GPU? Ask the advisor.

Need something custom? Talk to us.

Reserved GPUs

High Performance Labs

Developer Tools

Ready to accelerate your ML workflow?

Cookie Preferences

Launch.
Train. Serve.
Operate.

Most teams don't have a GPU problem.
They have a GPU-stack problem.