Scalytics Connect:
Privacy-First Cloud AI Platform

Scalytics Connect delivers dedicated AI infrastructure where all your data stays within your models. Our fully-managed solution includes complete hardware setup, proven architecture, and enterprise-grade infrastructure that's been thoroughly tested in demanding production environments. From initial deployment to continuous operations, we handle everything—giving you powerful AI capabilities with zero DevOps overhead.

All plans require annual commitments, with preferred 3-year terms offering additional savings.


Choose the plan that fits your organization's needs:

Small Business

Ideal for startups and small teams running moderate AI workloads.
$81,500 / year
3-year contract:
$73,500 / year (10% additional savings)
Features:
4x NVIDIA L4 GPUs - 24 GB RAM each (dedicated)
Up to 50 concurrent users
48 vCPUs | 192 GB RAM
5TB Cloud Storage
Full Scalytics Connect management platform
Complete private model hosting with RBAC
OpenAI compatible API gateway
No Vendor Lock-In
Models with 14B in Q4 or 12B in Q6 (GGUF)
Optional:
24/7 support with SME, priority queue, remote support specialist

Book Now!

Pro

Perfect for growing businesses with substantial AI requirements.
$157,500 / year
3-year contract:
$138,500 / year (10% additional savings)
Features:
8x NVIDIA L4 GPUs - 24 GB RAM each (dedicated)
Up to 100 concurrent users
96 vCPUs | 384GB RAM
10TB secure cloud storage
Advanced custom prompt templates
Complete private model hosting with RBAC
OpenAI compatible API gateway
No Vendor Lock-In
Models with 14B in Q4 or 12B in Q6 (GGUF)
Optional:
24/7 support with SME, priority queue, remote support specialist

Book Now!

Enterprise

Designed for enterprises with mission-critical AI at scale.
$795,500 / year
3-year contract:
$714,500 / year (10% additional savings)
Features:
8x NVIDIA A100 GPUs - 80 GB RAM each (dedicated)
Up to 1,100 concurrent users
96 vCPUs | 1.36 TB RAM
20TB secure cloud storage with enhanced backup
White-glove support with dedicated account manager
Integration with single sign-on (SSO) and Active Directory
OpenAI compatible API gateway
Multi-region deployment
Early access to AI agents & MCP features
No Vendor Lock-In
Models with 70B and more parameters
24/7 support with SME, priority queue, remote support specialist

Book Now!

Scalytics Connect On-Premises

Scalytics Connect delivers enterprise-grade AI infrastructure where all your data stays within your control. Our on-premises solution brings our proven architecture and thoroughly tested platform directly to your existing hardware. Deploy powerful AI capabilities with minimal DevOps overhead while maintaining complete data sovereignty and security compliance.

For organizations with existing infrastructure or specific regulatory requirements, Scalytics Connect On-Premises provides the perfect balance of performance, control, and simplicity.

On-Premises Edition

For organizations with existing infrastructure or specific compliance requirements, Scalytics Connect is available as an on-premises solution with straightforward, unlimited licensing.
$78,500 / year
3-year contract:
$188,400 / year (10% additional savings)
On-premises deployments are tailored to your specific requirements. Contact our sales team to get started.
Features:
All Model Capabilities - Support for models from 7B to 70B+
- 7B models: Full performance with robust tool use
- 14B models: Simple coding and stable tool use
- 32B models: Good performance across diverse tasks
- 70B+ models: Best local performance for complex workloads
Enterprise Features - Multiple deployment environments
- Advanced analytics and monitoring
- Full admin controls with granular permissions
- User system prompt template
- OpenAI compatible API gateway
- Air-gapped installation support
- Advanced security and compliance features
Unlimited GPUs - Scale as much as your hardware allows
Complete Platform - The same powerful software that powers our cloud offering
4-hour response time for critical issues
Regular software updates and security patches
24/7 support with SME, priority queue, remote support specialist

Request On-Prem Consultation

Optional

Implementation Package $25.500
Infrastructure assessment and planning
Installation and configuration
Security hardening
Knowledge transfer and training (16 hours)
Advanced Services
Solution Architect: $15,000/week
Advanced training: $2,500/day
Performance tuning: $10,000/one-time

Hardware Recommendations

Recommended Starting Configuration
4-8 NVIDIA A100, H100, or V100 GPUs (32GB+ VRAM each)
64+ CPU cores
512GB+ RAM
2TB+ NVMe SSD storage
25Gbps networking
High-Performance Configuration
16+ NVIDIA A100, H100, or V100 GPUs (80GB VRAM recommended)
128+ CPU cores
1TB+ RAM
4TB+ NVMe SSD storage in RAID configuration
100Gbps networking
Request On-Prem Consultation

Frequently Asked Questions (FAQ)

General Questions

What is Scalytics Connect?

keyboard_arrow_down

Scalytics Connect is a fully-managed, privacy-first AI infrastructure solution that provides organizations with dedicated GPU resources for running large language models privately within their own environment. Our solution handles all aspects of deployment, management, and optimization, allowing you to focus on using AI rather than maintaining it.

How is Scalytics Connect different from other AI services?

keyboard_arrow_down

Unlike typical AI services that send your data to external APIs, Scalytics Connect provides dedicated infrastructure where all data processing happens within your private environment. We combine the convenience of managed services with the security and compliance benefits of private infrastructure. Additionally, we support multiple model families (DeepSeek, Mistral, Llama, Gemma, Phi) while most competitors focus on a single model or require you to handle complex DevOps tasks yourself.

Do I need AI expertise to use Scalytics Connect?

keyboard_arrow_down

No. While Scalytics Connect is powerful enough for AI specialists, it's designed to be accessible for organizations without extensive AI expertise. Our platform provides intuitive interfaces for using models, and our team handles all the complex technical optimization and maintenance.

Technical Questions

Where does my data go when using Scalytics Connect?

keyboard_arrow_down

Your data never leaves your dedicated environment. All processing happens on your private infrastructure, meaning sensitive information stays within your control at all times. This is fundamentally different from API-based services where your data must be sent to external servers.

What models can I run on Scalytics Connect?

keyboard_arrow_down

You can run a variety of open models like DeepSeek, Mistral, Llama, Gemma, and Phi without restrictions. The exact model sizes and quantizations depend on the VRAM of the GPUs:

NVIDIA L4 GPU (24GB VRAM):
-
7B models in Q6/Q5/Q4 quantization (1-2 model instances per GPU)
- 12B models in Q5/Q4 quantization (1 model instance per GPU)
- 14B models in Q4 quantization (1 model instance per GPU)

NVIDIA H100 GPU (80GB VRAM):
-
7B models in full precision or Q6/Q5/Q4 quantization (4-6 model instances per GPU)
- 14B models in full precision or Q6/Q5/Q4 quantization (2-3 model instances per GPU)
- 34B models in Q6/Q5/Q4 quantization (1-2 model instances per GPU)
- 70B models in Q4 quantization (1 model instance per GPU)

Our platform is optimized for Q6 quantized models, with DeepSeek-R1-14B as our reference model for performance specifications. You can also integrate with cloud models like OpenAI and Anthropic through API keys to enhance your AI experience and to build powerful privacy foccussed agents.

What does "concurrent users" mean in your pricing?

keyboard_arrow_down

Concurrent users are individuals actively querying your system within a short time window (approximately 10 seconds). Our measurements are based on users sending 1-2 requests per second with prompts of 100-500 tokens and responses of 100-200 tokens. The number indicates the capacity of your infrastructure to handle simultaneous requests.

Can I upgrade my plan as my needs grow?

keyboard_arrow_down

Yes. You can easily upgrade from the Small Business to SME tier as your usage increases. Upgrading from SME to Enterprise requires a technology migration due to the different GPU architecture (L4 to A100), which our team will manage for you.

Deployment & Management

Where can I deploy Scalytics Connect?

keyboard_arrow_down

Scalytics Connect can be deployed on AWS, Azure, GCP, or any Linux-based cloud that offers NVIDIA or similar GPUs. We also support on-premises deployments through our partnerships with HPC system builders and specialized GPU infrastructure providers across Europe.

How long does it take to deploy Scalytics Connect?

keyboard_arrow_down

For cloud deployments, we typically have your environment operational within 2-3 business days from contract signing. On-premises deployments vary based on your existing infrastructure and requirements.

Who handles maintenance and updates?

keyboard_arrow_down

Our team manages all aspects of maintenance, including security updates, performance optimizations, and model updates. You'll always have access to the latest features and improvements without needing to manage them yourself.

What kind of support is included?

keyboard_arrow_down

All plans include dedicated support. The Small Business tier provides business hours support, the SME tier includes 24/7 priority support, and the Enterprise tier offers white-glove support with a dedicated account manager. Our support team has deep expertise in AI infrastructure and can assist with both technical and strategic questions.

Security & Compliance

How does Scalytics Connect ensure data privacy?

keyboard_arrow_down

Data privacy is built into the core architecture of Scalytics Connect. Your data never leaves your dedicated environment, and all processing happens locally on your infrastructure. We implement comprehensive security measures including end-to-end encryption, role-based access control, and secure deployment practices.

Is Scalytics Connect compliant with regulations like GDPR, HIPAA, etc.?

keyboard_arrow_down

Scalytics Connect's architecture supports compliance with major regulations since your data remains in your controlled environment. We can help implement specific controls required for GDPR, HIPAA, and other regulatory frameworks. Our team can work with your compliance officers to ensure proper documentation and controls.

How does the role-based access control (RBAC) work?

keyboard_arrow_down

Our RBAC system allows administrators to define precise permissions for users and groups. You can control which models users can access, limit token usage, restrict certain features, and enforce security policies. This ensures proper governance of your AI infrastructure according to your organization's requirements.

Pricing & Contracts

Can I test Scalytics Connect before committing to a contract?

keyboard_arrow_down

Yes! We offer a 5-day trial period during which you can fully evaluate Scalytics Connect in your environment. During this trial, you can cancel anytime and you'll only be billed for the actual hardware costs incurred. This allows you to verify the performance, security, and usability of our platform with your specific use cases before making a longer commitment.

Why do you only offer annual contracts?

keyboard_arrow_down

AI infrastructure requires significant setup and optimization for your specific needs. Annual contracts allow us to make this investment while ensuring stable, reliable service. They also provide you with cost predictability and dedicated resources that aren't shared with other organizations.

What happens if my usage exceeds my plan's capacity?

keyboard_arrow_down

You're fully responsible for managing your usage within the infrastructure capacity you've purchased. While we provide monitoring tools that show your usage patterns, it's up to you to ensure you stay within appropriate usage levels. You can add more users beyond the recommended concurrent user count, but be aware that this will lead to performance degradation. Our concurrent user guidelines are designed to maintain optimal performance - exceeding them means your users may experience slower response times or processing delays.

Are there any hidden costs or fees?

keyboard_arrow_down

No. Our pricing is transparent and includes all aspects of the service: infrastructure, management, support, and software. The only additional costs would be if you choose to integrate with third-party models like OpenAI or Anthropic (you would pay for their API usage directly).

Still have questions? Contact us directly or email us at hello@scalytics.io