Tuesday, June 16, 2026

Day out with LM Studio (for running local LLMs)

See All on GenAI    « Previously    Next »

LM Studio is widely considered the absolute gold standard for running local LLMs if you prefer a clean, visual interface over a terminal window. It abstracts away all the complex command-line arguments of tools like llama.cpp while still giving you deep developer controls under the hood.

Setting it up and getting your first model running takes less than 10 minutes.

1. System Check (What Fits?)

Before downloading a massive model that locks up your computer, check your hardware specs. LM Studio relies heavily on VRAM (GPU Memory), with system RAM as a fallback.

Total Available VRAM Recommended Model Size Best Quantization Format
8 GB 7B - 8B models (e.g., Llama 3 8B) Q4_K_M (Practical baseline)
12 GB - 16 GB 12B - 14B models (e.g., Gemma 4 12B, Qwen 3.6 14B) Q4_K_M or Q6_K
24 GB 32B - 35B models (e.g., Qwen 3.6 35B MoE) Q4_K_M or Q6_K (The sweet spot)
48 GB+ 70B+ models Full 8-bit (Q8_0) or unquantized (BF16)

💡 Apple Silicon Note: If you are running an M-series Mac, LM Studio automatically defaults to Apple's MLX runtime. Because Mac uses unified memory, your system RAM handles the heavy lifting directly.

2. Step-by-Step Setup Guide

1
Download and Install
~2 minutes

Go to lmstudio.ai and download the installer matching your OS (Windows x64/ARM, macOS M-series, or Linux AppImage). Run the installer to open the GUI.

2
Discover and Download a Model
~3-5 minutes

Click the Search/Discover icon (Magnifying Glass) on the left sidebar. Type in a popular open model like Gemma 4 12B or Qwen 3.6 Coder.

LM Studio will display a list of available Hugging Face files. Look for the green rocket icon next to the files—this indicates the model quantization will comfortably fit your hardware profile. Click Download.

3
Configure Your Hardware Engine
~1 minute

Head to the AI Chat view (Bubble icon) and look at the right-hand settings panel. Under Hardware Settings, select your runtime engine:

  • NVIDIA: Choose CUDA 12 llama.cpp.

  • Apple Silicon: Leave it on MLX.

  • AMD/Intel GPU: Choose Vulkan llama.cpp.

  • CPU Only: Choose CPU llama.cpp (if you don't have a dedicated GPU).

4
Adjust GPU Offload and Context
~1 minute

If you're using a discrete GPU (like NVIDIA), locate the GPU Offload slider. Toggle it to Max to push as many layers of the model into your VRAM as possible.

Set your Context Length next (start with 4096 or 8192 tokens). Higher context lengths use exponentially more VRAM.

5
Load and Chat
Instant

At the very top of the window, click the "Select a model to load" dropdown and select your downloaded model. Once the progress bar fills, type your prompt in the bottom text box and enjoy 100% private, offline AI.

3. Power-User Features to Explore Later

Once you have basic chat working, LM Studio has major features designed for software development and local workflows:

Local OpenAI-Compatible Server

Click the Developer tab (Code brackets icon) on the left menu. Here, you can click Start Server to spin up a local API endpoint on localhost:1234. Because it is fully OpenAI-compatible, you can drop this endpoint straight into developer setups, IDE extensions (like Continue or VS Code Copilot alternatives), or local scripts using the standard OpenAI SDK format:

Python
from openai import OpenAI

client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")

response = client.chat.completions.create(
    model="local-model", # It automatically targets whatever model is currently loaded
    messages=[{"role": "user", "content": "Write a quick Python sort algorithm."}]
)
print(response.choices[0].message.content)

Chat with Documents (Local RAG)

You can attach local text files, PDFs, or code repositories directly into your chat. LM Studio handles the text extraction and local embedding vectorization completely offline, allowing you to ask questions about your private files without data leaking to external servers.

LM Link (Remote Workloads)

If you have a powerful machine (like a desktop rig with a great GPU) but want to work from a lightweight laptop on your couch, you can turn on LM Link in your settings. It leverages a secure, end-to-end encrypted mesh network (powered by Tailscale) to let you stream your desktop's heavy model processing directly to your laptop as if it were running locally.

See All on GenAI    « Previously    Next »
Tags: Large Language Models,Generative AI,Agentic AI,

Sunday, June 14, 2026

Quiz on "Modeling data distributions" (Unit 4, Jun 14th 2026)


See All: Questions For Statistics From Khan Academy
« Previously    Next »
1:

Code:
mean = 170.4
sd = 10

l = 145
lz = (l - mean) / sd
print(lz)

import statistics
lz_area = statistics.NormalDist(mu=0, sigma=1).cdf(lz)
print(lz_area)

h = 171
hz = (h - mean) / sd
print(hz)

hz_area = statistics.NormalDist().cdf(hz)

area_req = round(hz_area - lz_area,4)
print(area_req)



2:


mean = 80
sd = 9

proportion = 0.4

import statistics
z = statistics.NormalDist().inv_cdf(proportion)

print(z)

x = z * sd + mean

print(x)



3:

Code:
mean = 13.1
sd = 1.5

sd1 = (mean - sd, mean + sd)
print(sd1)

sd2 = (mean - 2 * sd, mean + 2 * sd)
print(sd2)

sd3 = (mean - 3 * sd, mean + 3 * sd)
print(sd3)

sd2_area = 0.95
sd3_area = 0.997

area_req = (sd3_area - sd2_area) / 2

print(area_req)

percentage_wise = round(area_req * 100, 4)
print(percentage_wise)

out = """
(11.6, 14.6)
(10.1, 16.1)
(8.6, 17.6)
0.02350000000000002
2.35
"""



4:

Code:
b = 2
h = 0.6

area = 0.5 * b * h

percentage_of_area = area * 100

print(percentage_of_area)



5:


Code:
mean_sales = 8000
sd_sales = 1500

mean_salary = 2000 + 0.3 * mean_sales

sd_salary = sd_sales * 0.3

print("mean_salary, sd_salary")
print(mean_salary, sd_salary)



6:




7:


area = 1
b = 6
h = area * 2 / b
print(h)



8:

def area_of_trapezium(b1, b2, h):
    return 0.5 * (b1 + b2) * h

b1 = 0.5
b2 = 0.75
h = 1

a = area_of_trapezium(b1, b2, h)
print(a)

print(round(a*100, 4))


b1 = 0.25
b2 = 0.5
h = 1

a = area_of_trapezium(b1, b2, h)
print(a)

print(round(a*100, 4))



9:




10:


mean = 1497
sd = 322

proportion = 0.85

import statistics
z = statistics.NormalDist().inv_cdf(proportion)

x = z * sd + mean

print(round(x, 4))

See All: Questions For Statistics From Khan Academy
« Previously    Next » Tags: Python,Mathematical Foundations for Data Science,Data Analytics,

Saturday, June 13, 2026

Snowflake (NoSQL Database) Books (Jun 2026)


Download Books    Download Report    Other Technology Book Lists

1:
Build Pipelines for AI: An Essential Guide to Smarter Data Engineering
By: 
Year Published: 2024

2:
Building the Interoperable Lakehouse: Data Strategies for AI Leaders
By: 
Year Published: 2024

3:
Bulk loading from Amazon S3 using COPY
By: Snowflake
Year Published: 2020

4:
Bulk loading from a local file system using COPY
By: Snowflake
Year Published: 2023

5:
Cassandra: The Definitive Guide
By: Unknown
Year Published: 2010

6:
Create users and grant roles
By: Snowflake
Year Published: 2023

7:
Data Modeling with Snowflake: A practical guide to accelerating Snowflake development using universal data modeling techniques
By: Serge Gershkovich
Year Published: 2022

8:
Data Trends 2026: Manufacturing
By: 
Year Published: 

9:
Database Performance at Scale: A Practical Guide
By: Unknown
Year Published: 2023

10:
Getting Started with NoSQL
By: unknown
Year Published: 2013

11:
Getting Started with Snowflake
By: Snowflake
Year Published: 2020

12:
JSON basics
By: Snowflake
Year Published: 

13:
Learning Snowflake SQL and Scripting
By: Alan Beaulieu
Year Published: 2022

14:
Load and query sample data using SQL
By: Snowflake
Year Published: 2024

15:
Load data from cloud storage (Amazon S3)
By: Snowflake
Year Published: 

16:
Load data from cloud storage (Google Cloud Storage)
By: Snowflake
Year Published: 2023

17:
Load data from cloud storage (Microsoft Azure)
By: Snowflake
Year Published: 

18:
Loading JSON data into a relational table
By: Snowflake
Year Published: 

19:
Loading and unloading Parquet data
By: Snowflake
Year Published: 2022

20:
Making Sense of NoSQL
By: unknown
Year Published: 2014

21:
Next Generation Databases: NoSQL and Big Data
By: Guy Harrison
Year Published: 2015

22:
NoSQL Distilled
By: unknown
Year Published: 2012

23:
NoSQL Distilled
By: Pramod Sadalage and Martin Fowler
Year Published: 2012

24:
NoSQL For Dummies
By: unknown
Year Published: 2015

25:
NoSQL and SQL Data Modeling
By: unknown
Year Published: 

26:
NoSQL for Mere Mortals
By: Dan Sullivan
Year Published: 2015

27:
NoSQL the Ruby Way
By: Durran Jordan
Year Published: 2012

28:
NoSQL with MongoDB in 24 Hours
By: Brad Dayley
Year Published: 2015

29:
Seven Databases in Seven Weeks: A Guide to Modern Databases and the NoSQL Movement
By: Unknown
Year Published: 2012

30:
Snowflake - The Complete Masterclass 2023
By: Unknown
Year Published: 2023

31:
Snowflake - The Definitive Guide: Architecting, Designing, and Deploying on the Snowflake Data Cloud
By: Joyce Kay Avila
Year Published: 2022

32:
Snowflake Data Engineering
By: Maja Ferle
Year Published: 2021

33:
Snowflake Essentials
By: Frank Bell
Year Published: 2021

34:
Snowflake Intelligence for CFOs
By: 
Year Published: 

35:
Snowflake PDF
By: Tom Coffing and Dave Cook
Year Published: 2016

36:
Snowflake Security: Securing Your Snowflake Data Cloud
By: Ben Herzberg and Yoav Cohen
Year Published: 2023

37:
Snowflake and SAP: Powering Enterprise AI with Unified Data and Context
By: 
Year Published: 2024

38:
Snowflake in 20 Minutes
By: Snowflake
Year Published: Unknown

39:
Snowflake the Definitive Guide
By: O'Reilly
Year Published: 2022

40:
The Essential Guide to Transactional, Analytical and Hybrid Data
By: 
Year Published: 2016

41:
The Snowflake Dummies Guide Series
By: Snowflake Inc.
Year Published: 

42:
Tuning the Snowflake Data Cloud: Optimizing Your Data Platform to Minimize Cost and Maximize Performance
By: Andrew Carruthers
Year Published: 2022


Download Books    Download Report    Other Technology Book Lists

Tags: List of Books,Database,

Using DeepSeek With GitHub Copilot

See All on GenAI    « Previously    Next »



Important Links


DeepSeek API Docs
DeepSeek V4 for Copilot Chat (By: Vizards)
Integrate with GitHub Copilot
Awesome DeepSeek Integrations

*** IMPORTANT ***

Optional: Vision Support DeepSeek V4 is text-only, but the extension handles images automatically. Drop a screenshot into chat and it proxies through another installed Copilot model (Claude, GPT-4o) to describe the image before sending to DeepSeek. Run DeepSeek: Set Vision Proxy Model to pick which model handles image descriptions.

Help From Gemini

I don't have any other model for vision proxy to use with DeepSeek in GitHub Copilot. How can I use screenshot with DeepSeek V4 in GitHub Copilot?

Because GitHub Copilot recently integrated DeepSeek V4 (both Pro and Flash models) directly into its ecosystem via the official "DeepSeek V4 for Copilot Chat" extension, there is a built-in architectural catch: DeepSeek V4 is entirely text-only, but Copilot handles screenshots using a transparent proxy system.

If you are using this extension or an OpenAI-compatible third-party bridge, you do not need a separate, external vision proxy model like your own Claude or OpenAI API key. GitHub Copilot uses its own built-in models (like GPT-4o or Claude 3.5 Sonnet) to act as that proxy entirely for free behind the scenes.

If it isn't working or you are getting an error (like unknown variant image_url or the screenshot is simply ignored), use the following steps to properly configure and use screenshots with DeepSeek V4 in Copilot:

The Setup Method

1
Verify Extension Prerequisites
Required version

Ensure your VS Code is updated to the latest version (1.116 or later) and your GitHub Copilot extension is active. Open Copilot Chat and ensure you can see DeepSeek V4 Pro or DeepSeek V4 Flash in your model picker.

2
Configure the Built-in Copilot Vision Proxy
Command Palette

You don't need an external API key for vision. Open the Command Palette (Cmd+Shift+P on Mac or Ctrl+Shift+P on Windows) and run:

DeepSeek: Set Vision Proxy Model

3
Select an Installed Copilot Model
Choose fallback

A dropdown will appear listing the native models your GitHub Copilot subscription already provides (e.g., gpt-4o or claude-3.5-sonnet). Pick one. This does not change your main chat model; it simply tells Copilot which of its native models to use to look at your screenshot.

4
Paste and Reference the Screenshot
In the Chat Window

Drag and drop your screenshot into the Copilot Chat panel, or copy the image and press Ctrl+V / Cmd+V. Add your text prompt (e.g., "Fix the styling issue in this screenshot") and hit enter.

How it Works Behind the Scenes

When you drop a screenshot into the chat while DeepSeek V4 is active, the workflow bypasses DeepSeek's text-only limitation seamlessly:

  1. The Interception: Copilot intercepts the image file before it hits the DeepSeek API.

  2. The Description: It routes the image directly to the local native Copilot model you selected in Step 3 (like GPT-4o).

  3. The Text Injection: That model generates a highly detailed text description of what is happening in the screenshot (UI layout, error message, code snippet).

  4. The DeepSeek Execution: Copilot invisibly appends that text description to your original prompt and forwards it to DeepSeek V4. DeepSeek processes the instructions as if it "saw" the image.

Troubleshooting Tip: If you encounter a 400 Error (unknown variant image_url) when pasting a screenshot, it means your VS Code extension is trying to pass the raw image straight to DeepSeek's text endpoint. Update the DeepSeek V4 for Copilot Chat extension to the newest patch release, which fixes the format translation layer.


See All on GenAI    « Previously
   Next » Tags: Agentic AI,Generative AI,Large Language Models,