Understanding Python Variables and Memory Management

Variables

Components of Variables

A variable consists of three main parts:

Variable name
Assignment operator (=)
Variable value

Variable Name

The variable name points to the memory address of the variable value and serves as the only way to access that value.

Assignment Operator (=)

The assignment operator binds the memory address of the variable value to the variable name.

Variable Value

The variable value represents the actual data we want to store.

Variable Definition Example

score = 1000
print(score)  # Output: 1000

Three Characteristics of Variables

ID

ID represents the unique identifier of a variable in memory. Different memory addresses have different IDs.

Type

Type indicates the data type of the variable's value.

Value

The actual value stored in the variable.

Example

data = 1000
print(id(data))      # Output: 140732170872832 (will vary)
print(type(data))    # Output: <class 'int'>
print(data)          # Output: 1000

Difference between 'is' and '=='

The 'is' operator compares whether the IDs of two values are equal, while '==' compares whether the values themselves are equal.

x = 1000
y = 1000
print(x, y)          # Output: 1000 1000
print(id(x), id(y))  # Output: 140732170872832 140732170873232 (will vary)
print(x is y)        # Output: False
print(x == y)        # Output: True

Memory Management

When the Python interpreter encounters variable definition syntax, it allocates memory space to store the variable's value. Since memory capacity is limited, this raises the question of how to reclaim memory space occupied by values that are no longer needed. A variable value becomes garbage when it's no longer accessible through any references.

From a logical perspective, we define variables to store values for future use. Accessing a value requires either direct references (like x=10, where 10 is directly referenced by x) or indirect references (like l=[x,2], where x=10 is directly referenced and indirectly referenced by list l). When a value is no longer bound to any references, it becomes inaccessible and should be treated as garbage for memory reclamation.

Memory allocation and deallocation are resource-intensive operations that can potentially cause memory overflow issues. Fortunately, the CPython interpreter provides an automatic garbage collection mechanism to handle this.

Garbage Collection Mechanism

Garbage Collection (GC) is an automatic memory management mechanism built into the Python interpreter. It specifically reclaims memory space occupied by variable values that no longer have any references.

Why Garbage Collection is Necessary

During program execution, large amounts of memory are allocated. Without proper cleanup of unused memory, it can lead to memory overflow, causing program crashes and system failures. Python's garbage collection mechanism frees developers from the complexities of manual memory management.

Stack vs Heap Memory

When defining variables, both the variable name and its value need storage, corresponding to two memory regions: the stack and the heap.

The relationship between variable names and memory addresses of values is stored in the stack.
Variable values are stored in the heap. Memory management focuses on reclaiming heap space.

Direct and Indirect References

Direct reference: A memory address that can be directly reached from the stack.
Indirect reference: A memory address that requires further references from the heap to be reached.

first_num = 10
second_list = [first_num, 20]
print(first_num)     # Output: 10
print(second_list)   # Output: [10, 20]

Garbage Collection: Reference Counting

Python's GC mechanism primarily uses reference counting to track and reclaim garbage. This approach can be supplemented with mark-and-sweep to handle circular references in container objects, and generational collection to improve efficiency by trading space for time.

What is Reference Counting?

Reference counting is the number of times a variable value is associated with a variable name.

counter = 1
print(counter)  # Output: 1
# The value 1 is associated with variable name 'counter', so its reference count is 1

Changes in Reference Count

Reference Count Increase

value = 1
alias = value
print(value, alias)  # Output: 1 1
# The value 1 is associated with 'value' and then its address is given to 'alias', so reference count is now 2

Reference Count Decrease

x = 1
y = x
x = 17
print(x, y)  # Output: 17 1
# The relationship between 'x' and value 1 is broken, and 'x' now associates with 17. The reference count for 1 is now 1.

x = 1
print(x)  # Output: 1
del x
print(x)  # Output: NameError: name 'x' is not defined
# 'del' breaks the association between 'x' and value 1, reducing reference count to 0

Problems with Reference Counting and Solutions

Reference counting has a critical weakness: circular references.

list1 = ['item1']
list2 = ['item2']
list1.append(list2)  # Appends list2 to list1, increasing list2's reference count to 2
list2.append(list1)  # Appends list1 to list2, increasing list1's reference count to 2

print(list1)  # Output: ['item1', ['item2', [...]]]
print(list2)  # Output: ['item2', ['item1', [...]]]
# A circular reference is formed. Even if these lists are no longer referenced by other variables, their reference counts remain non-zero due to mutual references.

del list1
del list2
# While 'list1' and 'list2' are no longer accessible, their mutual references prevent memory reclamation, leading to memory leaks.

Mark and Sweep

The mark-and-sweep algorithm works as follows when available memory is exhausted:

Mark: Identify all objects reachable from GC Roots (stack variables) and mark them as alive.
Sweep: Traverse all objects in the heap and remove those that weren't marked.

Generational Collection

Reference counting has efficiency issues beyond circular references. Each memory scan requires traversing all object reference counts, which is time-consuming. Generational collection improves efficiency by trading space for time.

Generation: Objects that survive multiple scans are considered frequently used and are scanned less frequently.
Collection: Uses reference counting as the basis for reclamation.

Small Integer Object Pool

To optimize performance, Python implements a small integer object pool concept.

a = 256
b = 256
c = 257
print(id(a), id(b), id(c))  # Output: 140732170872832 140732170872832 140732170873232 (will vary)
# In Python, integers in the range [-5, 256] are pre-created and shared across the program.

String Interning Mechanism

As one of the most commonly used data types in Python, strings benefit from an interning mechanism to improve efficiency and performance.

What is String Interning?

The interning mechanism ensures that identical string objects are stored only once in a string pool and shared. This requires strings to be immutable objects.

String Interning Principle

String interning works by maintaining a dictionary-based string pool. If a string already exists in the pool, it's reused; otherwise, a new string is created and added to the pool. However, Python's automatic use of interning varies by scenario.

Not all strings use interning. Only strings containing underscores, digits, and letters are automatically interned. Interning occurs at compile time, not runtime.

# Automatic interning works
s1 = 'test'
s2 = 'test'
print(s1 is s2)  # Output: True

# With spaces, interning doesn't apply
s1 = 'tes t'
s2 = 'tes t'
print(s1 is s2)  # Output: False

# Interning happens at compile time, not runtime
s1 = 'xyz'
s2 = 'xy' + 'z'
s3 = ''.join(['xy', 'z'])
print(s1 is s2)  # Output: True
print(s1 is s3)  # Output: False

Tags: python Variables Memory Management garbage collection reference counting

Posted on Sat, 16 May 2026 23:24:13 +0000 by goodluck4287

Freaks City

Understanding Python Variables and Memory Management

Understanding Python Variables and Memory Management

Variables

Components of Variables

Variable Name

Assignment Operator (=)

Variable Value

Variable Definition Example

Three Characteristics of Variables

ID

Type

Value

Example

Difference between 'is' and '=='

Memory Management

Garbage Collection Mechanism

Why Garbage Collection is Necessary

Stack vs Heap Memory

Direct and Indirect References

Garbage Collection: Reference Counting

What is Reference Counting?

Changes in Reference Count

Reference Count Increase

Reference Count Decrease

Problems with Reference Counting and Solutions

Mark and Sweep

Generational Collection

Small Integer Object Pool

String Interning Mechanism

What is String Interning?

String Interning Principle

Hot Tags