.. _tutorials:

Tutorials
=========

This section provides step-by-step tutorials for using the evolutionary optimization framework,
focusing on the core functionality of the ``evopt`` package.

Basic Optimization
------------------

Single-Objective Optimization
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Learn how to set up and run a basic optimization problem:

.. code-block:: python

    import evopt
    
    # Define parameter space with bounds
    params = {
        'x': (-5, 5),
        'y': (-5, 5),
    }
    
    # Define evaluator function (Rosenbrock function)
    def evaluator(param_dict):
        x = param_dict['x']
        y = param_dict['y']
        return (1 - x)**2 + 100 * (y - x**2)**2
    
    # Run optimization
    results = evopt.optimize(
        params=params,
        evaluator=evaluator,
        batch_size=16,
        max_workers=4,
        verbose=True
    )
    
    # Print results
    print(f"Best parameters: {results.best_parameters}")
    print(f"Final error: {results.final_error}")
    
    # Analyze convergence history
    print(f"Epochs completed: {results.epochs_completed}")
    print(f"Termination reason: {results.terminated_reason}")

Advanced Optimization
---------------------

Multi-Objective Optimization with Constraints
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

This example demonstrates how to optimize with multiple objectives and constraints:

.. code-block:: python

    import evopt
    
    # Define engineering simulation that returns multiple metrics
    def structural_simulation(params):
        height = params['height']
        width = params['width']
        length = params['length']
        
        # Calculate metrics (in a real case, this would run your simulation)
        weight = height * width * length * 7.8  # Density of steel
        stress = 1000 / (height * width)
        deflection = 500 / (height**2 * width)
        
        return {
            'weight': weight,
            'stress': stress,
            'deflection': deflection
        }
    
    # Define parameter space
    params = {
        'height': (1, 10),
        'width': (1, 20),
        'length': (10, 100),
    }
    
    # Define target values with constraints
    # Targets can be specified by any combination of the following formats:
        # dict[key: dict[value, hard], ...]
        # dict[key: (min, max), ...]
        # dict[key: value, ...]
        # dict[key: dict[(min, max), hard], ...]
    targets = {
        'weight': {'value': 500, 'hard': False},  # Soft constraint
        'stress': (0, 50),                        # Hard constraint range
        'deflection': {'value': 1.0, 'hard': True}  # Hard constraint
    }
    
    # Run optimization
    results = evopt.optimize(
        params=params,
        evaluator=structural_simulation,
        target_dict=targets,
        batch_size=32,
        max_workers=8,
        num_epochs=50
    )
    
    print(f"Best parameters: {results.best_parameters}")

Checkpointing and Resuming Optimization
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Learn how to resume interrupted optimization runs:

.. code-block:: python

    import evopt
    import os
    
    # Define unique directory ID for this optimization
    base_dir = "./optimization_studies" # Your directory for all studies
    dir_id = 5  # This will create "./optimization_studies/evolve_5"
    
    # First run (might be interrupted)
    results = evopt.optimize(
        params=params,
        evaluator=evaluator,
        base_dir=base_dir,
        dir_id=dir_id,
        num_epochs=100
    )
    
    # Resume optimization from the last epoch
    results = evopt.optimize(
        params=params,
        evaluator=evaluator,
        base_dir=base_dir,
        dir_id=dir_id,
    )

    # Resume from a specific epoch
    results = evopt.optimize(
        params=params,
        evaluator=evaluator,
        base_dir=base_dir,
        dir_id=dir_id,
        start_epoch=57  
    )
    

Visualization
-------------

Basic Convergence Plots
~~~~~~~~~~~~~~~~~~~~~~~

Generate plots showing how parameters converged during optimization:

.. code-block:: python

    import evopt
    
    # Path to optimization results
    evolve_dir = "./optimization_results/evolve_5"
    
    # Generate convergence plots
    evopt.Plotting.plot_epochs(
        evolve_dir_path=evolve_dir,
        show=True,
        save_figures=True
    )

.. image:: https://raw.githubusercontent.com/robh96/evopt/main/images/convergence_plots.png
   :alt: Error convergence.
   :width: 800
   :align: center


Parameter Space Visualization
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Create various visualizations of the parameter space:

.. code-block:: python

    import evopt
    
    evolve_dir = "./optimization_results/evolve_5"
    
    # 2-D scatter plot (parameter vs error)
    evopt.Plotting.plot_vars(
        evolve_dir_path=evolve_dir,
        x="height",
        y="error"
    )

.. image:: https://raw.githubusercontent.com/robh96/evopt/main/images/x1_vs_error.png
   :alt: Parameter versus error.
   :width: 450
   :align: center

.. code-block:: python

    # 2-D Voronoi diagram colored by error
    evopt.Plotting.plot_vars(
        evolve_dir_path=evolve_dir,
        x="height", 
        y="width",
        cval="error"
    )

.. image:: https://raw.githubusercontent.com/robh96/evopt/main/images/x1_vs_x2_vs_error_Voronoi.png
   :alt: Parameters versus error Voronoi plot.
   :width: 450
   :align: center

.. code-block:: python

    # 3-D surface plot
    evopt.Plotting.plot_vars(
        evolve_dir_path=evolve_dir,
        x="height",
        y="width",
        z="stress"
    )

.. image:: https://raw.githubusercontent.com/robh96/evopt/main/images/x1_vs_x2_vs_target2_surface.png
   :alt: Parameters versus target 3-D surface plot.
   :width: 450
   :align: center

.. code-block:: python
    
    # 4-D visualization (3D surface with color)
    evopt.Plotting.plot_vars(
        evolve_dir_path=evolve_dir,
        x="height",
        y="width",
        z="deflection",
        cval="epoch"
    )

.. image:: https://raw.githubusercontent.com/robh96/evopt/main/images/x1_vs_x2_vs_error_vs_epoch_surface.png
   :alt: Parameters versus error coloured by epoch 3-D surface plot.
   :width: 450
   :align: center


High-Performance Computing
--------------------------

Running on SLURM Clusters
~~~~~~~~~~~~~~~~~~~~~~~~~

The following demonstrates how to run optimizations on a SLURM-based HPC system:

Step 1: Create a simple optimization script
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Create a file named ``optimize.py``:

.. code-block:: python

    # filepath: optimize.py
    import evopt
    
    # Define a simple parameter space
    params = {
        'x': (-5, 5),
        'y': (-5, 5),
        'z': (-5, 5)
    }
    
    # Define evaluator function
    def evaluator(params):
        # This would typically call your simulation or analysis code
        x, y, z = params['x'], params['y'], params['z']
        return x**2 + y**2 + z**2  # Simple function to minimize
    
    # Run optimization with HPC settings
    results = evopt.optimize(
        params=params,
        evaluator=evaluator,
        batch_size=16,
        max_workers=8,             # Will use up to 8 parallel SLURM jobs
        hpc_cores_per_worker=2,    # Each worker gets 2 CPU cores
        hpc_memory_gb_per_worker=4,# Each worker gets 4GB RAM
        hpc_wall_time="2:00:00",   # 2-hour time limit per worker
        verbose=True
    )
    
    # Print results
    print(f"Best parameters: {results.best_parameters}")
    print(f"Final error: {results.final_error}")

Step 2: Create a simple SLURM submission script
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Create a file named ``submit_job.sh``:

.. code-block:: bash

    #!/bin/bash
    #SBATCH --job-name=evopt_job
    #SBATCH --output=evopt_%j.out
    #SBATCH --error=evopt_%j.err
    #SBATCH --time=24:00:00
    #SBATCH --nodes=1
    #SBATCH --ntasks=1
    #SBATCH --cpus-per-task=2
    #SBATCH --mem=8G
    
    # Load necessary modules
    module load python
    
    # Run the optimization script
    python optimize.py
    
    echo "Job completed"

Step 3: Submit the job
^^^^^^^^^^^^^^^^^^^^^^

Submit your job to SLURM:

.. code-block:: bash

    $ sbatch submit_job.sh

How it works
^^^^^^^^^^^^

When you run this setup:

1. SLURM launches your main Python script
2. The ``evopt.optimize()`` function detects it's running in a SLURM environment
3. For each batch of evaluations, evopt automatically:
   - Creates the necessary parameter files
   - Submits worker jobs to SLURM (up to max_workers)
   - Waits for all workers to complete
   - Collects and processes the results

You don't need to manually handle the parallelization - evopt takes care of submitting
the appropriate SLURM jobs based on your settings.

Note: For more complex simulations, you would typically modify the evaluator function
to interact with your simulation software or external tools as needed.


Case Studies
------------

Engineering Design Optimization
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

A complete example of optimizing an engineering design:

.. code-block:: python

    # Example of optimizing a truss structure
    import evopt
    import numpy as np
    
    # Define truss simulation function
    def simulate_truss(params):
        # In a real case, this would call an engineering simulation software
        # For this example, we'll use a simplified model
        
        # Extract design parameters
        cross_section = params['cross_section']
        height = params['height']
        span = params['span']
        
        # Calculate performance metrics
        material_volume = cross_section * (span + 2 * height)
        max_stress = 1000 * span / (cross_section * height)
        deflection = span**3 / (48 * 200e9 * cross_section * height**2)
        
        return {
            'volume': material_volume,
            'stress': max_stress,
            'deflection': deflection
        }
    
    # Define parameter space
    params = {
        'cross_section': (0.001, 0.01),  # m2
        'height': (0.5, 2.0),           # m
        'span': (5.0, 15.0)             # m
    }
    
    # Define targets
    targets = {
        'volume': {'value': 0.1, 'hard': False},      # Minimize volume (soft)
        'stress': (0, 250e6),                         # Stress limit (hard)
        'deflection': {'value': 0.01, 'hard': True}   # Deflection limit (hard)
    }
    
    # Run optimization
    results = evopt.optimize(
        params=params,
        evaluator=simulate_truss,
        target_dict=targets,
        batch_size=32,
        max_workers=8
    )
    
    # Visualize results
    evolve_dir = f"./evolve_{results.directory_id}"
    evopt.Plotting.plot_epochs(evolve_dir)
    evopt.Plotting.plot_vars(evolve_dir, x="cross_section", y="height", cval="volume")

Machine Learning Hyperparameter Optimization
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Optimize hyperparameters for a machine learning model:

.. code-block:: python

    import evopt
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.datasets import load_iris
    from sklearn.model_selection import cross_val_score
    
    # Load data
    iris = load_iris()
    X, y = iris.data, iris.target
    
    # Define evaluator function
    def evaluate_model(params):
        # Create model with parameters
        model = RandomForestClassifier(
            n_estimators=int(params['n_estimators']),
            max_depth=int(params['max_depth']),
            min_samples_split=int(params['min_samples_split']),
            random_state=42
        )
        
        # Evaluate using cross-validation
        scores = cross_val_score(model, X, y, cv=5)
        # Return negative score because we're minimizing
        return -scores.mean()
    
    # Define parameter space
    params = {
        'n_estimators': (10, 200),
        'max_depth': (3, 20),
        'min_samples_split': (2, 20)
    }
    
    # Run optimization
    results = evopt.optimize(
        params=params,
        evaluator=evaluate_model,
        batch_size=16,
        max_workers=4,
        num_epochs=20
    )
    
    print("Best hyperparameters:")
    for param, value in results.best_parameters.items():
        if param in ['n_estimators', 'max_depth', 'min_samples_split']:
            print(f"{param}: {int(value)}")
    
    print(f"Best accuracy: {-results.final_error:.4f}")


Symbolic Regression
-------------------

Equation Discovery with PySR
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Learn how to discover mathematical equations from data using symbolic regression:

.. code-block:: python

    import evopt
    import matplotlib.pyplot as plt
    
    # Initialize the Derive class with existing data
    # Assumes results.csv exists in the specified directory with columns x1, x2, x3, and y
    model = evopt.Derive(
        evolve_dir_path="./simulation_results/evolve_5",
        target_variable="y",
        parameters=["x1", "x2", "x3"],
        unary_operators=["sin", "cos", "exp"],  # Include trigonometric functions
        n_iterations=40,                        # Number of iterations
        population_size=32,
        max_size=20                            # Maximum equation complexity
    )
    
    # Fit the model to discover equations
    model.fit()
    
    # Print the best equation found
    print(f"Discovered equation: {model.best_equation}")
    
    # Make predictions using the discovered equation
    model.predict()
    
    # Create a parity plot to visualize the fit
    model.parity_plot(
        point_colour="blue",
        alpha=0.6,
        title="Actual vs. Predicted Values",
        save_figures=True,
        show=True
    )