287 lines
11 KiB
ReStructuredText
287 lines
11 KiB
ReStructuredText
|
.. SPDX-License-Identifier: GPL-2.0
|
||
|
|
||
|
===============================================
|
||
|
How to Implement a new CPUFreq Processor Driver
|
||
|
===============================================
|
||
|
|
||
|
Authors:
|
||
|
|
||
|
|
||
|
- Dominik Brodowski <linux@brodo.de>
|
||
|
- Rafael J. Wysocki <rafael.j.wysocki@intel.com>
|
||
|
- Viresh Kumar <viresh.kumar@linaro.org>
|
||
|
|
||
|
.. Contents
|
||
|
|
||
|
1. What To Do?
|
||
|
1.1 Initialization
|
||
|
1.2 Per-CPU Initialization
|
||
|
1.3 verify
|
||
|
1.4 target/target_index or setpolicy?
|
||
|
1.5 target/target_index
|
||
|
1.6 setpolicy
|
||
|
1.7 get_intermediate and target_intermediate
|
||
|
2. Frequency Table Helpers
|
||
|
|
||
|
|
||
|
|
||
|
1. What To Do?
|
||
|
==============
|
||
|
|
||
|
So, you just got a brand-new CPU / chipset with datasheets and want to
|
||
|
add cpufreq support for this CPU / chipset? Great. Here are some hints
|
||
|
on what is necessary:
|
||
|
|
||
|
|
||
|
1.1 Initialization
|
||
|
------------------
|
||
|
|
||
|
First of all, in an __initcall level 7 (module_init()) or later
|
||
|
function check whether this kernel runs on the right CPU and the right
|
||
|
chipset. If so, register a struct cpufreq_driver with the CPUfreq core
|
||
|
using cpufreq_register_driver()
|
||
|
|
||
|
What shall this struct cpufreq_driver contain?
|
||
|
|
||
|
.name - The name of this driver.
|
||
|
|
||
|
.init - A pointer to the per-policy initialization function.
|
||
|
|
||
|
.verify - A pointer to a "verification" function.
|
||
|
|
||
|
.setpolicy _or_ .fast_switch _or_ .target _or_ .target_index - See
|
||
|
below on the differences.
|
||
|
|
||
|
And optionally
|
||
|
|
||
|
.flags - Hints for the cpufreq core.
|
||
|
|
||
|
.driver_data - cpufreq driver specific data.
|
||
|
|
||
|
.get_intermediate and target_intermediate - Used to switch to stable
|
||
|
frequency while changing CPU frequency.
|
||
|
|
||
|
.get - Returns current frequency of the CPU.
|
||
|
|
||
|
.bios_limit - Returns HW/BIOS max frequency limitations for the CPU.
|
||
|
|
||
|
.exit - A pointer to a per-policy cleanup function called during
|
||
|
CPU_POST_DEAD phase of cpu hotplug process.
|
||
|
|
||
|
.suspend - A pointer to a per-policy suspend function which is called
|
||
|
with interrupts disabled and _after_ the governor is stopped for the
|
||
|
policy.
|
||
|
|
||
|
.resume - A pointer to a per-policy resume function which is called
|
||
|
with interrupts disabled and _before_ the governor is started again.
|
||
|
|
||
|
.ready - A pointer to a per-policy ready function which is called after
|
||
|
the policy is fully initialized.
|
||
|
|
||
|
.attr - A pointer to a NULL-terminated list of "struct freq_attr" which
|
||
|
allow to export values to sysfs.
|
||
|
|
||
|
.boost_enabled - If set, boost frequencies are enabled.
|
||
|
|
||
|
.set_boost - A pointer to a per-policy function to enable/disable boost
|
||
|
frequencies.
|
||
|
|
||
|
|
||
|
1.2 Per-CPU Initialization
|
||
|
--------------------------
|
||
|
|
||
|
Whenever a new CPU is registered with the device model, or after the
|
||
|
cpufreq driver registers itself, the per-policy initialization function
|
||
|
cpufreq_driver.init is called if no cpufreq policy existed for the CPU.
|
||
|
Note that the .init() and .exit() routines are called only once for the
|
||
|
policy and not for each CPU managed by the policy. It takes a ``struct
|
||
|
cpufreq_policy *policy`` as argument. What to do now?
|
||
|
|
||
|
If necessary, activate the CPUfreq support on your CPU.
|
||
|
|
||
|
Then, the driver must fill in the following values:
|
||
|
|
||
|
+-----------------------------------+--------------------------------------+
|
||
|
|policy->cpuinfo.min_freq _and_ | |
|
||
|
|policy->cpuinfo.max_freq | the minimum and maximum frequency |
|
||
|
| | (in kHz) which is supported by |
|
||
|
| | this CPU |
|
||
|
+-----------------------------------+--------------------------------------+
|
||
|
|policy->cpuinfo.transition_latency | the time it takes on this CPU to |
|
||
|
| | switch between two frequencies in |
|
||
|
| | nanoseconds (if appropriate, else |
|
||
|
| | specify CPUFREQ_ETERNAL) |
|
||
|
+-----------------------------------+--------------------------------------+
|
||
|
|policy->cur | The current operating frequency of |
|
||
|
| | this CPU (if appropriate) |
|
||
|
+-----------------------------------+--------------------------------------+
|
||
|
|policy->min, | |
|
||
|
|policy->max, | |
|
||
|
|policy->policy and, if necessary, | |
|
||
|
|policy->governor | must contain the "default policy" for|
|
||
|
| | this CPU. A few moments later, |
|
||
|
| | cpufreq_driver.verify and either |
|
||
|
| | cpufreq_driver.setpolicy or |
|
||
|
| | cpufreq_driver.target/target_index is|
|
||
|
| | called with these values. |
|
||
|
+-----------------------------------+--------------------------------------+
|
||
|
|policy->cpus | Update this with the masks of the |
|
||
|
| | (online + offline) CPUs that do DVFS |
|
||
|
| | along with this CPU (i.e. that share|
|
||
|
| | clock/voltage rails with it). |
|
||
|
+-----------------------------------+--------------------------------------+
|
||
|
|
||
|
For setting some of these values (cpuinfo.min[max]_freq, policy->min[max]), the
|
||
|
frequency table helpers might be helpful. See the section 2 for more information
|
||
|
on them.
|
||
|
|
||
|
|
||
|
1.3 verify
|
||
|
----------
|
||
|
|
||
|
When the user decides a new policy (consisting of
|
||
|
"policy,governor,min,max") shall be set, this policy must be validated
|
||
|
so that incompatible values can be corrected. For verifying these
|
||
|
values cpufreq_verify_within_limits(``struct cpufreq_policy *policy``,
|
||
|
``unsigned int min_freq``, ``unsigned int max_freq``) function might be helpful.
|
||
|
See section 2 for details on frequency table helpers.
|
||
|
|
||
|
You need to make sure that at least one valid frequency (or operating
|
||
|
range) is within policy->min and policy->max. If necessary, increase
|
||
|
policy->max first, and only if this is no solution, decrease policy->min.
|
||
|
|
||
|
|
||
|
1.4 target or target_index or setpolicy or fast_switch?
|
||
|
-------------------------------------------------------
|
||
|
|
||
|
Most cpufreq drivers or even most cpu frequency scaling algorithms
|
||
|
only allow the CPU frequency to be set to predefined fixed values. For
|
||
|
these, you use the ->target(), ->target_index() or ->fast_switch()
|
||
|
callbacks.
|
||
|
|
||
|
Some cpufreq capable processors switch the frequency between certain
|
||
|
limits on their own. These shall use the ->setpolicy() callback.
|
||
|
|
||
|
|
||
|
1.5. target/target_index
|
||
|
------------------------
|
||
|
|
||
|
The target_index call has two arguments: ``struct cpufreq_policy *policy``,
|
||
|
and ``unsigned int`` index (into the exposed frequency table).
|
||
|
|
||
|
The CPUfreq driver must set the new frequency when called here. The
|
||
|
actual frequency must be determined by freq_table[index].frequency.
|
||
|
|
||
|
It should always restore to earlier frequency (i.e. policy->restore_freq) in
|
||
|
case of errors, even if we switched to intermediate frequency earlier.
|
||
|
|
||
|
Deprecated
|
||
|
----------
|
||
|
The target call has three arguments: ``struct cpufreq_policy *policy``,
|
||
|
unsigned int target_frequency, unsigned int relation.
|
||
|
|
||
|
The CPUfreq driver must set the new frequency when called here. The
|
||
|
actual frequency must be determined using the following rules:
|
||
|
|
||
|
- keep close to "target_freq"
|
||
|
- policy->min <= new_freq <= policy->max (THIS MUST BE VALID!!!)
|
||
|
- if relation==CPUFREQ_REL_L, try to select a new_freq higher than or equal
|
||
|
target_freq. ("L for lowest, but no lower than")
|
||
|
- if relation==CPUFREQ_REL_H, try to select a new_freq lower than or equal
|
||
|
target_freq. ("H for highest, but no higher than")
|
||
|
|
||
|
Here again the frequency table helper might assist you - see section 2
|
||
|
for details.
|
||
|
|
||
|
1.6. fast_switch
|
||
|
----------------
|
||
|
|
||
|
This function is used for frequency switching from scheduler's context.
|
||
|
Not all drivers are expected to implement it, as sleeping from within
|
||
|
this callback isn't allowed. This callback must be highly optimized to
|
||
|
do switching as fast as possible.
|
||
|
|
||
|
This function has two arguments: ``struct cpufreq_policy *policy`` and
|
||
|
``unsigned int target_frequency``.
|
||
|
|
||
|
|
||
|
1.7 setpolicy
|
||
|
-------------
|
||
|
|
||
|
The setpolicy call only takes a ``struct cpufreq_policy *policy`` as
|
||
|
argument. You need to set the lower limit of the in-processor or
|
||
|
in-chipset dynamic frequency switching to policy->min, the upper limit
|
||
|
to policy->max, and -if supported- select a performance-oriented
|
||
|
setting when policy->policy is CPUFREQ_POLICY_PERFORMANCE, and a
|
||
|
powersaving-oriented setting when CPUFREQ_POLICY_POWERSAVE. Also check
|
||
|
the reference implementation in drivers/cpufreq/longrun.c
|
||
|
|
||
|
1.8 get_intermediate and target_intermediate
|
||
|
--------------------------------------------
|
||
|
|
||
|
Only for drivers with target_index() and CPUFREQ_ASYNC_NOTIFICATION unset.
|
||
|
|
||
|
get_intermediate should return a stable intermediate frequency platform wants to
|
||
|
switch to, and target_intermediate() should set CPU to that frequency, before
|
||
|
jumping to the frequency corresponding to 'index'. Core will take care of
|
||
|
sending notifications and driver doesn't have to handle them in
|
||
|
target_intermediate() or target_index().
|
||
|
|
||
|
Drivers can return '0' from get_intermediate() in case they don't wish to switch
|
||
|
to intermediate frequency for some target frequency. In that case core will
|
||
|
directly call ->target_index().
|
||
|
|
||
|
NOTE: ->target_index() should restore to policy->restore_freq in case of
|
||
|
failures as core would send notifications for that.
|
||
|
|
||
|
|
||
|
2. Frequency Table Helpers
|
||
|
==========================
|
||
|
|
||
|
As most cpufreq processors only allow for being set to a few specific
|
||
|
frequencies, a "frequency table" with some functions might assist in
|
||
|
some work of the processor driver. Such a "frequency table" consists of
|
||
|
an array of struct cpufreq_frequency_table entries, with driver specific
|
||
|
values in "driver_data", the corresponding frequency in "frequency" and
|
||
|
flags set. At the end of the table, you need to add a
|
||
|
cpufreq_frequency_table entry with frequency set to CPUFREQ_TABLE_END.
|
||
|
And if you want to skip one entry in the table, set the frequency to
|
||
|
CPUFREQ_ENTRY_INVALID. The entries don't need to be in sorted in any
|
||
|
particular order, but if they are cpufreq core will do DVFS a bit
|
||
|
quickly for them as search for best match is faster.
|
||
|
|
||
|
The cpufreq table is verified automatically by the core if the policy contains a
|
||
|
valid pointer in its policy->freq_table field.
|
||
|
|
||
|
cpufreq_frequency_table_verify() assures that at least one valid
|
||
|
frequency is within policy->min and policy->max, and all other criteria
|
||
|
are met. This is helpful for the ->verify call.
|
||
|
|
||
|
cpufreq_frequency_table_target() is the corresponding frequency table
|
||
|
helper for the ->target stage. Just pass the values to this function,
|
||
|
and this function returns the of the frequency table entry which
|
||
|
contains the frequency the CPU shall be set to.
|
||
|
|
||
|
The following macros can be used as iterators over cpufreq_frequency_table:
|
||
|
|
||
|
cpufreq_for_each_entry(pos, table) - iterates over all entries of frequency
|
||
|
table.
|
||
|
|
||
|
cpufreq_for_each_valid_entry(pos, table) - iterates over all entries,
|
||
|
excluding CPUFREQ_ENTRY_INVALID frequencies.
|
||
|
Use arguments "pos" - a ``cpufreq_frequency_table *`` as a loop cursor and
|
||
|
"table" - the ``cpufreq_frequency_table *`` you want to iterate over.
|
||
|
|
||
|
For example::
|
||
|
|
||
|
struct cpufreq_frequency_table *pos, *driver_freq_table;
|
||
|
|
||
|
cpufreq_for_each_entry(pos, driver_freq_table) {
|
||
|
/* Do something with pos */
|
||
|
pos->frequency = ...
|
||
|
}
|
||
|
|
||
|
If you need to work with the position of pos within driver_freq_table,
|
||
|
do not subtract the pointers, as it is quite costly. Instead, use the
|
||
|
macros cpufreq_for_each_entry_idx() and cpufreq_for_each_valid_entry_idx().
|