APPARATUS AND METHOD FOR A NETWORK DEVICE
An apparatus for a network device, the apparatus including at least one processor, and at least one memory storing instructions that, when executed by the at least one processor, cause the network device to: determine an expected radio resource demand, and perform, at least based on the expected radio resource demand, at least one of a micro discontinuous transmission technique, or a multiple input multiple output, MIMO, muting technique, or a power-domain decision.
The disclosure relates to an apparatus for a network device.
The disclosure further relates to a method for a network device.
BACKGROUND
Communication systems such as, e.g., wireless communication systems may be used for wireless exchange of information between two or more entities, e.g., comprising one or more terminal devices, e.g., user equipment (UE), and one or more network devices such as, e.g., base stations.
In some conventional approaches such as, e.g., those based on the third-generation partnership project (3GPP), the number of antennas at base stations is increasing, e.g., leading to extreme multiple input multiple output (eMIMO) systems. While in some approaches such systems are envisaged to operate with wider bandwidths and very large antenna array sizes, e.g., in comparison to massive MIMO (mMIMO) systems, which may bring improvements in terms of the spectral efficiency (SE) or quality of service (QoS), this may also lead to an increase in power consumption at the base stations.
SUMMARY
Various example embodiments of the disclosure are set out by the independent claims. The example embodiments and features, if any, described in this specification, that do not fall under the scope of the independent claims, are to be interpreted as examples useful for understanding various example embodiments of the disclosure.
Some example embodiments relate to an apparatus for a network device, the apparatus comprising at least one processor, and at least one memory storing instructions that, when executed by the at least one processor, cause the network device to: determine an expected radio resource demand, and perform, at least based on the expected radio resource demand, at least one of a) a micro discontinuous transmission technique, or b) a multiple input multiple output, MIMO, e.g., mMIMO, muting technique, or c) a power-domain decision or technique. In some examples, this enables, e.g., joint, control of an operation of at least some techniques and/or aspects that may influence energy efficiency.
In some examples, the network device may adhere to and/or may be based on some accepted (and/or planned) standard, such as, e.g., 3G, 4G, 5G, 6G, or some other wireless communication standard.
In some examples, the network device may be a base station, e.g., a gNB.
In some examples, the expected radio resource demand is an expected radio resource demand for a predetermined time, e.g., a predetermined amount of time resources, e.g., for one, e.g., current, slot.
In some examples, the resources are radio resources, e.g., time and/or frequency resources.
In some examples, the artificial intelligence model is a machine learning model.
In some examples, the micro discontinuous transmission technique may provide decision(s) to switch on or off at least one component, e.g., a power amplifier, of a radio frequency (RF) chain of the gNB, thus, e.g., affecting an energy efficiency. In some examples, the micro discontinuous transmission technique may, e.g., be used to turn off, e.g., a transceiver, e.g., for symbols where nothing is to be sent, e.g., if no data transmission is allocated in a respective time resource, e.g., a current slot.
In some examples, the mMIMO muting technique provides selecting an appropriate subset of antenna elements and/or RF chains to successfully enable a transmission to one or more terminal devices, thus, e.g., affecting an energy efficiency.
In some examples, the power-domain decision or technique may, e.g., comprise a technique of the POLITE-type explained in detail further below.
In some examples, the determining of the expected radio resource demand is performed upon taking a scheduling decision in a time period. In some examples, the time period comprises or is a plurality of slots, e.g., time slots. In some examples, the time period comprises or is a single slot.
In some examples, the determining of the expected radio resource demand comprises using an artificial intelligence model.
In some examples, the instructions, when executed by the at least one processor, cause the network device to: determine a number of terminal devices which are connected to the network device. In some examples, the terminal devices which are connected to the network device are those terminals which are currently in a connected state, e.g., in an RRC_CONNECTED state.
In some examples, the number of connected terminal devices may, e.g., be used as input information for the artificial intelligence model.
In some examples, the instructions, when executed by the at least one processor, cause the network device to perform at least one of: a) providing a convolutional neural network as the artificial intelligence model, or b) training the artificial intelligence model using a supervised learning approach.
In some examples, the instructions, when executed by the at least one processor, cause the network device to: model the micro discontinuous transmission technique by a first Markov decision process, wherein a state variable s(t) of the first Markov decision process is characterized by at least one of: a) the expected radio resource demand, or b) a signal to interference plus noise ratio associated with at least one terminal device, or c) at least one parameter characterizing a quality of service associated with at least one terminal device, wherein a reward function r of the first Markov decision process is based on an achievable fair sum-rate Rsum for the at least one terminal device and on an energy consumption Econs associated with the network device.
In some examples, the instructions, when executed by the at least one processor, cause the network device to: model, e.g., jointly model, the micro discontinuous transmission technique and the MIMO muting technique by a second Markov decision process.
In some examples, elements of an action space of the second Markov decision process characterize at least one of: a) information, whether at least one of a plurality of radio frequency chains should be activated, or b) information how many antenna elements and/or radio frequency chains should be used, e.g., for a predetermined time resource, e.g., a slot, or c) information indicating at least one of c1) a predetermined muting pattern for MIMO muting, or c2) a micro discontinuous transmission technique operation.
In some examples, the instructions, when executed by the at least one processor, cause the network device to: determine whether to apply at least one further technique for improving energy efficiency, and, based on the determination, apply the at least one further technique for improving energy efficiency.
In some examples, the at least one further technique for improving energy efficiency comprises at least one of: a) a power domain technique for reducing a transmit power for at least one specific transmission to at least one terminal device, e.g., as disclosed by S. Mandelli, A. Lieto, P. Baracca, A. Weber and T. Wild, “Power Optimization for Low Interference and Throughput Enhancement for 5G and 6G systems,” in 2021 IEEE Wireless Communications and Networking Conference Workshops (WCNCW), Nanjing, 2021, or b) a technique for reducing a crest factor, or c) a technique for controlling an effective isotropic radiated power (EIRP).
Some examples relate to an apparatus for a network device, the apparatus comprising means for determining an expected radio resource demand, performing, at least based on the expected radio resource demand, at least one of a) a micro discontinuous transmission technique, or b) a multiple input multiple output, MIMO, muting technique, or c) a power-domain decision.
In some examples, the means for determining the expected radio resource demand using an artificial intelligence model, and for performing, at least based on the expected radio resource demand, at least one of a) the micro discontinuous transmission technique, or b) the multiple input multiple output, MIMO, muting technique may, e.g., comprise at least one processor, and at least one memory storing instructions that, when executed by the at least one processor, cause the apparatus to perform the aforementioned aspects of determining and performing.
In some examples, the means for determining the expected radio resource demand using an artificial intelligence model, and for performing, at least based on the expected radio resource demand, at least one of a) the micro discontinuous transmission technique, or b) the multiple input multiple output, MIMO, muting technique may, e.g., comprise circuitry configured to perform the aforementioned aspects of determining and performing.
Some examples relate to a network device, e.g., base station, e.g., gNB, for a communication system comprising at least one apparatus according to the disclosure.
Some examples relate to a communication system comprising: at least one apparatus according to the disclosure.
Some examples relate to a method for a network device, comprising: determining an expected radio resource demand, performing, at least based on the expected radio resource demand, at least one of a) a micro discontinuous transmission technique, or b) a multiple input multiple output, MIMO, muting technique, or c) a power-domain decision.
Some examples relate to a computer program comprising instructions which, when executed by an apparatus, cause the apparatus to perform the method according to the disclosure.
Some examples relate to a computer-readable storage medium, for example a non-transitory computer-readable storage medium, comprising the computer program according to the disclosure.
Some examples relate to a data carrier signal carrying and/or characterizing the computer program according to the disclosure.
In some examples, the expected radio resource demand DEM-RR is an expected radio resource demand for a predetermined time, e.g., a predetermined amount of time resources, e.g., for one, e.g., current, slot.
In some examples, the power-domain decision POW-DD or technique may, e.g., comprise the above-explained POLITE technique.
In the following, further aspects and examples are disclosed, which, in some examples, may be combined with each other and/or with at least one of the aforementioned aspects or examples.
Element e1 symbolizes determining connected terminal devices, e.g., at least similar to block 200 of
Element e2 symbolizes determining an expected radio resource demand DEM-RR, e.g., at least similar to block 202 of
In some examples, at least some of the following notations may be used:
Let U:={1,2, . . . , N} denote a set of all RRC_CONNECTED terminal devices, e.g., UEs, 20, 20a, . . . (
In some examples, a slot (e.g., time slot) is denoted as t∈{1,2, . . . }, and a set of UEs resulting from a UE selection operation (see, for example, element e3 of
In some examples, the expected (e.g., normalized) radio resource demand DEM-RR (see, for example, block 202 of
In some examples, the perceived radio resource usage may be characterized by the ratio between resources that would be used if no energy efficient operation (that could, e.g., increase the resource usage) were happening and the total amount of resources available.
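The ratio described above can be illustrated with a minimal sketch. The function name, the use of physical resource blocks as the resource unit, and the clamping to 1.0 are assumptions made for illustration only and are not taken from the disclosure.

```python
def perceived_resource_usage(baseline_resources_needed: int, total_resources: int) -> float:
    """Ratio of the resources that would be used without any energy-efficient
    operation to the total amount of resources available.

    Assumption: resources are counted in integer units (e.g., PRBs), and the
    ratio is clamped to 1.0 when the demand exceeds what is available.
    """
    if total_resources <= 0:
        raise ValueError("total_resources must be positive")
    if baseline_resources_needed < 0:
        raise ValueError("baseline_resources_needed must be non-negative")
    return min(baseline_resources_needed / total_resources, 1.0)


# Hypothetical numbers: 68 of 272 resource units would be needed in the slot.
usage = perceived_resource_usage(68, 272)
```

In such a sketch, the returned value could serve as the label for one training sample of the demand estimator.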
In some examples, a decision variable for the POLITE technique may, e.g., be characterized as disclosed by S. Mandelli, A. Lieto, P. Baracca, A. Weber and T. Wild, “Power Optimization for Low Interference and Throughput Enhancement for 5G and 6G systems,” in 2021 IEEE Wireless Communications and Networking Conference Workshops (WCNCW), Nanjing, 2021, see, for example, equation 9.
In some examples, the determination of the expected radio resource demand β̂(t), also see reference sign DEM-RR of
In some embodiments, the determination, e.g., computation, of β̂(t) may be modelled using a deep supervised learning approach, wherein, e.g., a convolutional neural network CNN (or a variant of a convolutional neural network) is trained to estimate the value of the expected radio resource demand β̂(t), e.g., using labeled data samples.
In some examples, the convolutional neural network CNN can be trained either purely offline, e.g., applying a training or pretraining, e.g., based on simulation data, or online, e.g., at regular training epochs, e.g., using data samples as may be collected from operations of the gNB, e.g., L2 operations of the gNB, e.g., evolving over time.
In some examples, nonetheless, the training procedure as such may, e.g., be carried out as non-real-time operation.
In the following, example aspects of an online training process according to some examples are explained, since they also encapsulate an offline training process according to some examples.
In some examples, e.g., at every training epoch τ, the training procedure e10 (
Here, the ith feature vector is characterized as xi:=[Δi, μi, mi, ρi] and its corresponding label is characterized by the perceived radio resource usage, βi (e.g., as obtained by element e8 of
In some examples, a loss function, l(τ; θ) can be determined, e.g., computed as:
where, λ>0 is a regularizing parameter, which is, in some examples, predetermined at the gNB 10 (
Note that in some examples, it can be observed that the feature vector xi comprises not, e.g., just, the traffic/buffer information pertaining to UEs in the set U, but may also reflect individual decisions of individual energy driver(s) (e.g., operational innovations and component technology advancements associated with aspects of energy efficiency), thereby inducing a novel feedback/information-exchange mechanism in some examples.
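The exact loss expression is not reproduced in this text. As a hedged illustration only, one common form that is consistent with a regularizing parameter λ>0 is a mean squared error over the labeled samples plus an L2 penalty on the model parameters θ. The function names, the L2 penalty, and the linear stand-in predictor (used here in place of a CNN to keep the sketch self-contained) are assumptions.

```python
from typing import Callable, Sequence


def regularized_mse_loss(
    features: Sequence[Sequence[float]],
    labels: Sequence[float],
    theta: Sequence[float],
    predict: Callable[[Sequence[float], Sequence[float]], float],
    lam: float,
) -> float:
    """Assumed loss: mean squared error plus lam * ||theta||^2."""
    if lam <= 0:
        raise ValueError("lam must be positive")
    if not features or len(features) != len(labels):
        raise ValueError("features and labels must be non-empty and equal in length")
    mse = sum((predict(x, theta) - b) ** 2 for x, b in zip(features, labels)) / len(labels)
    penalty = lam * sum(w * w for w in theta)
    return mse + penalty


def linear_predict(x: Sequence[float], theta: Sequence[float]) -> float:
    """Stand-in predictor (assumption); a trained CNN would take its place."""
    return sum(w * v for w, v in zip(theta, x))


# Two hypothetical four-entry feature vectors with their usage labels.
xs = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]]
betas = [0.5, 0.25]
loss = regularized_mse_loss(xs, betas, [0.5, 0.25, 0.0, 0.0], linear_predict, lam=0.1)
```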
In the following, aspects related to the micro discontinuous transmission (μDTX) technique and the mMIMO muting technique according to some examples are provided. In some examples, joint decisions related to μDTX and mMIMO muting, e.g., together with other aspects or energy drivers according to the disclosure, are enabled.
In some examples, a μDTX operation may switch ON or OFF power amplifiers (PAs) of the gNB 10, e.g., for a duration of OFDM symbols pertaining to a data transmission in a downlink direction.
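As a minimal sketch of such per-symbol switching, the following derives an ON/OFF mask for the power amplifiers from the set of OFDM symbols that carry data in a slot. The function names and the default of 14 symbols per slot are assumptions for illustration; the disclosure does not prescribe this interface.

```python
from typing import Iterable, List


def pa_on_mask(symbols_with_data: Iterable[int], symbols_per_slot: int = 14) -> List[bool]:
    """Return one flag per OFDM symbol: True keeps the PA on, False switches it off."""
    active = set(symbols_with_data)
    if any(s < 0 or s >= symbols_per_slot for s in active):
        raise ValueError("symbol index outside the slot")
    return [s in active for s in range(symbols_per_slot)]


def off_fraction(mask: List[bool]) -> float:
    """Fraction of the slot during which the PA is switched off."""
    return mask.count(False) / len(mask)


# Hypothetical slot in which only the first three symbols carry data.
mask = pa_on_mask([0, 1, 2])
```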
In some examples, the mMIMO muting technique comprises selecting an appropriate subset of antenna elements and/or RF chains of the gNB 10, e.g., to successfully enable transmission to the UEs in Û(t). In some examples, such an operation is NP-hard in nature, e.g., with exponential computational complexity, e.g., for very large antenna systems at the gNB 10.
In some examples, it can be observed that both the energy drivers μDTX and mMIMO muting may have a direct impact on an output of at least one power amplifier of the gNB 10, which, in some examples, may be exploited from an operational, e.g., optimization, perspective.
In some examples, AI or ML algorithms with a comparatively low or, e.g., reasonable, complexity may be used, e.g., to jointly determine μDTX and mMIMO muting decisions, e.g., by maximizing an energy efficiency, e.g., subject to QoS constraints. In some examples, this is different from conventional realizations, e.g., of an L2 packet scheduler, where in some conventional approaches, a spectral efficiency and/or QoS forms the sole focus of an optimization problem.
In some examples, a micro discontinuous transmission technique μDTX-TECH may be performed, e.g., without mMIMO Muting. In other words, in such examples, a μDTX operation may be considered as a standalone optimization problem.
In some examples, e.g., to this end, the μDTX operation may be cast as, e.g., modeled by, an infinite-horizon discounted Markov decision process (MDP), see, for example the first Markov decision process MDP-1 of
In some examples, the first Markov decision process MDP-1, which is associated with the μDTX operation, may comprise the following components:
State Information (inputs): The state variable is characterized by a tuple which is formally defined as s(t):=<β̂(t), Ŷ(t), X̂(t)>, where the vectors Ŷ(t) and X̂(t) denote the SINR and QoS values of the UEs in Û(t).
Actions (outputs): Since in some examples, the μDTX operation consists of ON or OFF decisions, an action space can be modelled as {0,1}, and the individual actions at slot t, a(t)=0 may, e.g., correspond to μDTX OFF and a(t)=1 may, e.g., correspond to μDTX ON.
Immediate reward function: In some examples, the immediate reward function is modelled after an energy efficiency (“EE”-) metric, which, in some examples, may, e.g., be a function of the state variable s(t) and action a at t.
Formally, in some examples, the immediate reward may, e.g., be denoted as r(t)∈ℝ and may, e.g., be defined as follows:
where, Rsum(s(t), a(t)) is an achievable fair sum-rate for UEs in Û(t), which is given by
where, ru is the achievable rate of a UE u∈Û(t). On the other hand, Econs(s(t), a(t)) denotes energy consumption, which, in some examples, is given as follows:
In the above equation, Prad denotes a total radiated power (TRP).
Notice that in some examples, Econs(s(t), a(t)) is modelled, e.g., only, as a function of TRP since other terms contributing to energy consumption may, e.g., be considered as fixed terms, e.g., as a consequence of the μDTX operation.
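The reward, sum-rate and energy expressions are not reproduced in this text. The following is a hedged sketch of an energy-efficiency style reward, i.e., a fair sum-rate divided by an energy term that depends only on the total radiated power. The logarithmic fairness utility, the slot duration argument, the small energy floor that avoids a division by zero, and all names are assumptions.

```python
import math
from typing import Sequence


def fair_sum_rate(rates: Sequence[float]) -> float:
    """Assumed fairness utility: sum of log(1 + r_u) over the served UEs."""
    return sum(math.log1p(r) for r in rates)


def immediate_reward(
    rates: Sequence[float],
    p_rad_watts: float,
    slot_seconds: float,
    pa_on: bool,
    energy_floor_joules: float = 1e-6,
) -> float:
    """Energy-efficiency reward: fair sum-rate over radiated energy.

    Assumption: with the PAs switched off, nothing is radiated and no UE is served.
    """
    served = rates if pa_on else []
    energy = p_rad_watts * slot_seconds if pa_on else 0.0
    return fair_sum_rate(served) / max(energy, energy_floor_joules)


# Hypothetical slot: two UEs, 10 W radiated power, 0.5 ms slot.
reward_on = immediate_reward([1e6, 2e6], 10.0, 0.5e-3, pa_on=True)
reward_off = immediate_reward([1e6, 2e6], 10.0, 0.5e-3, pa_on=False)
```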
In some examples, an online policy π(t) (e.g., deterministic or stochastic) is determined, e.g., provided, which maps s(t) to a(t), e.g., to maximize the expected discounted return,
where, α(k) is the discount factor, which is, e.g., predetermined at the gNB 10. In some examples, e.g., low-complexity, numerical online policy iteration algorithms may be used, e.g., as disclosed by W. B. Powell, Approximate Dynamic Programming: Solving the curses of dimensionality, John Wiley & Sons, 2007.
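The expression for the discounted return is not reproduced in this text. As a hedged sketch, a standard discounted return accumulates future rewards weighted by powers of a discount factor. The sketch assumes a constant factor alpha in [0, 1) applied as alpha to the power k, and it truncates the infinite horizon to the rewards supplied; both are assumptions.

```python
from typing import Sequence


def discounted_return(rewards: Sequence[float], alpha: float) -> float:
    """Compute sum_k alpha**k * rewards[k] over a finite reward sequence."""
    if not 0.0 <= alpha < 1.0:
        raise ValueError("alpha must lie in [0, 1)")
    g = 0.0
    # Backward recursion: G_k = r_k + alpha * G_{k+1}.
    for r in reversed(rewards):
        g = r + alpha * g
    return g


# Three hypothetical per-slot rewards with a discount factor of 0.5.
g = discounted_return([1.0, 1.0, 1.0], 0.5)
```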
In some examples, an mMIMO muting operation which encapsulates aspects of the μDTX technique may be performed. In other words, in some examples, the μDTX decision may be subsumed by the mMIMO muting operation. More specifically, in some examples, the mMIMO muting operation is cast as, e.g., may be modeled by, an infinite-horizon discounted MDP, e.g., the second Markov decision process MDP-2 (see
In some examples, however, the action space, reward and the policy optimization may be adapted for the second Markov decision process MDP-2, e.g., as compared to the first Markov decision process MDP-1, e.g., to efficiently solve an mMIMO muting operation as described below:
Actions: In some examples, an original action space for a mMIMO muting decision may be defined by a composite action space, =1×. . . ×
Thus, alternatively, in some examples, a sub-optimal MDP with a finite action space may be considered and solved, wherein individual actions a(t) correspond to, e.g., determining, e.g., just, a number of antenna elements and/or RF chains at every slot t, i.e., a(t)∈{0} ∪ {1, . . . , M}. In some examples, the action a(t)=0 corresponds to a μDTX operation.
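To illustrate why the reduced action space may be attractive, the following sketch compares the two sizes. It assumes that the composite action space holds one binary on/off choice per antenna element or RF chain, which gives 2 to the power M combinations, whereas the reduced space only holds the counts 0 to M. The binary per-element model and the function names are assumptions.

```python
from typing import List


def composite_action_count(num_elements: int) -> int:
    """Number of on/off combinations when every element is decided individually."""
    return 2 ** num_elements


def reduced_action_space(num_elements: int) -> List[int]:
    """Reduced actions: 0 stands for the muDTX operation, m > 0 for m active elements."""
    return list(range(num_elements + 1))


# Hypothetical array with 64 elements.
full_size = composite_action_count(64)
reduced_size = len(reduced_action_space(64))
```

Under these assumptions, 64 elements give 65 reduced actions instead of 2 to the power 64 composite ones.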
In some examples, e.g., for the sub-optimal action space, the immediate reward function, r(t) may be defined as:
with f(.) being an achievable SINR, which, in some examples, may be a strictly non-decreasing function in m characterizing a rate of each UE in Û(t).
In some examples, the energy consumption in the denominator of the immediate reward function r(t) can be characterized by a predetermined energy consumption model. For instance, in some examples, Econs(s(t), a(t)) may be given by the energy consumption model as disclosed by S. Wesemann, J. Du and H. Vishwanathan, “Energy Efficient Extreme MIMO: Design Goals and Directions,” arXiv e-print, no. doi: 10.48550/arXiv.2301.01119, 2023.
In some examples, the energy consumption, e.g., Econs(s(t), a(t)), may be determined, e.g., computed, based on, e.g., as a function of, at least one of: a) the micro discontinuous transmission technique, or b) the multiple input multiple output, MIMO, e.g., mMIMO, muting technique, or c) the power-domain decision or technique, e.g., according to the POLITE approach.
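The following toy sketch shows how a number of active RF chains could be picked from the reduced action space by trading rate against energy. It is a one-step (myopic) stand-in and not the policy of the disclosure, which maximizes a discounted return. The Shannon-style rate, the callables for SINR and energy, the minimum-rate constraint, and the fallback to all chains are assumptions.

```python
import math
from typing import Callable


def pick_active_chains(
    max_chains: int,
    sinr_of_m: Callable[[int], float],
    energy_of_m: Callable[[int], float],
    min_rate: float,
) -> int:
    """Return the chain count with the best rate-per-energy that meets min_rate.

    Returns 0 (muDTX operation) when nothing needs to be sent (min_rate <= 0)
    and max_chains when no candidate satisfies the rate constraint.
    """
    if min_rate <= 0:
        return 0
    best_m, best_ee = max_chains, -1.0
    for m in range(1, max_chains + 1):
        rate = math.log2(1.0 + sinr_of_m(m))
        if rate < min_rate:
            continue
        ee = rate / energy_of_m(m)
        if ee > best_ee:
            best_m, best_ee = m, ee
    return best_m


# Hypothetical models: SINR grows linearly with m, energy has a fixed and a per-chain part.
choice = pick_active_chains(8, lambda m: float(m), lambda m: 1.0 + 0.5 * m, min_rate=2.0)
```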
Policy: As in the previous examples, a policy may be devised which maximizes the expected discounted return, G(t), for which, e.g., low-complexity, numerical online policy iteration algorithms may be used, as described in W. B. Powell, Approximate Dynamic Programming: Solving the curses of dimensionality, John Wiley & Sons, 2007.
In some examples, related to antenna pattern selection, once the policy determines the number of antenna elements and/or RF chains, e.g., to enable a successful transmission to the UEs in Û(t), e.g., in L2, a suitable antenna pattern can be obtained jointly with, e.g., a precoding functionality in L1.
In some examples, a fixed antenna pattern based mMIMO muting is proposed, wherein the state information, the immediate reward function and the policy design may, e.g., be the same as for the preceding examples. However, for the fixed antenna pattern based mMIMO muting, the action space comprises predetermined antenna muting patterns, see, for example,
Note that, in some examples, predetermined, e.g., fixed, antenna muting patterns may support gNB hardware employing standard-, e.g., 3GPP-, compliant codebook-based precoding at L1. In some examples, an action space for fixed antenna muting patterns is described as follows:
Actions: The set of all muting patterns available at the gNB 10 may collectively be denoted as {0,1, . . . , M̃}, with M̃ denoting the maximum number of patterns, which is different from M considered in the preceding examples. In the present examples, each index in the muting pattern set corresponds to a predetermined muting pattern, with index 0, e.g., denoting μDTX operation. Three such example patterns are depicted by
In some examples, e.g., once an mMIMO muting operation according to the disclosure is successfully carried out, e.g., for a successful transmission of data (and control information), the gNB 10 may perform further L2 tasks, such as: a) a selection of a modulation and coding scheme, MCS, e.g., for each UE in Û(t), and b) transmit power and resource allocation (wherein, in some examples, the resources are allocated in units of one or more physical resource blocks, PRBs) to the UEs in Û(t).
In some examples, e.g., related to element e5 of
In some examples,
where ρthresh corresponds to a maximum tolerable MCS degradation, e.g., without pushing the power amplifier(s) of the gNB 10 into an inefficiency region. In some examples, this is because the MCS obtained with respect to ρthresh corresponds to the lowest TRP, at which point the PAs operate at a higher backoff, thus leading to reduced efficiency. In some examples, the value ρthresh may, e.g., depend on factors such as EIRP constraints and power amplifier operation regimes which can be determined via the component technology advancement category energy drivers at the gNB 10.
In some examples, the muting action m(t) may, e.g., be defined in a discrete space. In some examples, the muting action m(t) may, e.g., correspond with at least one muting option as depicted by
In some examples, for a POLITE operation, it is proposed: If ϕ(t)=1 in the above example, the L2-RT (layer 2 real-time) operations invoke POLITE operation, which, in some examples, may, e.g., use the following tuple of input information denoted by Ω(t):
where,
Note that in some examples, similar approaches to the POLITE technique, which may generally, e.g., be defined as “power-domain techniques”, may also be applied in element e5, e.g., instead of the POLITE technique. In some examples, the general idea for element e5 is that a technique performed according to element e5 may reduce the transmit power needed for a specific transmission to a UE, e.g., like the POLITE technique does.
In some examples,
In some examples, which relate to EIRP control, an EIRP operational threshold for a slot “t” can, e.g., be determined, e.g., computed, according to, e.g., based on, at least one of: a) an expected (e.g., pessimistic, thus upper-bounded) CFR symbol power consumption in angular directions to be monitored, or b) UEs' precoding and power reduction. In some examples, based on this, the number of resources to be allocated in a data channel to each UE can be reduced, e.g., to comply with the EIRP limit in the current slot for the desired angular direction.
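A minimal sketch of such a resource reduction is given below. It assumes that the EIRP in the monitored direction grows linearly with the number of allocated PRBs at a constant per-PRB contribution (in linear units, e.g., watts); this linear model, the function name and the parameters are assumptions for illustration only.

```python
import math


def cap_prbs_for_eirp(requested_prbs: int, eirp_per_prb_watts: float, eirp_limit_watts: float) -> int:
    """Reduce a UE's PRB allocation so the summed EIRP stays within the slot threshold."""
    if eirp_per_prb_watts <= 0:
        raise ValueError("eirp_per_prb_watts must be positive")
    max_prbs = math.floor(eirp_limit_watts / eirp_per_prb_watts)
    return max(0, min(requested_prbs, max_prbs))


# Hypothetical slot: 100 PRBs requested, 2 W per PRB, 150 W operational threshold.
granted = cap_prbs_for_eirp(100, 2.0, 150.0)
```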
The principle of the disclosure makes it possible to provide an architecture with a holistic view of energy drivers residing in a gNB 10, which manages their operations, e.g., to ensure minimum energy consumption while, e.g., primarily satisfying target QoS requirements. Additionally, the architecture enabled by the principle of the disclosure is easily implementable and real-time executable, e.g., in the gNB 10, e.g., without impacting other, e.g., regular L2 mechanisms.
The principle of the disclosure facilitates interaction among various energy driver operations associated with the gNB 10 (e.g., μDTX, mMIMO muting and POLITE), being also forward compatible with other features, like CFR and EIRP control. In some examples, this allows controlling L2 packet scheduler operations to maximize the overall energy efficiency, e.g., while also controlling the QoS.
Claims
1. An apparatus for a network device, the apparatus comprising:
- at least one processor; and
- at least one memory storing instructions that, when executed with the at least one processor, cause the network device to: determine an expected radio resource demand; and perform, at least based on the expected radio resource demand, at least one of: a micro discontinuous transmission technique, a multiple input multiple output muting technique, or a power-domain decision.
2. The apparatus according to claim 1, wherein the instructions, when executed with the at least one processor, cause the apparatus to perform the determining upon taking a scheduling decision in a time period.
3. The apparatus according to claim 1, wherein the instructions, when executed with the at least one processor, cause the apparatus to perform the determining using an artificial intelligence model.
4. The apparatus according to claim 3, wherein the instructions, when executed with the at least one processor, cause the network device to perform at least one of: providing a convolutional neural network as the artificial intelligence model, or training the artificial intelligence model using a supervised learning approach.
5. The apparatus according to claim 1, wherein the instructions, when executed with the at least one processor, cause the network device to: model the micro discontinuous transmission technique with a first Markov decision process, wherein a state variable s(t) of the first Markov decision process is characterized with at least one of:
- the expected radio resource demand,
- a signal to interference plus noise ratio associated with at least one terminal device, or
- at least one parameter characterizing a quality of service associated with at least one terminal device,
wherein a reward function r(t) of the first Markov decision process is based on an achievable fair sum-rate Rsum for the at least one terminal device and on an energy consumption Econs associated with the network device.
6. The apparatus according to claim 1, wherein the instructions, when executed with the at least one processor, cause the network device to: model the micro discontinuous transmission technique and the multiple input multiple output muting technique with a second Markov decision process, wherein elements of an action space of the second Markov decision process characterize at least one of:
- information, whether at least one of a plurality of radio frequency chains should be activated,
- information how many at least one of antenna elements or radio frequency chains should be used, or
- information indicating at least one of a predetermined muting pattern for multiple input multiple output muting, or a micro discontinuous transmission technique operation.
7. The apparatus according to claim 1, wherein the instructions, when executed with the at least one processor, cause the network device to: determine whether to apply at least one further technique for improving energy efficiency, and, based on the determination, apply the at least one further technique for improving energy efficiency.
8. The apparatus according to claim 7, wherein the at least one further technique for improving energy efficiency comprises at least one of:
- a power domain technique for reducing a transmit power for at least one specific transmission to at least one terminal device,
- a technique for reducing a crest factor, or
- a technique for controlling an effective isotropic radiated power.
9. (canceled)
10. A network device for a communication system comprising at least one apparatus according to claim 1.
11. A communication system comprising: at least one apparatus according to claim 1.
12. A method for a network device, comprising:
- determining an expected radio resource demand; and
- performing, at least based on the expected radio resource demand, at least one of: a micro discontinuous transmission technique, a multiple input multiple output muting technique, or a power-domain decision.
13. A non-transitory program storage device readable with an apparatus, tangibly embodying a program of instructions executable with the apparatus to perform the method according to claim 12.
14. (canceled)
15. (canceled)
Type: Application
Filed: May 9, 2025
Publication Date: Nov 13, 2025
Inventors: Ravi Sharan Bhagavathula Anantha Gopala (Fellbach), Silvio Mandelli (Ludwigsburg)
Application Number: 19/203,296