APPARATUS AND METHOD FOR A NETWORK DEVICE

Info

Publication number: 20250351066
Type: Application
Filed: May 9, 2025
Publication Date: Nov 13, 2025
Inventors: Ravi Sharan Bhagavathula Anantha Gopala (Fellbach), Silvio Mandelli (Ludwigsburg)
Application Number: 19/203,296

Abstract

An apparatus for a network device, the apparatus including at least one processor, and at least one memory storing instructions that, when executed by the at least one processor, cause the network device to: determine an expected radio resource demand, and perform, at least based on the expected radio resource demand, at least one of a micro discontinuous transmission technique, or a multiple input multiple output, MIMO, muting technique, or a power-domain decision.

Description

Description

FIELD OF THE DISCLOSURE

The disclosure relates to an apparatus for a network device.

The disclosure further relates to a method for a network device.

BACKGROUND

Communication systems such as, e.g., wireless communication systems may be used for wireless exchange of information between two or more entities, e.g., comprising one or more terminal device, e.g., user equipment (UE), and one or more network devices such as, e.g., base stations.

In some conventional approaches such as, e.g., based on the third-generation partnership project (3GPP), the numbers of antennas at base stations are increasing, e.g., leading to extreme multiple input multiple output (eMIMO) systems. While in some approaches such systems are envisaged to operate with wider bandwidths and very large antenna array sizes, e.g., in comparison to massive MIMO (mMIMO) systems, which may bring improvements in terms of the spectral efficiency (SE) or quality of service (QoS), it may also lead to an increase in a power consumption at the base stations.

SUMMARY

Various example embodiments of the disclosure are set out by the independent claims. The example embodiments and features, if any, described in this specification, that do not fall under the scope of the independent claims, are to be interpreted as examples useful for understanding various example embodiments of the disclosure.

Some example embodiments relate to an apparatus for a network device, the apparatus comprising at least one processor, and at least one memory storing instructions that, when executed by the at least one processor, cause the network device to: determine an expected radio resource demand, perform, at least based on the expected radio resource demand, at least one of a) a micro discontinuous transmission technique, or b) a multiple input multiple output, MIMO, e.g., mMIMO, muting technique, or c) a power-domain decision or technique. In some examples this enables to, e.g., jointly, control an operation of at least some techniques and/or aspects that may influence energy efficiency.

In some examples, the network device may adhere to and/or may be based on some accepted (and/or planned) standard, such as, e.g. 3G, 4G, 5G, 6G, or some other wireless communication standard.

In some examples, the network device may be a base station, e.g., a gNB.

In some examples, the expected radio resource demand is an expected radio resource demand for a predetermined time, e. g., a predetermined amount of time resources, e.g., for one, e.g., current, slot.

In some examples, the resources are radio resources, e.g., time and/or frequency resources.

In some examples, the artificial intelligence model is a machine learning model.

In some examples, the micro discontinuous transmission technique may provide decision(s) to switch on or off at least one component, e.g., a power amplifier, of a radio frequency (RF) chain of the gNB, thus, e.g., effecting an energy efficiency. In some examples, the micro discontinuous transmission technique may, e.g., be used to turn off, e.g., a transceiver, e.g., for symbols where nothing is to be sent, e.g., if no data transmission is allocated in a respective time resource, e.g., a current slot.

In some examples, the mMIMO muting technique provides selecting an appropriate subset of antenna elements and/or RF chains to successfully enable a transmission to one or more terminal devices, thus, e.g., effecting an energy efficiency.

In some examples, the power-domain decision or technique may, e.g., comprise a technique of the POLITE-type explained in detail further below.

In some examples, the determining of the expected radio resource demand is performed upon taking a scheduling decision in a time period. In some examples, the time period comprises or is a plurality of slots, e.g., time slots. In some examples, the time period comprises or is a single slot.

In some examples, the determining of the expected radio resource demand comprises using an artificial intelligence model.

In some examples, the instructions, when executed by the at least one processor, cause the network device to: determine a number of terminal devices which are connected to the network device. In some examples, the terminal devices which are connected to the network device are those terminals which are currently in a connected state, e.g., in an RRC_CONNECTED state.

In some examples, the number of connected terminal devices may, e.g., be used as input information for the artificial intelligence model.

In some examples, the instructions, when executed by the at least one processor, cause the network device to perform at least one of: a) providing a convolutional neural network as the artificial intelligence model, or b) training the artificial intelligence model using a supervised learning approach.

In some examples, the instructions, when executed by the at least one processor, cause the network device to: model the micro discontinuous transmission technique by a first Markov decision process, wherein a state variable s(t) of the first Markov decision process is characterized by at least one of: a) the expected radio resource demand, or b) a signal to interference plus noise ratio associated with at least one terminal device, or c) at least one parameter characterizing a quality of service associated with at least one terminal device, wherein a reward function r of the first Markov decision process is based on an achievable fair sum-rate R_sumfor the at least one terminal device and on an energy consumption E_consassociated with the network device.

In some examples, the instructions, when executed by the at least one processor, cause the network device to: model, e.g., jointly model, the micro discontinuous transmission technique and the MIMO muting technique by a second Markov decision process.

In some examples, elements of an action space of the second Markov decision process characterize at least one of: a) information, whether at least one of a plurality of radio frequency chains should be activated, or b) information how many antenna elements and/or radio frequency chains should be used, e.g., for a predetermined time resource, e.g., a slot, or c) information indicating at least one of c1) a predetermined muting pattern for MIMO muting, or c2) a micro discontinuous transmission technique operation.

In some examples, the instructions, when executed by the at least one processor, cause the network device to: determine whether to apply at least one further technique for improving energy efficiency, and, based on the determination, apply the at least one further technique for improving energy efficiency.

In some examples, the at least one further technique for improving energy efficiency comprises at least one of: a) a power domain technique for reducing a transmit power for at least one specific transmission to at least one terminal device, e.g., as disclosed by S. Mandelli, A. Lieto, P. Baracca, A. Weber and T. Wild, “Power Optimization for Low Interference and Throughput Enhancement for 5G and 6G systems,” in 2021 IEEE Wireless Communications and Networking Conference Workshops (WCNCW), Nanjing, 2021, or b) a technique for reducing a crest factor, or c) a technique for controlling an effective isotropic radiated power (EIRP).

Some examples relate to an apparatus for a network device, the apparatus comprising means for determining an expected radio resource demand, performing, at least based on the expected radio resource demand, at least one of a) a micro discontinuous transmission technique, or b) a multiple input multiple output, MIMO, muting technique, or c) a power-domain decision.

In some examples, the means for determining the expected radio resource demand using an artificial intelligence model, and for performing, at least based on the expected radio resource demand, at least one of a) the micro discontinuous transmission technique, or b) the multiple input multiple output, MIMO, muting technique may, e.g., comprise at least one processor, and at least one memory storing instructions that, when executed by the at least one processor, cause the apparatus to perform the aforementioned aspects of determining and performing.

In some examples, the means for determining the expected radio resource demand using an artificial intelligence model, and for performing, at least based on the expected radio resource demand, at least one of a) the micro discontinuous transmission technique, or b) the multiple input multiple output, MIMO, muting technique may, e.g., comprise circuitry configured to perform the aforementioned aspects of determining and performing.

Some examples relate to a network device, e.g., base station, e.g., gNB, for a communication system comprising at least one apparatus according to the disclosure.

Some examples relate to a communication system comprising: at least one apparatus according to the disclosure.

Some examples relate to a method for a network device, comprising: determining an expected radio resource demand, performing, at least based on the expected radio resource demand, at least one of a) a micro discontinuous transmission technique, or b) a multiple input multiple output, MIMO, muting technique, or c) a power-domain decision.

Some examples relate to a computer program comprising instructions which, when executed by an apparatus, cause the apparatus to perform the method according to the disclosure.

Some examples relate to a computer-readable storage medium, for example a non-transitory computer-readable storage medium, comprising the computer program according to the disclosure.

Some examples relate to a data carrier signal carrying and/or characterizing the computer program according to the disclosure.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1A schematically depicts a block diagram according to some examples,

FIG. 1B schematically depicts a block diagram according to some examples,

FIG. 2 schematically depicts a block diagram according to some examples,

FIG. 3 schematically depicts a flow chart according to some examples,

FIG. 4 schematically depicts a flow chart according to some examples,

FIG. 5 schematically depicts a flow chart according to some examples,

FIG. 6 schematically depicts a flow chart according to some examples,

FIG. 7 schematically depicts a flow chart according to some examples,

FIG. 8 schematically depicts a flow chart according to some examples,

FIG. 9 schematically depicts antenna patterns according to some examples,

FIG. 10 schematically depicts a block diagram according to some examples.

DESCRIPTION OF SOME EXAMPLE EMBODIMENTS

Some example embodiments, see, for example, FIG. 1A, 2, 3, relate to an apparatus 100 for a network device 10, the apparatus 100 comprising at least one processor 102, and at least one memory 104 storing instructions 106 that, when executed by the at least one processor 102, cause the network device 10 to: determine 202 an expected radio resource demand DEM-RR, perform 204, at least based on the expected radio resource demand DEM-RR, at least one of a) a micro discontinuous transmission technique μDTX-TECH, or b) a multiple input multiple output, MIMO, muting technique MUT-TECH, or c) a power-domain decision POW-DD. In some examples this enables to, e.g., jointly, control an operation of at least some techniques and/or aspects that may influence energy efficiency, e.g., of the network device 10.

In some examples, FIG. 2, the network device 10 may adhere to and/or may be based on some accepted (and/or planned) standard, such as, e.g. 3G, 4G, 5G, 6G, or some other wireless communication standard.

In some embodiments, FIG. 2, the network device 10 may be a base station, e.g., a gNB. In some examples, the gNB may be configured to serve one or more terminal devices 20, 20a, . . .

In some examples, the expected radio resource demand DEM-RR is an expected radio resource demand for a predetermined time, e.g., a predetermined amount of time resources, e.g., for one, e.g., current, slot.

In some examples, FIG. 2, the artificial intelligence model AI-M is a machine learning (ML) model.

In some examples, FIG. 3, the determining 202 of the expected radio resource demand DEM-RR is performed upon taking a scheduling decision in a time period. In some examples, the time period comprises or is a plurality of slots, e.g., time slots. In some examples, the time period comprises or is a single slot.

In some examples, FIG. 3, the determining 202 of the expected radio resource demand DEM-RR comprises using an artificial intelligence model AI-M.

In some examples, FIG. 3, the instructions 106, when executed by the at least one processor 102, cause the network device 10 to: determine 200 a number NUM-20 of terminal devices 20, 20a, . . . which are connected to the network device 10. In some examples, the terminal devices 20, 20a, . . . which are connected to the network device 10 are those terminals which are currently in a connected state, e.g., in an RRC CONNECTED state.

In some examples, FIG. 2, the number NUM-20 of connected terminal devices may, e.g., be used as input information for the artificial intelligence model AI-M.

In some examples, FIG. 4, the instructions 106, when executed by the at least one processor 102, cause the network device 10 to perform at least one of: a) providing 210 a convolutional neural network CNN as the artificial intelligence model AI-M, or b) training 212 the artificial intelligence model AI-M, e.g., the convolutional neural network CNN, using a supervised learning approach SV-L.

In some examples, FIG. 5, the instructions 106, when executed by the at least one processor 102, cause the network device 10 to: model 220 the micro discontinuous transmission technique μDTX-TECH by a first Markov decision process MDP-1, wherein a state variable s(t) of the first Markov decision process MDP-1 is characterized by at least one of: a) the expected radio resource demand DEM-RR, or b) a signal to interference plus noise ratio, SINR, associated with at least one terminal device, or c) at least one parameter characterizing a quality of service, QoS, associated with at least one terminal device 20, wherein a reward function r(t) of the first Markov decision process MDP-1 is based on an achievable fair sum-rate R_sumfor the at least one terminal device 20 and on an energy consumption E_consassociated with the network device 10.

The optional block 222 of FIG. 5 symbolizes determining a policy POL-1 for the first Markov decision process MDP-1 according to some examples, which are explained in detail further below.

In some examples, FIG. 6, the instructions 106, when executed by the at least one processor 102, cause the network device 10 to: model 230, e.g., jointly model 232, the micro discontinuous transmission technique μDTX-TECH and the MIMO muting technique MUT-TECH by a second Markov decision process MDP-2.

In some examples, FIG. 2, 6, elements of an action space of the second Markov decision process MDP-2 characterize at least one of: a) information, whether at least one of a plurality of radio frequency chains (e.g., of the gNB 10) should be activated, or b) information how many antenna elements and/or radio frequency chains should be used, e.g., for a predetermined time resource, e.g., a slot, or c) information indicating at least one of c1) a predetermined muting pattern for MIMO muting, or c2) a micro discontinuous transmission technique operation.

The optional block 232 of FIG. 6 symbolizes determining a policy POL-2 for the second Markov decision process MDP-2 according to some examples, which are explained in detail further below.

In some examples, FIG. 7, the instructions 106, when executed by the at least one processor 102, cause the network device 10 to: determine 240 whether to apply at least one further technique for improving energy efficiency, e.g., of the network device, and, based on the determination 240, apply 242 the at least one further technique for improving energy efficiency. In other words, in some examples, if the determination 240 yields that the energy efficiency of the gNB may be (further) improved by the at least one further technique for improving the energy efficiency, this at least one further technique is applied according to block 242 of FIG. 7. However, if the determination 240 yields that the energy efficiency of the gNB may not be (further) improved by the at least one further technique for improving the energy efficiency, block 242 may be omitted.

In some examples, FIG. 7, the at least one further technique for improving energy efficiency comprises at least one of: a) a power domain technique for reducing a transmit power for at least one specific transmission to at least one terminal device, e.g., according to the POLITE-type, e.g., as disclosed by S. Mandelli, A. Lieto, P. Baracca, A. Weber and T. Wild, “Power Optimization for Low Interference and Throughput Enhancement for 5G and 6G systems,” in 2021 IEEE Wireless Communications and

Networking Conference Workshops (WCNCW), Nanjing, 2021, or b) a technique for reducing a crest factor, or c) a technique for controlling an effective isotropic radiated power (EIRP).

Some examples, FIG. 1B, relate to an apparatus 100′ for a network device 10, the apparatus 100′ comprising means 102′ for determining 202 an expected radio resource demand DEM-RR, performing 204, at least based on the expected radio resource demand, at least one of a) a micro discontinuous transmission technique, or b) a multiple input multiple output, MIMO, muting technique, or c) a power-domain decision POW-DD.

In some examples, the power-domain decision POW-DD or technique may, e.g., comprise the above-explained POLITE technique.

In some examples, FIG. 1B, the means 102′ for determining 202 the expected radio resource demand using an artificial intelligence model, and for performing 204, at least based on the expected radio resource demand, at least one of a) the micro discontinuous transmission technique, or b) the multiple input multiple output, MIMO, muting technique may, e.g., comprise at least one processor 102 (see, for example, FIG. 1A), and at least one memory 104 storing instructions 106 that, when executed by the at least one processor 102, cause the apparatus 100′ to perform the aforementioned aspects of determining 202 and performing 204.

In some examples, FIG. 1B, the means 102′ for determining the expected radio resource demand using an artificial intelligence model, and for performing, at least based on the expected radio resource demand, at least one of a) the micro discontinuous transmission technique, or b) the multiple input multiple output, MIMO, muting technique may, e.g., comprise circuitry 104′ configured to perform the aforementioned aspects of determining 202 and performing 204.

Some examples, FIG. 2, relate to a network device 10, e.g., base station, e.g., gNB, for a communication system 1 comprising at least one apparatus 100, 100′ according to the disclosure.

Some examples, FIG. 2, relate to a communication system 1 comprising: at least one apparatus 100, 100′ according to the disclosure.

Some examples, FIG. 3, relate to a method for a network device 10, comprising: determining 202 an expected radio resource demand, performing 204, at least based on the expected radio resource demand, at least one of a) a micro discontinuous transmission technique, or b) a multiple input multiple output, MIMO, muting technique, or c) a power-domain decision.

In the following, further aspects and examples are disclosed, which, in some examples, may be combined with each other and/or with at least one of the aforementioned aspects or examples.

FIG. 8 schematically depicts a flow chart according to some examples.

Element e1 symbolizes determining connected terminal devices, e.g., at least similar to block 200 of FIG. 3. In some examples, in element e1, a list of terminal devices 20, 20a, . . . (FIG. 2), which are currently in an RRC_CONNECTED state, is determined.

Element e2 symbolizes determining an expected radio resource demand DEM-RR, e.g., at least similar to block 202 of FIG. 3, e.g., based on the list of terminal devices as, e.g., obtained by element el of FIG. 8. In some examples, the expected radio resource demand DEM-RR provides a forecast of an expected resource usage for a current slot based on the AI model AI-M (FIG. 2).

Element e3 of FIG. 8 symbolizes determining a set of terminal devices to be served in, e.g., a current time slot, e.g., based on a conventional technique using at least one of a) QoS, or b) fairness metrics.

Element e4 of FIG. 8 symbolizes performing at least one of the micro discontinuous transmission technique μDTX-TECH and the MIMO muting technique MUT-TECH, e.g., at least similar to block 204 of FIG. 3, e.g., based on the expected radio resource demand DEM-RR as obtained by element e2 and based on the set of terminal devices as determined by element e3. In other words, in some examples, one or more decisions as may, e.g., be obtained by at least one of the micro discontinuous transmission technique μDTX-TECH and the MIMO muting technique MUT-TECH may depend on the expected radio resource demand DEM-RR as provided by element e2.

Element e5 of FIG. 8 symbolizes determining whether an energy efficiency may be further improved, e.g., after performing the aspects of element e4 (e.g., in the form of a joint μDTX+mMIMO muting decision), and, if the determination is positive, the procedure continues with element e6.

In some examples, FIG. 8, the determination of element e5 may be based on checking if one or more necessary conditions for execution of at least one further technique for improving energy efficiency are satisfied. If so, as mentioned above, the procedure continues with element e6, which symbolizes performing the at least one further technique for improving energy efficiency. If not, i.e., if one or more necessary conditions for the execution of at least one further technique for improving energy efficiency are not satisfied, in some examples, the procedure continues with element e7, e.g., omitting element e6.

In some examples, FIG. 8, the at least one further technique for improving energy efficiency comprises at least one of: a) a power domain technique for reducing a transmit power for at least one specific transmission to at least one terminal device, e.g., according to the POLITE-type, as already mentioned above, or b) a technique for reducing a crest factor (e.g., “crest factor reduction”, CFR), or c) a technique for controlling an effective isotropic radiated power (EIRP).

Element e7 of FIG. 8 symbolizes allocating resources, e.g., for a specific, e.g., current, time slot. In some examples, one or more further operations, e.g., associated with a first layer (e.g., L1) or a second layer (e.g., L2), such as, e.g., beamforming, may be performed.

Element e8 of FIG. 8 symbolizes a perceived radio resource demand, e.g., as opposed to the expected radio resource demand DEM-RR, as may, in some examples, e.g., be determined based on the AI model AI-M, see, for example, element e2.

Element e9 of FIG. 8 symbolizes an experience buffer, which, in some examples, may be used for training of the AI model AI-M (FIG. 2). In some examples, the experience buffer e9 may at least temporarily store information as may, e.g., be obtained by at least one of the blocks el, e4, e6, e8, see, for example, the arrows a1, a2, a3, a4 of FIG. 8. In some examples at least a part of this information may be used for training the AI model AI-M.

Element e10 of FIG. 8 symbolizes one or more training procedures, e.g., for training the AI model AI-M, e.g., based on the information as may be stored by the experience buffer e9, see arrow a5.

Arrow a6 of FIG. 8 symbolizes a model update, e.g., an update of the AI model AI-M, e.g., based on the one or more training procedures e10.

In some examples, at least some of the following notations may be used:

Let :={1,2, . . . , N} denote a set of all RRC CONNECTED terminal devices, e.g., UEs, 20, 20a, . . . (FIG. 2) associated with the gNB 10, with N∈ denoting a maximum number of UEs supported by the gNB 10. In some examples, the choice of N is gNB specific, which may, e.g., depend on at least one network operation feature.

In some examples, a slot (e.g., time slot) is denoted as t∈{1,2, . . . }, and a set of UEs resulting from an UE selection operation (see, for example, element e3 of FIG. 8) is denoted as (t)⊆, s.t |(t)|:=N(t)≤N. In some examples, M∈ denotes a maximum number of radio frequency (RF) chains at the gNB 10, e.g., predetermined based on the underlying hardware. In some examples, the vector notation, Δ(t)∈[0, Δ_max]^N×1, γ(t)∈, x(t)∈, corresponds to UEs' traffic/buffer information (e.g., indicating a number of bits), SINR estimates and QoS requirements respectively, where, Δ_max∈ denotes a maximum buffer size, which in some examples is a gNB specific hyper-parameter.

In some examples, the expected (e.g., normalized) radio resource demand DEM-RR (see, for example, block 202 of FIG. 3 or an output of element e2 of FIG. 8) is denoted by {circumflex over (β)}(t)∈[0,1], and a perceived radio resource usage (see element e8, FIG. 8) is denoted as β(t)∈[0,1]. In some examples, as for the decision parameters, μ(t)∈{0,1}, m(t)∈:={1,2, . . . , M} and ρ(t)∈[0, {circumflex over (β)}(t)]^N(t)×1represents a μDTX (micro discontinuous transmission) decision, mMIMO muting decision and POLITE spreading factor, respectively. In some examples, μDTX may be characterized by a binary set, where μ(t)=1 corresponds to no data transmission during slot t and otherwise when μ(t)=0. In some examples, a mMIMO muting decision may be characterized by a number of RF chains to be activated.

In some examples, the perceived radio resource usage may be characterized by the ratio between resources that would be used if no energy efficient operation (that could, e.g., increase the resource usage) were happening and the total amount of resources available.

In some examples, a decision variable for the POLITE technique may, e.g., be characterized as disclosed by S. Mandelli, A. Lieto, P. Baracca, A. Weber and T. Wild, “Power Optimization for Low Interference and Throughput Enhancement for 5G and 6G systems,” in 2021 IEEE Wireless Communications and Networking Conference Workshops (WCNCW), Nanjing, 2021, see, for example, equation 9.

In some examples, the determination of the expected radio resource demand {circumflex over (β)}(t), also see reference sign DEM-RR of FIG. 2, is performed using the AI model AI-M (FIG. 2).

In some embodiments, the determination, e.g., computation, of {circumflex over (β)}(t) may be modelled using a deep supervised learning approach, wherein, e.g., a convolutional neural network CNN (or a variant of a convolutional neural network) is trained to estimate the value of the expected radio resource demand {circumflex over (β)}(t), e.g., using labeled data samples.

In some examples, the convolutional neural network CNN can be trained either purely offline, e.g., applying a training or pretraining, e.g., based on simulation data, or online, e.g., at regular training epochs, e.g., using data samples as may be collected from operations of the gNB, e.g., L2 operations of the gNB, e.g., evolving over time.

In some examples, nonetheless, the training procedure as such may, e.g., be carried out as non-real-time operation.

In the following, example aspects of an online training process according to some examples are explained, since they also encapsulate an offline training process according to some examples.

In some examples, e.g., at every training epoch τ, the training procedure e10 (FIG. 8) aims at minimizing a regularized l₂loss where, training is performed on a set of features and their corresponding labels given by

$𝒟 := {(x_{i}, β_{i}}_{i = 1}^{D} .$

Here, the i^thfeature vector is characterized as x_i:=[Δ_i, μ_i, m_i, ρ_i] and its corresponding label is characterized by the perceived radio resource usage, β_i(e.g., as obtained by element e8 of FIG. 8). In some examples, D is a gNB specific parameter denoting a length of data samples used for training. In some examples, assuming θ denotes the trainable parameters of the architecture of the convolutional neural network CNN, then {circumflex over (β)}_i(θ) denotes the expected radio resource demand value parametrized w.r.t θ.

In some examples, a loss function, l(τ; θ) can be determined, e.g., computed as:

$l (τ; θ) := \frac{1}{D} \sum_{i = 1}^{D} { {\hat{β}}_{i} (θ) - β_{i} }^{2} + λ \cdot { θ }^{2},$

where, λ>0 is a regularizing parameter, which is, in some examples, predetermined at the gNB 10 (FIG. 2). In some examples, the above loss function l(τ; 0) may be used to optimize the trainable parameters θ, e.g., by backpropagation, e.g. via the stochastic gradient descent algorithm.

Note that in some examples, it can be observed that the feature vector x_icomprises of not, e.g., just, the traffic/buffer information pertaining to UEs in the set U, but may also consider individual decisions of individual energy driver(s) (e.g., operational innovations and component technology advancements associated with aspects of energy efficiency), thereby inducing a novel feedback/information-exchange mechanism in some examples.

In the following, aspects related to the micro discontinuous transmission (μDTX) technique and the mMIMO muting technique according to some examples are provided. In some examples, joint decisions related to μDTX and mMIMO muting, e.g., together with other aspects or energy drivers according to the disclosure, are enabled.

In some examples, a μDTX operation may switch ON or OFF power amplifiers (PAs) of the gNB 10, e.g., for a duration of OFDM symbols pertaining to a data transmission in a downlink direction.

In some examples, the mMIMO muting technique comprises of selecting an appropriate subset of antenna elements and/or RF chains of the gNB 10, e.g., to successfully enable transmission to (t). In some examples, such an operation is NP-hard in nature, e.g., with exponential computational complexity, e.g., for very large antenna systems at the gNB 10.

In some examples, it can be observed that both the energy drivers μDTX and mMIMO muting may have a direct impact on an output of at least one power amplifier of the gNB 10, which, in some examples, may be exploited from an operational, e.g., optimization, perspective.

In some examples, AI or ML algorithms with a comparatively low or, e.g., reasonable, complexity may be used, e.g., to jointly determine μDTX and mMIMO muting decisions, e.g., by maximizing an energy efficiency, e.g., subject to QoS constraints. In some examples, this is different from conventional realizations, e.g., of an L2 packet scheduler, where in some conventional approaches, a spectral efficiency and/or QoS forms the sole focus of an optimization problem.

In some examples, a micro discontinuous transmission technique μDTX-TECH may be performed, e.g., without mMIMO Muting. In other words, in such examples, a μDTX operation may be considered as a standalone optimization problem.

In some examples, e.g., to this extent, the μDTX operation may be casted, e.g., described, as, e.g., modeled by, an infinite-horizon discounted Markov decision process (MDP), see, for example the first Markov decision process MDP-1 of FIG. 5. In some examples, a solution, e.g., online solution, for the first Markov decision process MDP-1 may be provided by using, for example low-complexity, algorithms, e.g., from an approximate dynamic programming (ADP) framework.

In some examples, the first Markov decision process MDP-1, which is associated with the μDTX operation, may comprise the following components:

State Information (inputs): The state variable is characterized by a tuple which is formally defined as s(t):=<{circumflex over (β)}(t), Ŷ(t), {circumflex over (X)}(t)>, where the vectors Ŷ(t) and {circumflex over (X)}(t) denote the SINR and QoS values of the UEs in (t).

Actions (outputs): Since in some examples, the μDTX operation consists of ON or OFF decisions, an action space can be modelled as ∈{0,1}, and the individual actions at slot t, a(t)=0 may, e.g., corresponds to μDTX OFF and a(t)=0 may, e.g., correspond to μDTX ON.

Immediate reward function: In some examples, the immediate reward function is modelled after an energy efficiency (“EE”-) metric, which, in some examples, may, e.g., be a function of the state variable s(t) and action a at t.

Formally, in some examples, the immediate reward may, e.g., be denoted as r(t)∈ and may, e.g., be defined as follows:

$r (t) = r (s (t), a (t)) := \frac{R_{s u m} (s (t), a (t))}{E_{c o n s} (s (t), a (t))},$

where, R_sum(s(t), a(t)) is an achievable fair sum-rate for UEs in (t), which is given by

$R_{sum} (s (t), a (t); ϵ) := {\begin{matrix} \sum_{\hat{𝒰} (t)} r_{u}^{1 - ϵ}, & ϵ \neq 1 \\ \sum_{\hat{𝒰} (t)} \log_{2} (r_{u}), & ϵ = 1 \end{matrix},$

where, r_uis the achievable rate of a UE u∈Û(t). On the other hand, E_cons(s(t), a(t)) denotes energy consumption, which, in some examples, is given as follows:

$E_{c o n s} (s (t), a (t)) := {\begin{matrix} P_{rad} (s (t)), & if a (t) = 1, \\ 0, & if a (t) = 0 . \end{matrix}$

In the above equation, P_raddenotes a total radiated power (TRP).

Notice that in some examples, E_cons(s(t), a(t)) is modelled, e. g., only, as a function of TRP since other terms contributing to energy consumption may, e.g., be considered as fixed terms, e.g., as a consequence of the μDTX operation.

In some examples, an online policy π(t) (e.g., deterministic or stochastic) is determined, e.g., provided, which maps s(t) to a(t), e.g., to maximize the expected discounted return,

$G (t) := \sum_{k = 0}^{\infty} α (k) r (t + k + 1),$

where, α(k) is the discount factor, which is, e. g., predetermined at the gNB 10. In some examples, it may be resorted to, e.g., low-complexity, numerical online policy iteration algorithms, e.g., as disclosed by W. B. Powell, Approximate Dynamic Programming: Solving the curses of dimensionality, John Wiley & Sons, 2007.

In some examples, a mMIMO muting operation which encapsulates aspects of the μDTX technique may be performed. In other words, in some examples, the μDTX decision may be subsumed by the mMIMO muting operation. More specifically, in some examples, the mMIMO muting operation is cast as, e.g., may be modeled by, an infinite-horizon discounted MDP, e.g., the second Markov decision process MDP-2 (see FIG. 6), for which, in some examples, the state information s(t) may, e.g., be considered to be the same as in the previous examples related to the first Markov decision process MDP-1, i.e., s(t):=<{circumflex over (β)}(t), Ŷ(t), {circumflex over (X)}(t)>.

In some examples, however, the action space, reward and the policy optimization may be adapted for the second Markov decision process MDP-2, e.g., as compared to the first Markov decision process MDP-1, e.g., to efficiently solve a mMIMO muting operation as described below:

Actions: In some examples, an original action space for a mMIMO muting decision may be defined by a composite action space, =₁×. . . ×_M, where _j:={0,1}, ∀j∈ with _j:={0,1} being a binary set indicating whether or not the j^thRF chain of the gNB 10 should be activated. However, in some examples, since is exponential w.r.t M, it induces NP-hardness.

Thus, alternatively, in some examples, a sub-optimal MDP with a finite action space may be considered and solved, wherein individual actions a(t) correspond to, e.g., determining, e.g., just, a number of antenna elements and/or RF chains at every slot t, i.e., a(t)∈{0} ∪. In some examples, the action a(t)=0 corresponds to a μDTX operation.

In some examples, e.g., for the sub-optimal action space, the immediate reward function, r(t) may be defined as:

$r (s (t), a (t)) := \frac{R_{sum} (s (t), a (t))}{E_{cons} (s (t), a (t))},$ $where R_{s u m} (s (t), a (t) = m) := \sum_{\hat{𝒰} (t)} \log_{2} (1 + f_{u} (s (t), m)),$

with f(.) being an achievable SINR, which, in some examples, may be a strictly non-decreasing function in m characterizing a rate of each UE in (t).

In some examples, the energy consumption in the denominator of the immediate reward function r(t) can be characterized by a predetermined energy consumption model. For instance, in some examples, E_cons(s(t), a(t)) may be given by the energy consumption model as disclosed by S. Wesemann, J. Du and H. Vishwanathan, “Energy Efficient Extreme MIMO: Design Goals and Directions,” arXiv e-print, no. doi: 10.48550/arXiv.2301.01119, 2023.

In some examples, the energy consumption, e.g., E_cons(s(t), a(t)), may be determined, e.g., computed, based on, e.g., as a function of, at least one of: a) the micro discontinuous transmission technique, or b) the multiple input multiple output, MIMO, e.g., mMIMO, muting technique, or c) the power-domain decision or technique, e.g., according to the POLITE approach.

Policy: As in the previous examples, a policy may be devised which maximizes the expected discounted return, G(t), for which it can be resorted to, e.g., low-complexity, numerical online policy iteration algorithms as described in W. B. Powell, Approximate Dynamic Programming: Solving the curses of dimensionality, John Wiley & Sons, 2007.

In some examples, related to antenna pattern selection, once the policy determines the number of antenna elements and/or RF chains, e.g., to enable a successful transmission to (t), e.g., in L2, a suitable antenna pattern can jointly be obtained with, e.g., a precoding functionality in L1.

In some examples, a fixed antenna pattern based mMIMO muting is proposed, wherein the state information, the immediate reward function and the policy design may, e.g., be the same as for the preceding examples. However, for the fixed antenna pattern based mMIMO muting, the action space comprises of predetermined antenna muting patterns, see, for example, FIG. 9 which schematically depicts various antenna muting patterns ap-1, ap-2, ap-3 according to some examples, which are explained in detail further below.

Note that, in some examples, predetermined, e.g., fixed, antenna muting patterns may support gNB hardware employing standard-, e.g., 3GPP-, compliant codebook-based precoding at L1. In some examples, an action space for fixed antenna muting patterns is described as follows:

Actions: The set of all muting patterns available at the gNB 10 may collectively be denoted using such that :={0,1, . . . , {tilde over (M)}}, with {tilde over (M)} denoting the maximum number of patterns, which is different from M considered in the preceding examples. In the present examples, each index in the muting pattern set corresponds to a predetermined muting pattern, with index 0, e.g., denoting μDTX operation. Three such example patterns are depicted by FIG. 9, where an index “1” corresponds to a full panel operation, see antenna muting pattern ap-1, with each cross representing an antenna element. Similarly, indices “2” and “3” of the muting pattern set may correspond to, e.g., half panel operation, e.g., in a vertical direction (see pattern ap-2 of FIG. 9) and a horizontal direction (see pattern ap-3 of FIG. 9), e.g., with two muting patterns associated with each index. In some examples, a specific choice of mapping these patterns to their corresponding index can be considered as a predetermined operation at the gNB 10.

In some examples, e.g., once a mMIMO muting operation according to the disclosure is successfully carried out, e.g., for a successful transmission of data (and control information) the gNB 10 may perform further L2 tasks, such as: a) a selection of a modulation and coding scheme, MCS, e.g., for each UE in (t), and b) transmit power and resource allocation (wherein, in some examples, the resources are allocated in units of one or more physical resource blocks, PRBs) to (t).

In some examples, e.g., related to element e5 of FIG. 8, it may be checked whether an energy efficient link adaptation mechanism such as, e.g., based on the POLITE-type, as mentioned above, should be performed, e.g., based on the expected radio resource demand DEM-RR as obtained by element e2 of FIG. 8 and on the mMIMO muting as determined by element e4 of FIG. 8.

In some examples, FIG. 8, element e5 may determine, e.g., for a given {circumflex over (β)}(t) and a mMIMO muting action m(t):

$ϕ (t) := {\begin{matrix} 1, & if R_{sum} (\hat{β} (t), m (t)) > R_{sum} (\hat{β} (t), m (t), ρ_{thresh}) \\ 0, & otherwise . \end{matrix}$

where ρ_threshcorresponds to a maximum tolerable MCS degradation, e.g., without pushing the power amplifier(s) of the gNB 10 into an inefficiency region. In some examples, this is because, the MCS obtained w.r.t ρ_threshcorresponds to the lowest TRP, at which point the PAS operate in higher backoff, thus leading to reduced efficiency. In some examples, the value ρ_threshmay, e.g., depend on factors such as EIRP constraints and power amplifier operation regimes which can be determined via the component technology advancement category energy drivers at the gNB 10.

In some examples, the muting action m(t) may, e.g., be defined in a discrete space. In some examples, the muting action m(t) may, e.g., correspond with at least one muting option as depicted by FIG. 9, e.g., characterizing a number of antenna elements that are active, or any, e.g., specific, embedding, e.g., of the muting pattern.

In some examples, for a POLITE operation, it is proposed: If ϕ(t)=1 in the above example, the L2-RT (layer 2 real-time) operations invoke POLITE operation, which, in some examples, may, e.g., use the following tuple of input information denoted by Ω(t):

$Ω (t) := 〈 \hat{β} (t), m (t), Δ (t), χ (t), \bar{η} (t), P_{EIRP} (t) 〉$

where, η(t) denotes a vector of aggressive MCS values for the UEs in (t) determined w.r.t γ(t) and P_EIRP(t) denotes the EIRP constraint pertaining to angular directions being monitored. In some examples, an algorithmic implementation of the POLITE operation as, e.g., disclosed by WO 2022/028702 A1, may be used.

Note that in some examples, similar approaches to the POLITE technique, which may generally, e.g., be defined as “power-domain techniques”, may also be applied in element e5, e.g., instead of the POLITE technique. In some examples, the general idea for element e5 is that a technique performed according to element e5 may reduce the transmit power needed for a specific transmission to a UE, e.g., like the POLITE technique does.

In some examples, FIG. 2, the gNB 10 may also carry out advanced Crest Factor Reduction (CFR) based transmission, e.g., for PAPR (peak to average power ratio) reduction, thus, e.g., further improving an energy efficiency at the gNB 10, since the power amplifier(s) of the gNB 10 may operate at a power closer to their nominal power, with higher resulting energy efficiency. In some examples, e.g., since some techniques for PAPR, such as, e.g., J. Tellado and C. J. M, “Peak power reduction for multicarrier transmission,” in IEEE GLOBECOM, 1998, may demand for, e.g., specific carriers to be allocated to transmit special symbols for the purpose of reducing PAPR (also refer to the aspect of “tone reservation”), this resource usage may be properly deduced from the available resources to perform data allocation and transmission. In some examples, e.g., once the resources (and, optionally, the power limit) are known for the UEs in (t), CFR symbols can be computed using prior art techniques.

In some examples, which relate to EIRP control, an EIRP operational threshold for a slot “t” can, e.g., be determined, e.g., computed, according to, e.g., based on, at least one of: a) an expected (e.g., pessimistic, thus upper-bounded) CRF symbols power consumption on angular directions to be monitored, or b) UEs' precoding and power reduction. In some examples, based on this, the number of resources to be allocated in a data channel to each UE can be reduced, e.g., to comply with the EIRP limit in the current slot for the desired angular direction.

Some examples, FIG. 10, relate to a computer program PRG comprising instructions INSTR which, when executed by an apparatus 100, 100′, cause the apparatus 100, 100′ to perform the method according to the disclosure.

Some examples, FIG. 10, relate to a computer-readable storage medium ST-M, for example a non-transitory computer-readable storage medium ST-M, comprising the computer program PRG according to the disclosure.

Some examples, FIG. 10, relate to a data carrier signal DCS carrying and/or characterizing the computer program PRG according to the disclosure.

The principle of the disclosure enables to provide an architecture with a holistic view of energy drivers residing in a gNB 10, which manages their operations, e.g., to ensure minimum energy consumption while, e.g., primarily satisfying target QoS requirements. Additionally, the architecture enabled by the principle of the disclosure is easily implementable and real-time executable, e.g., in the qNB 10, e.g., without impacting other, e.g., regular L2 mechanisms.

The principle of the disclosure facilitates interaction among various energy driver operations associated with the qNB 10 (e.g., μDTX, mMIMO muting and POLITE), being also forward compatible with other features, like CFR and EIRP control. In some examples, this allows to control L2 packet scheduler operations to maximize the overall energy efficiency, e.g., while also controlling the Qos.

Claims

1. An apparatus for a network device, the apparatus comprising:

at least one processor; and

at least one memory storing instructions that, when executed with the at least one processor, cause the network device to: determine an expected radio resource demand; and perform, at least based on the expected radio resource demand, at least one of: a micro discontinuous transmission technique, a multiple input multiple output muting technique, or a power-domain decision.

2. The apparatus according to claim 1, wherein the instructions, when executed with the at least one processor, cause the apparatus to perform the determining upon taking a scheduling decision in a time period.

3. The apparatus according to claim 1, wherein the instructions, when executed with the at least one processor, cause the apparatus to perform the determining using an artificial intelligence model.

4. The apparatus according to claim 3, wherein the instructions, when executed with the at least one processor, cause the network device to perform at least one of: providing a convolutional neural network as the artificial intelligence model, or training the artificial intelligence model using a supervised learning approach.

5. The apparatus according to claim 1, wherein the instructions, when executed with the at least one processor, cause the network device to: model the micro discontinuous transmission technique with a first Markov decision process, wherein a state variable s(t) of the first Markov decision process is characterized with at least one of: wherein a reward function r(t) of the first Markov decision process is based on an achievable fair sum-rate Rsum for the at least one terminal device and on an energy consumption Econs associated with the network device.

the expected radio resource demand,

a signal to interference plus noise ratio associated with at least one terminal device, or c), or

at least one parameter characterizing a quality of service associated with at least one terminal device,

6. The apparatus according to claim 1, wherein the instructions, when executed with the at least one processor, cause the network device to: model the micro discontinuous transmission technique and the multiple input multiple output muting technique by with a second Markov decision process, wherein elements of an action space of the second Markov decision process characterize at least one of:

information, whether at least one of a plurality of radio frequency chains should be activated,

information how many at least one of antenna elements or radio frequency chains should be used, or

information indicating at least one of a predetermined muting pattern for multiple input multiple output muting, or a micro discontinuous transmission technique operation.

7. The apparatus according to claim 1, wherein the instructions, when executed with the at least one processor, cause the network device to: determine whether to apply at least one further technique for improving energy efficiency, and, based on the determination, apply the at least one further technique for improving energy efficiency.

8. The apparatus according to claim 7, wherein the at least one further technique for improving energy efficiency comprises at least one of:

a power domain technique for reducing a transmit power for at least one specific transmission to at least one terminal device,

a technique for reducing a crest factor, or

a technique for controlling an effective isotropic radiated power.

9. (canceled)

10. A network device for a communication system comprising at least one apparatus according to claim 1.

11. A communication system comprising: at least one apparatus according to claim 1.

12. A method for a network device, comprising:

determining an expected radio resource demand; and

performing, at least based on the expected radio resource demand, at least one of: a micro discontinuous transmission technique, a multiple input multiple output muting technique, or a power-domain decision.

13. A non-transitory program storage device readable with an apparatus, tangibly embodying a program of instructions executable with the apparatus to perform the method according to claim 12.

14. (canceled)

15. (canceled)