Summary of the invention
The object of the present invention is to provide a kind of repayment that increases cognitive base station and secondary user's effectively, and promote the implementation method of the dynamic frequency spectrum deployment of selecting based on game types of spectrum utilization.
For achieving the above object, the technical solution used in the present invention is: in communication scenes, cognitive base station (cognitive radio base station, CRBS) and secondary user's (secondary users, behavior SU) had with the next stage:
(1) CRBS proposes frequency spectrum quotation w;
(2) SU observes (and acceptance) w, selects frequency spectrum use amount L subsequently;
(3) repayment of CRBS and SU be respectively U (w, L) and π (w, L);
For any quotation w that CRBS in the phase I proposes, SU decides optimum spectrum requirement L by maximizing its repayment in second stage
*(w), promptly
Try to achieve optimum spectrum requirement L
*(w);
Backstepping is to the phase I, and CRBS selects w
*Make U (w
*, L
*(w
*)) reaching maximization, this quotation is called as the optimum quotation per unit under the dynamic game, is designated as separating of following formula:
If launch to receive assorted negotiation with regard to a frequency spectrum use between CRBS and the SU, if both sides can not reach an agreement, then both sides' utility level all is 0; If both sides reach an agreement, the utility level of CRBS is u
1, the utility level of SU is π
1, corresponding optimization aim is:
Get the CRBS optimal spectrum quotation per unit w of this moment
BWith SU optimal spectrum purchase volume L
B
The dynamic frequency spectrum deployment method is as follows:
(1) CRBS periodically announce oneself the repayment function U (w, L), SU in the networking initialization procedure, announce oneself repayment function π (w, L);
(2) CRBS proposes the frequency spectrum w that initially offers
*, this quotation is the optimum bid of the CRBS in the conventional dynamic game;
(3) SU observes w
*, and at oneself dynamic optimal spectrum use amount L
*With receive assorted bargaining solution frequency spectrum use amount L
BIn select one and report CRBS;
(4) CRBS is that SU assigns corresponding frequency spectrum quantity, but contract quoting is at L
*Be w
*, at L
BBe w
B, SU will offer as agreed and repay the frequency spectrum cost of use.
The present invention is by the repayment of more complete and perfect dynamic game and Na Shi negotiation, and cognitive base station and secondary user's can be selected optimal strategy separately, promptly optimum frequency spectrum quotation per unit and frequency spectrum purchase volume flexibly liberally.Simultaneously, for realizing both sides' selection operation, designed a kind of simple frequency spectrum access process, with traditional mobile calls access process compatibility.Can increase the repayment of cognitive base station and secondary user's effectively, and promote spectrum utilization.
Embodiment
Below in conjunction with accompanying drawing the present invention is described in further detail.
Consider the scene of single cognitive multiple access sub-district, as shown in Figure 1.Cognitive base station has the usable spectrum that obtains in secondary frequency spectrum marketing, and can reshuffle flexibly self with and the terminal communication of polytype (as types such as WCDMA, CDMA2000EV-DO).Dissimilar users (SU) needing from the communication of acquisition frequency spectrum in base station with oneself to thirst in the sub-district.Because the spectrum opportunities of CRBS is paid acquisition,, SU pays a price so being necessary for the frequency spectrum use of oneself.CRBS proposes the unit frequency spectrum and uses price w, has used the SU of L unit's frequency spectrum to need to pay expense wL to CRBS.
The repayment function of supposing CRBS be U (w, L).Obviously
Be that U is the increasing function of quotation w and use amount L.The repayment function of SU is that (w, L)=R (L)-wL, wherein R (L) is the communication revenue function of SU to π.Obviously, income is big more more at most for the frequency spectrum that SU uses, but the marginal benefit that its frequency spectrum uses is then more and more littler.Therefore, R (L) is the increasing function of L and is concave function.R ' (0)=∞, R ' (∞)=0, promptly R ' (L) ∈ [0, ∞).
In the communication scenes of routine, the behavior of CRBS and SU had with the next stage:
(1) CRBS proposes frequency spectrum quotation w;
(2) SU observes (and acceptance) w, selects frequency spectrum use amount L subsequently;
(3) repayment of CRBS and SU be respectively U (w, L) and π (w, L).
Therefore, CRBS and SU all are faced with and how obtain maximum repayment in the frequency spectrum transaction.
For SU, it is clear know CRBS repayment U (w L) is equal to the in-and-out price difference of frequency spectrum, promptly has:
U(w,L)=(w-w
0)L
W wherein
0Be CRBS per unit frequency spectrum secondary market transaction cost.Obviously, w 〉=w
0〉=0.
CRBS can be configured to different mode and communicate by letter with SU, for example from the WCDMA mode switch be the CDMA2000 pattern to satisfy the access request of certain EV-DO type SU, the clear complete characteristic of knowing the CDMA2000 terminal of CRBS then, therefore, it also knows repayment function π (w, structure L) of SU.Simply, order:
π(w,L)=R(L)-wL
The repayment function (net profit) of SU equals frequency spectrum and uses income R (L) to deduct cost of use wL.It is the increasing function of L that the frequency spectrum of SU uses income, but its growth slows down gradually.This can explain like this: have more frequency spectrum, SU just can carry out more business (as video calling) or improve the professional income (as reducing power consumption) that obtains; But, L hour, the frequency spectrum purchase volume can not satisfy the business of SU carries out, and increases purchase volume and can improve business income significantly; When L was big, the traffic load of SU was saturated substantially, increased purchase volume and promoted very little to improving business income.
Because the version of repayment function is known to the both sides before transaction beginning and CRBS and SU all can observe the other side's action, sees that according to game theoretic viewpoint this communication scenes is a complete and perfect dynamic game.Player 1 is CRBS, and its strategic space is { to propose frequency spectrum quotation w}; Player 2 is SU, and its strategic space is { decision frequency spectrum use amount L}.
For complete and perfect dynamic game, can use reverse induction to find the solution.
For any quotation w that CRBS in the phase I proposes, SU decides optimum spectrum requirement L by maximizing its repayment in second stage
*(w), promptly
By first-order condition R ' (L)=w solves:
Backstepping is to the phase I now, and CRBS selects then w
*Make U (w
*, L
*(w
*)) reach maximization, promptly CRBS in the target of phase I is
Can obtain best CRBS quotation and corresponding SU use amount is:
Then after complete and perfect information game, both sides' income is:
Fig. 2 has provided the picture specification of above solution procedure.U
2And U
0Be the indifference curve of CRBS, on the indifference curve all each points (w, L) combination brings identical repayment to CRBS, obviously U
2>U
0π
2And π
0Be the indifference curve (waiting acknowledge line) of SU, when using identical frequency spectrum quantity (income is identical), the quotation that the upper curve correspondence is bigger, obviously π
2>π
0L
*(w) be the optimal spectrum purchase volume of the different quotations of SU reply CRBS, therefore, curve L
*(w) and the point of contact of CRBS indifference curve be exactly the equilibrium point of game.
Obviously, if the combined spot of w and L is arranged in Fig. 2 dash area, promptly the quotation of CRBS is than w
*Reduce, the frequency spectrum purchase volume of SU is than L
*(w
*) increase, then the repayment of CRBS and SU all can improve simultaneously, and just new combined spot is that Pareto is effective with respect to the equilibrium point.
Carrying out unlimited repeated game can be so that the game both sides selects effectively strategy combination of Pareto, and still, the probability that the mobility of SU and professional bursting property make unlimited repeated game take place is very little.A more real model is to receive assorted bargaining model, also is to use the expansion consultation with regard to a frequency spectrum between CRBS and the SU, if both sides can not reach an agreement, then both sides' utility level all is 0; If both sides reach an agreement, the utility level of CRBS is u
1, the utility level of SU is π
1Corresponding optimization aim is:
In view of U (w, L) and π (w L) is continuous function while bounded on the domain of definition, so must there be optimal solution in the problems referred to above, and the assorted negotiation of promptly receiving can obtain (u
1, π
1), the both sides that then participate in the frequency spectrum transaction have two selections, the dynamic game or the assorted negotiation of receiving.
Find the solution this optimization problem, can be at point
Obtain maximum:
The cotype dynamic game compares, and both sides' effectiveness that the assorted negotiation of obviously receiving obtains is better than the effectiveness of dynamic game.This mainly is the quotation that reduces oneself by CRBS, has encouraged the frequency spectrum of SU to use, thereby improved the repayment of oneself under the prerequisite of not damaging the SU repayment.
The Ruo Nashi negotiation is more excellent separating, and then CRBS or SU wish participating in this negotiation.Wish that as CRBS SU can improve the frequency spectrum use amount when oneself reduces quotation.But the analyzing examples of front shows, this moment, SU can not make a profit the assorted negotiation from receiving, and therefore, its strategy can be to keep original purchase volume, and this can damage the interests of CRBS.Therefore, need introducing certain constraint makes both sides participate under the prerequisite of not damaging number one to receive in the assorted negotiation.
For this reason, introduce new spectrum allocation may flow process:
(1) CRBS periodically announce oneself the repayment function U (w, L), SU in the networking initialization procedure, announce oneself repayment function π (w, L);
(2) CRBS proposes the frequency spectrum w that initially offers
*
(3) SU observes w
*, calculate and at oneself dynamic optimal spectrum use amount L
*With receive assorted bargaining solution frequency spectrum use amount L
BIn select one and report CRBS;
(4) CRBS is that SU assigns corresponding frequency spectrum quantity, but contract quoting is at L
*Be w
*, at L
BBe w
BSU will offer as agreed and repay the frequency spectrum cost of use.
The frequency spectrum that SU uses is that CRBS distributes, so its purchase volume need be through the affirmation of CRBS.As provide the long-term communication serving base station can be for the profit of once transaction wrong valuation (demand L
BThe time by the quotation w
*Charge) or the wrong frequency spectrum quantity of paying, can damage oneself prestige like this.Therefore, it is feasible receiving assorted negotiation.
Analyze now the optimal policy of CRBS and SU under general situation.Both sides' repayment function is as follows:
U(w,L)=(w-w
0)L
π(w,L)=R(L)-wL
R(L)=L
α?α∈(0,1)
Be cost price w
0Be variable, the income of SU also is variable.Then in dynamic game, quotation and the frequency spectrum purchase volume that can obtain the best are:
Corresponding both sides' repayment this moment is:
Receiving assorted negotiation back best quotation and frequency spectrum purchase volume be:
Corresponding both sides' repayment this moment is:
From top formula as can be seen, no matter be dynamic game or receive assorted negotiation, given both sides' repayment function, it is very little that the player makes more required amount of calculation.Therefore, the selection of this spectrum allocation may strategy can be carried out fast in existing type equipment and base station survey.
Can prove that for SU the superiority of assorted negotiation received depends on the parameter alpha in the revenue function.And CRBS always wishes to receive assorted negotiation.Less cost price can encourage frequency spectrum to use simultaneously.This provides an instrument that the regulation and control frequency spectrum uses for frequency spectrum supervision mandate department.
Utilize Matlab software that the conclusion that obtains is previously carried out the system emulation checking.The cost price that Fig. 3 has provided at CRBS is 1 o'clock, and the variation of α is to the influence of CRBS and the final repayment of SU.As can be seen, when α<0.5, SU tends to dynamic game, and in α>0.5 o'clock, the assorted negotiation that participates in receiving will bring more income for SU.This is consistent with the conclusion in 3.4.For CRBS, the repayment of negotiation (note the repayment curve of this moment and SU negotiation repayment curve overlap) always be better than dynamic game.
The cost price that Fig. 4 and Fig. 5 have provided respectively at CRBS is 1 o'clock, and the variation of α is to the influence of SU frequency spectrum purchase volume and CRBS quotation per unit.As can be seen, the quotation the when quotation of CRBS when participating in receiving assorted negotiation always is lower than dynamic game, corresponding therewith is, SU always is higher than the purchase volume of dynamic game receiving frequency spectrum purchase volume after the assorted negotiation.This has verified that the analysis conclusion of Fig. 2: CRBS can reduce bid and use more frequency spectrum to stimulate SU, thereby improves the repayment (referring to Fig. 3) of self.